Dr Yipeng Qin
BSc (Hons), PhD, FHEA
- Available for postgraduate supervision
Teams and roles for Yipeng Qin
Senior Lecturer; Deputy Director of Research
Overview
I am the Deputy Director of Research, Lead of the Computer Vision Research Group of the Visual Computing research section, and a Senior Lecturer (Associate Professor) in the School of Computer Science and Informatics. I am also a member of the EPSRC Peer Review College. I received my PhD in Computer Science from the National Centre for Computer Animation (NCCA), Bournemouth University, UK in 2017, and my BSc in Electrical Engineering from Shanghai Jiao Tong University, China in 2013.
My research interests lie at the intersection of machine learning and its applications in computer vision, computer graphics, and human–computer interaction. Currently, my work focuses on three main themes: (i) opening the deep learning "black box", (ii) advancing creative artificial intelligence (AI), and (iii) developing wearable prototypes for motion monitoring and analysis. Beyond these areas, I am also engaged in related topics such as semantic segmentation, domain adaptation, and semi-supervised learning. My research has been recognized with the SIGGRAPH 2025 Best Paper Award (News Cover Image) and the CVPR 2024 Best Paper Award Candidate.
All kinds of collaboration are welcome!
For CSC applicants: Cardiff University benefits from an official partnership with China Scholarship Council (CSC). If you are interested in doing research with me, please contact me via email as soon as possible as additional deadlines/steps may apply.
Publication
2026
- Meng, Z. et al., 2026. Improving sparse IMU-based motion capture with motion label smoothing. Presented at: The 40th Annual AAAI Conference on Artificial Intelligence (AAAI) 2026 Singapore 20-27 January, 2026.
2025
- Alwadee, E. J. et al. 2025. LATUP-Net: A lightweight 3D attention U-Net with parallel convolutions for brain tumor segmentation. Computers in Biology and Medicine 184 109353. (10.1016/j.compbiomed.2024.109353)
- He, Z. et al., 2025. VTON 360: High-fidelity virtual try-on from any viewing direction. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, USA 11-15 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.26388-26398. (10.1109/CVPR52734.2025.02457)
- Jiang, Z. , Qin, Y. and Finnegan, D. 2025. ‘Chattable’ Avatars: Using LLMs to power visitor engagement with historical persons. Presented at: BCS 38th International Conference on Human Computer Interaction Cardiff, Wales 09 - 11 November. Proceedings of the 38th International BCS Human-Computer Interaction Conference. British Computer Society.
- Lai, P. et al., 2025. LLM-driven multimodal and multi-identity listening head generation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, USA 11 - 15 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.10656-10666. (10.1109/CVPR52734.2025.00996)
- Ren, T. et al., 2025. Diverse motion in-betweening from sparse keyframes with dual posture stitching. IEEE Transactions on Visualization and Computer Graphics 31 (2), pp.1402-1413. (10.1109/TVCG.2024.3363457)
- Wu, Y. , Guo, S. and Qin, Y. 2025. MODA: Motion-drift augmentation for inertial human motion analysis. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, TN, USA 10-17 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.27771-27781. (10.1109/CVPR52734.2025.02586)
- Wu, Z. et al., 2025. Hierarchically controlled deformable 3D gaussians for talking head synthesis. Presented at: The 39th Annual AAAI Conference on Artificial Intelligence (AAAI) 2025 Pennsylvania, USA 25 February – 04 March 2025. Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 39(8).Association for the Advancement of Artificial Intelligence. , pp.8532-8540. (10.1609/aaai.v39i8.32921)
- Yao, Y. et al., 2025. ToF-IP: time-of-flight enhanced sparse inertial poser for real-time human motion capture. Presented at: The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) San Diego, California, USA 2-7 December 2025.
- Ying, E. et al., 2025. WristSketcher: Creating 2D dynamic sketches in AR with a sensing wristband. International Journal of Human-Computer Interaction 41 (1), pp.557-573. (10.1080/10447318.2024.2301857)
- Zuo, C. et al., 2025. Transformer IMU calibrator: Dynamic on-body IMU calibration for inertial motion capture. Presented at: SIGGRAPH 2025 Vancouver, Canada 10-14 August 2025. Vol. 44.Vol. 4. New York, NY, USA: Association for Computing Machinery. , pp.45-45. (10.1145/3730937)
2024
- Alshewaier, H. , Qin, Y. and Sun, X. 2024. (ExMod) model for medical image segmentation using scribble annotations. Presented at: The 5th International Conference on Medical Imaging and Computer-Aided Diagnosis Manchester, UK 19-21 November 2024. Published in: Su, R. and Frangi, A. F. eds. Proceedings of 2024 International Conference on Medical Imaging and Computer-Aided Diagnosis. Vol. 1372.Lecture Notes in Electrical Engineering Singapore: Springer. , pp.133-143. (10.1007/978-981-96-3863-5_13)
- Alwadee, E. et al. 2024. Assessing and enhancing the robustness of brain tumor segmentation using a probabilistic deep learning architecture. Presented at: 2024 ISMRM & ISMRT Annual Meeting & Exhibition Singapore 4-9 May 2024. Proceedings 2024 ISMRM & ISMRT Annual Meeting & Exhibition. Vol. 4526. International Society for Magnetic Resonance in Medicine. , pp.1-6.
- Chen, J. et al., 2024. NeRF-HuGS: Improved neural radiance fields in non-static scenes using heuristics-guided segmentation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, WA, USA 17-21 June 2024. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.19436-19446. (10.1109/CVPR52733.2024.01838)
- Chen, X. et al., 2024. Full-body human motion reconstruction with sparse joint tracking using flexible sensors. ACM Transactions on Multimedia Computing, Communications and Applications 20 (2) 44. (10.1145/3564700)
- Fang, J. et al., 2024. SuDA: Support-based domain adaptation for Sim2Real hinge joint tracking with flexible sensors. Presented at: The Forty-First International Conference on Machine Learning (ICML) Vienna, Austria 21 - 27 July 2024. Published in: Salakhutdinov, R. et al., Proceedings of the 41st International Conference on Machine Learning. Vol. 235.ML Research Press. , pp.22042-22061.
- Hou, B. et al., 2024. DCCTNet: Kidney tumors segmentation based on dual-level combination of CNN and transformer. Presented at: IEEE International Conference on Image Processing (ICIP 2024) Abu Dhabi, United Arab Emirates 27-30 October 2024. Proceedings of International Conference on Image Processing. IEEE. , pp.3112-3116. (10.1109/ICIP51287.2024.10647912)
- Liang, Y. et al. 2024. Deep generative model based rate-distortion for image downscaling assessment. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, WA, USA 17-21 June 2024. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.19363-19372. (10.1109/CVPR52733.2024.01832)
- Liang, Y. et al. 2024. Efficient precision and recall metrics for assessing generative models using hubness-aware sampling. Presented at: The Forty-first International Conference on Machine Learning (ICML) Vienna, Austria 21-27 July 2024. Vol. 235., pp.29682-29699.
- Ning, S. et al., 2024. PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsign. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, USA 16-22 June 2024. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. , pp.6976-6985. (10.1109/CVPR52733.2024.00666)
- Wu, Y. et al., 2024. Accurate and steady inertial pose estimation through sequence structure learning and modulation. Presented at: Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) Vancouver, Canada 10-15 December 2024.
- Zhan, L. et al., 2024. SATPose: Improving monocular 3D pose estimation with spatial-aware ground tactility. Presented at: ACM Multimedia 2024 Melbourne, Australia 28 October - 1 November 2024. MM '24: Proceedings of the 32nd ACM International Conference on Multimedia. ACM. , pp.6192-6201. (10.1145/3664647.3681654)
- Zhao, G. et al., 2024. Exploration and exploitation of unlabeled data for open-set semi-supervised learning. International Journal of Computer Vision 132 , pp.5888-5904. (10.1007/s11263-024-02155-y)
- Zuo, C. et al., 2024. Loose inertial poser: Motion capture with IMU-attached loose-wear jacket. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, USA 17-21 June 2024. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. , pp.2209-2219. (10.1109/CVPR52733.2024.00215)
2023
- Fang, F. et al., 2023. Handwriting velcro: Endowing AR glasses with personalized and posture-adaptive text input using flexible touch sensor. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 6 (4), pp.1-31. 163. (10.1145/3569461)
- Huang, R. et al., 2023. Parametric implicit face representation for audio-driven facial reenactment. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Vancouver, Canada 18 - 22 June 2023.
- Jones, O. , Poudevigne-Durance, T. and Qin, Y. 2023. Synthesis of time-series with missing observations using generative adversarial networks. Presented at: 34th Panhellenic Statistics Conference 19-22 May 2022. Greek Statistical Institute. , pp.154-166.
- Song, S. et al. 2023. Feature proliferation — the "cancer" in StyleGAN and its treatments. Presented at: International Conference on Computer Vision (ICCV) 2023 Paris, France October 1 - 6, 2023. Proceedings of IEEE/CVF International Conference on Computer Vision. IEEE. , pp.2360-2370. (10.1109/ICCV51070.2023.00224)
- Wang, K. et al., 2023. Computational design of wiring layout on tight suits with minimal motion resistance. Presented at: The 16th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2023) Sydney, Australia 12 - 15 December 2023. Published in: Kim, J. , Lin, M. C. and Bickel, B. eds. SA '23: SIGGRAPH Asia 2023 Conference Papers. New York: Association for Computing Machinery. , pp.1-12. (10.1145/3610548.3618200)
- Yan, Z. et al., 2023. Universal semi-supervised model adaptation via collaborative consistency training. Presented at: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024) Waikoloa, Hawaii, United States 4 - 8 January 2024.
- Zhan, L. et al., 2023. TouchEditor: Interaction design and evaluation of a flexible touchpad for text editing of head-mounted displays in speech-unfriendly environments. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 7 (4), pp.1-29. 198. (10.1145/3631454)
- Zhao, G. et al., 2023. Improved distribution matching for dataset condensation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Vancouver, Canada 18 - 22 June 2023.
- Zhao, X. et al. 2023. CUDAS: Distortion-aware saliency benchmark. IEEE Access 11 , pp.58025-58036. (10.1109/ACCESS.2023.3283344)
- Zhou, W. et al. 2023. Reduced-reference quality assessment of point clouds via content-oriented saliency projection. IEEE Signal Processing Letters 30 , pp.354-358. (10.1109/LSP.2023.3264105)
- Zuo, C. et al., 2023. Self-adaptive motion tracking against on-body displacement of flexible sensors. Presented at: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023) New Orleans, United States 10 - 16 December 2023. Published in: Oh, A. et al., Proceedings of the 37th Conference on Neural Information Processing Systems. Vol. 36.Neural information processing systems foundation. , pp.198465-198465.
2022
- Chen, C. et al., 2022. Real-world blind super-resolution via feature matching with implicit high-resolution priors. Presented at: the 30th ACM International Conference on Multimedia (ACMMM 2022) Lisbon, Portugal 10 - 14 October 2022. Proceedings of the 30th ACM International Conference on Multimedia (ACMMM 2022). ACM. , pp.1329-1338. (10.1145/3503161.3547833)
- Liang, Y. et al. 2022. Exploring and exploiting hubness priors for high-quality GAN latent sampling. Presented at: The 39th International Conference on Machine Learning (ICML 2022) Baltimore, Maryland USA 17-23 July 2022. Vol. 162.ML Research Press. , pp.13271-13284.
- Poudevigne-Durance, T. , Jones, O. D. and Qin, Y. 2022. MaWGAN: a generative adversarial network to create synthetic data from datasets with missing data. Electronics 11 (6) 837. (10.3390/electronics11060837)
- Yan, Z. et al., 2022. Multi-level consistency learning for semi-supervised domain adaptation. Presented at: 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022) Vienna, Austria 23-29 July 2022. Published in: De Raedt, L. ed. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization. , pp.1530-1536. (10.24963/ijcai.2022/213)
- Yu, X. et al., 2022. PVSeRF: joint pixel-, voxel- and surface-aligned radiance field for single-image novel view synthesis. Presented at: 30th ACM International Conference on Multimedia (ACMMM 2022) Lisbon, Portugal 10 - 14 October 2022. Proceedings of the 30th ACM International Conference on Multimedia. New York: ACM. , pp.1572-1583. (10.1145/3503161.3547893)
- Zhao, G. et al., 2022. Centrality and consistency: two-stage clean samples identification for learning with instance-dependent noisy labels. Presented at: European Conference on Computer Vision (ECCV 2022) Tel Aviv, Israel 23-27 October 2022. Published in: Avidan, S. et al., Proceedings of the Computer Vision – ECCV 2022. IEEE. , pp.21-37. (10.1007/978-3-031-19806-9_2)
2021
- Chen, Z. et al., 2021. Human posture tracking with flexible sensors for motion recognition. Computer Animation and Virtual Worlds (10.1002/cav.1993)
- Su, J. et al., 2021. Correcting corrupted labels using mode dropping of ACGAN. Presented at: 15th International Symposium on Medical Information and Communication Technology (ISMICT 2021) Xiamen, China 14-16 April 2021. 2021 15th International Symposium on Medical Information and Communication Technology (ISMICT). IEEE. , pp.98-103. (10.1109/ISMICT51748.2021.9434911)
- Yan, Z. et al., 2021. Pixel-level intra-domain adaptation for semantic segmentation. Presented at: ACM Multimedia 2021 Chengdu, China 20-24 October 2021. MM '21: Proceedings of the 29th ACM International Conference on Multimedia. ACM. , pp.404-413. (10.1145/3474085.3475174)
- Zhu, Z. et al., 2021. Robust elbow angle prediction with aging soft sensors via output-level domain adaptation. IEEE Sensors Journal 21 (20), pp.22976-22984. (10.1109/JSEN.2021.3091004)
2020
- Abdal, R. , Qin, Y. and Wonka, P. 2020. Image2StyleGAN++: how to edit the embedded images?. Presented at: Conference on Computer Vision and Pattern Recognition (CVPR 2020) Seattle, Washington, USA 16-18 June 2020. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.8293-8302. (10.1109/CVPR42600.2020.00832)
- Qin, Y. , Mitra, N. and Wonka, P. 2020. How does Lipschitz regularization influence GAN training?. Presented at: 16th European Conference on Computer Vision (ECCV 2020) Glasgow, Scotland 23-28 August 2020. Published in: Vevaldi, A. et al., Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVI. Lecture Notes in Computer Science Springer. , pp.310-326. (10.1007/978-3-030-58517-4_19)
- Zhu, P. et al., 2020. SEAN: image synthesis with semantic region-adaptive normalization. Presented at: Conference on Computer Vision and Pattern Recognition (CVPR 2020) Seattle, Washington, USA 14-19 June 2020. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. , pp.5103-5112. (10.1109/CVPR42600.2020.00515)
2019
- Abdal, R. , Qin, Y. and Wonka, P. 2019. Image2StyleGAN: How to embed images into the styleGAN latent space?. Presented at: International Conference on Computer Vision (ICCV) 2019 Seoul, South Korea 27 October 2019 - 3 November 2019. Proceedings of the International Conference on Computer Vision (ICCV) 2019. IEEE. , pp.4431-4440. (\10.1109/ICCV.2019.00453)
2017
- Qin, Y. , Yu, H. and Zhang, J. 2017. Fast and memory-efficient Voronoi diagram construction on triangle meshes. Computer Graphics Forum 36 (5), pp.93-104. (10.1111/cgf.13248)
2016
- Qin, Y. et al. 2016. Fast and exact discrete geodesic computation based on triangle-oriented wavefront propagation. ACM Transactions on Graphics 35 (4) 125. (10.1145/2897824.2925930)
2015
- Yu, H. , Qin, Y. and Zhang, J. J. 2015. Eigenspace-based surface completeness. Journal of Electronic Imaging 24 (2) 023037. (10.1117/1.JEI.24.2.023037)
Articles
- Alwadee, E. J. et al. 2025. LATUP-Net: A lightweight 3D attention U-Net with parallel convolutions for brain tumor segmentation. Computers in Biology and Medicine 184 109353. (10.1016/j.compbiomed.2024.109353)
- Chen, X. et al., 2024. Full-body human motion reconstruction with sparse joint tracking using flexible sensors. ACM Transactions on Multimedia Computing, Communications and Applications 20 (2) 44. (10.1145/3564700)
- Chen, Z. et al., 2021. Human posture tracking with flexible sensors for motion recognition. Computer Animation and Virtual Worlds (10.1002/cav.1993)
- Fang, F. et al., 2023. Handwriting velcro: Endowing AR glasses with personalized and posture-adaptive text input using flexible touch sensor. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 6 (4), pp.1-31. 163. (10.1145/3569461)
- Poudevigne-Durance, T. , Jones, O. D. and Qin, Y. 2022. MaWGAN: a generative adversarial network to create synthetic data from datasets with missing data. Electronics 11 (6) 837. (10.3390/electronics11060837)
- Qin, Y. et al. 2016. Fast and exact discrete geodesic computation based on triangle-oriented wavefront propagation. ACM Transactions on Graphics 35 (4) 125. (10.1145/2897824.2925930)
- Qin, Y. , Yu, H. and Zhang, J. 2017. Fast and memory-efficient Voronoi diagram construction on triangle meshes. Computer Graphics Forum 36 (5), pp.93-104. (10.1111/cgf.13248)
- Ren, T. et al., 2025. Diverse motion in-betweening from sparse keyframes with dual posture stitching. IEEE Transactions on Visualization and Computer Graphics 31 (2), pp.1402-1413. (10.1109/TVCG.2024.3363457)
- Ying, E. et al., 2025. WristSketcher: Creating 2D dynamic sketches in AR with a sensing wristband. International Journal of Human-Computer Interaction 41 (1), pp.557-573. (10.1080/10447318.2024.2301857)
- Yu, H. , Qin, Y. and Zhang, J. J. 2015. Eigenspace-based surface completeness. Journal of Electronic Imaging 24 (2) 023037. (10.1117/1.JEI.24.2.023037)
- Zhan, L. et al., 2023. TouchEditor: Interaction design and evaluation of a flexible touchpad for text editing of head-mounted displays in speech-unfriendly environments. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 7 (4), pp.1-29. 198. (10.1145/3631454)
- Zhao, G. et al., 2024. Exploration and exploitation of unlabeled data for open-set semi-supervised learning. International Journal of Computer Vision 132 , pp.5888-5904. (10.1007/s11263-024-02155-y)
- Zhao, X. et al. 2023. CUDAS: Distortion-aware saliency benchmark. IEEE Access 11 , pp.58025-58036. (10.1109/ACCESS.2023.3283344)
- Zhou, W. et al. 2023. Reduced-reference quality assessment of point clouds via content-oriented saliency projection. IEEE Signal Processing Letters 30 , pp.354-358. (10.1109/LSP.2023.3264105)
- Zhu, Z. et al., 2021. Robust elbow angle prediction with aging soft sensors via output-level domain adaptation. IEEE Sensors Journal 21 (20), pp.22976-22984. (10.1109/JSEN.2021.3091004)
Conferences
- Abdal, R. , Qin, Y. and Wonka, P. 2020. Image2StyleGAN++: how to edit the embedded images?. Presented at: Conference on Computer Vision and Pattern Recognition (CVPR 2020) Seattle, Washington, USA 16-18 June 2020. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.8293-8302. (10.1109/CVPR42600.2020.00832)
- Abdal, R. , Qin, Y. and Wonka, P. 2019. Image2StyleGAN: How to embed images into the styleGAN latent space?. Presented at: International Conference on Computer Vision (ICCV) 2019 Seoul, South Korea 27 October 2019 - 3 November 2019. Proceedings of the International Conference on Computer Vision (ICCV) 2019. IEEE. , pp.4431-4440. (\10.1109/ICCV.2019.00453)
- Alshewaier, H. , Qin, Y. and Sun, X. 2024. (ExMod) model for medical image segmentation using scribble annotations. Presented at: The 5th International Conference on Medical Imaging and Computer-Aided Diagnosis Manchester, UK 19-21 November 2024. Published in: Su, R. and Frangi, A. F. eds. Proceedings of 2024 International Conference on Medical Imaging and Computer-Aided Diagnosis. Vol. 1372.Lecture Notes in Electrical Engineering Singapore: Springer. , pp.133-143. (10.1007/978-981-96-3863-5_13)
- Alwadee, E. et al. 2024. Assessing and enhancing the robustness of brain tumor segmentation using a probabilistic deep learning architecture. Presented at: 2024 ISMRM & ISMRT Annual Meeting & Exhibition Singapore 4-9 May 2024. Proceedings 2024 ISMRM & ISMRT Annual Meeting & Exhibition. Vol. 4526. International Society for Magnetic Resonance in Medicine. , pp.1-6.
- Chen, C. et al., 2022. Real-world blind super-resolution via feature matching with implicit high-resolution priors. Presented at: the 30th ACM International Conference on Multimedia (ACMMM 2022) Lisbon, Portugal 10 - 14 October 2022. Proceedings of the 30th ACM International Conference on Multimedia (ACMMM 2022). ACM. , pp.1329-1338. (10.1145/3503161.3547833)
- Chen, J. et al., 2024. NeRF-HuGS: Improved neural radiance fields in non-static scenes using heuristics-guided segmentation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, WA, USA 17-21 June 2024. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.19436-19446. (10.1109/CVPR52733.2024.01838)
- Fang, J. et al., 2024. SuDA: Support-based domain adaptation for Sim2Real hinge joint tracking with flexible sensors. Presented at: The Forty-First International Conference on Machine Learning (ICML) Vienna, Austria 21 - 27 July 2024. Published in: Salakhutdinov, R. et al., Proceedings of the 41st International Conference on Machine Learning. Vol. 235.ML Research Press. , pp.22042-22061.
- He, Z. et al., 2025. VTON 360: High-fidelity virtual try-on from any viewing direction. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, USA 11-15 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.26388-26398. (10.1109/CVPR52734.2025.02457)
- Hou, B. et al., 2024. DCCTNet: Kidney tumors segmentation based on dual-level combination of CNN and transformer. Presented at: IEEE International Conference on Image Processing (ICIP 2024) Abu Dhabi, United Arab Emirates 27-30 October 2024. Proceedings of International Conference on Image Processing. IEEE. , pp.3112-3116. (10.1109/ICIP51287.2024.10647912)
- Huang, R. et al., 2023. Parametric implicit face representation for audio-driven facial reenactment. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Vancouver, Canada 18 - 22 June 2023.
- Jiang, Z. , Qin, Y. and Finnegan, D. 2025. ‘Chattable’ Avatars: Using LLMs to power visitor engagement with historical persons. Presented at: BCS 38th International Conference on Human Computer Interaction Cardiff, Wales 09 - 11 November. Proceedings of the 38th International BCS Human-Computer Interaction Conference. British Computer Society.
- Jones, O. , Poudevigne-Durance, T. and Qin, Y. 2023. Synthesis of time-series with missing observations using generative adversarial networks. Presented at: 34th Panhellenic Statistics Conference 19-22 May 2022. Greek Statistical Institute. , pp.154-166.
- Lai, P. et al., 2025. LLM-driven multimodal and multi-identity listening head generation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, USA 11 - 15 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.10656-10666. (10.1109/CVPR52734.2025.00996)
- Liang, Y. et al. 2024. Deep generative model based rate-distortion for image downscaling assessment. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, WA, USA 17-21 June 2024. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.19363-19372. (10.1109/CVPR52733.2024.01832)
- Liang, Y. et al. 2024. Efficient precision and recall metrics for assessing generative models using hubness-aware sampling. Presented at: The Forty-first International Conference on Machine Learning (ICML) Vienna, Austria 21-27 July 2024. Vol. 235., pp.29682-29699.
- Liang, Y. et al. 2022. Exploring and exploiting hubness priors for high-quality GAN latent sampling. Presented at: The 39th International Conference on Machine Learning (ICML 2022) Baltimore, Maryland USA 17-23 July 2022. Vol. 162.ML Research Press. , pp.13271-13284.
- Meng, Z. et al., 2026. Improving sparse IMU-based motion capture with motion label smoothing. Presented at: The 40th Annual AAAI Conference on Artificial Intelligence (AAAI) 2026 Singapore 20-27 January, 2026.
- Ning, S. et al., 2024. PICTURE: PhotorealistIC virtual Try-on from UnconstRained dEsign. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, USA 16-22 June 2024. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. , pp.6976-6985. (10.1109/CVPR52733.2024.00666)
- Qin, Y. , Mitra, N. and Wonka, P. 2020. How does Lipschitz regularization influence GAN training?. Presented at: 16th European Conference on Computer Vision (ECCV 2020) Glasgow, Scotland 23-28 August 2020. Published in: Vevaldi, A. et al., Computer Vision – ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVI. Lecture Notes in Computer Science Springer. , pp.310-326. (10.1007/978-3-030-58517-4_19)
- Song, S. et al. 2023. Feature proliferation — the "cancer" in StyleGAN and its treatments. Presented at: International Conference on Computer Vision (ICCV) 2023 Paris, France October 1 - 6, 2023. Proceedings of IEEE/CVF International Conference on Computer Vision. IEEE. , pp.2360-2370. (10.1109/ICCV51070.2023.00224)
- Su, J. et al., 2021. Correcting corrupted labels using mode dropping of ACGAN. Presented at: 15th International Symposium on Medical Information and Communication Technology (ISMICT 2021) Xiamen, China 14-16 April 2021. 2021 15th International Symposium on Medical Information and Communication Technology (ISMICT). IEEE. , pp.98-103. (10.1109/ISMICT51748.2021.9434911)
- Wang, K. et al., 2023. Computational design of wiring layout on tight suits with minimal motion resistance. Presented at: The 16th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2023) Sydney, Australia 12 - 15 December 2023. Published in: Kim, J. , Lin, M. C. and Bickel, B. eds. SA '23: SIGGRAPH Asia 2023 Conference Papers. New York: Association for Computing Machinery. , pp.1-12. (10.1145/3610548.3618200)
- Wu, Y. , Guo, S. and Qin, Y. 2025. MODA: Motion-drift augmentation for inertial human motion analysis. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 Nashville, TN, USA 10-17 June 2025. Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE. , pp.27771-27781. (10.1109/CVPR52734.2025.02586)
- Wu, Y. et al., 2024. Accurate and steady inertial pose estimation through sequence structure learning and modulation. Presented at: Thirty-Eighth Annual Conference on Neural Information Processing Systems (NeurIPS 2024) Vancouver, Canada 10-15 December 2024.
- Wu, Z. et al., 2025. Hierarchically controlled deformable 3D gaussians for talking head synthesis. Presented at: The 39th Annual AAAI Conference on Artificial Intelligence (AAAI) 2025 Pennsylvania, USA 25 February – 04 March 2025. Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 39(8).Association for the Advancement of Artificial Intelligence. , pp.8532-8540. (10.1609/aaai.v39i8.32921)
- Yan, Z. et al., 2022. Multi-level consistency learning for semi-supervised domain adaptation. Presented at: 31st International Joint Conference on Artificial Intelligence (IJCAI-ECAI 2022) Vienna, Austria 23-29 July 2022. Published in: De Raedt, L. ed. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. International Joint Conferences on Artificial Intelligence Organization. , pp.1530-1536. (10.24963/ijcai.2022/213)
- Yan, Z. et al., 2023. Universal semi-supervised model adaptation via collaborative consistency training. Presented at: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024) Waikoloa, Hawaii, United States 4 - 8 January 2024.
- Yan, Z. et al., 2021. Pixel-level intra-domain adaptation for semantic segmentation. Presented at: ACM Multimedia 2021 Chengdu, China 20-24 October 2021. MM '21: Proceedings of the 29th ACM International Conference on Multimedia. ACM. , pp.404-413. (10.1145/3474085.3475174)
- Yao, Y. et al., 2025. ToF-IP: time-of-flight enhanced sparse inertial poser for real-time human motion capture. Presented at: The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025) San Diego, California, USA 2-7 December 2025.
- Yu, X. et al., 2022. PVSeRF: joint pixel-, voxel- and surface-aligned radiance field for single-image novel view synthesis. Presented at: 30th ACM International Conference on Multimedia (ACMMM 2022) Lisbon, Portugal 10 - 14 October 2022. Proceedings of the 30th ACM International Conference on Multimedia. New York: ACM. , pp.1572-1583. (10.1145/3503161.3547893)
- Zhan, L. et al., 2024. SATPose: Improving monocular 3D pose estimation with spatial-aware ground tactility. Presented at: ACM Multimedia 2024 Melbourne, Australia 28 October - 1 November 2024. MM '24: Proceedings of the 32nd ACM International Conference on Multimedia. ACM. , pp.6192-6201. (10.1145/3664647.3681654)
- Zhao, G. et al., 2022. Centrality and consistency: two-stage clean samples identification for learning with instance-dependent noisy labels. Presented at: European Conference on Computer Vision (ECCV 2022) Tel Aviv, Israel 23-27 October 2022. Published in: Avidan, S. et al., Proceedings of the Computer Vision – ECCV 2022. IEEE. , pp.21-37. (10.1007/978-3-031-19806-9_2)
- Zhao, G. et al., 2023. Improved distribution matching for dataset condensation. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023 Vancouver, Canada 18 - 22 June 2023.
- Zhu, P. et al., 2020. SEAN: image synthesis with semantic region-adaptive normalization. Presented at: Conference on Computer Vision and Pattern Recognition (CVPR 2020) Seattle, Washington, USA 14-19 June 2020. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. , pp.5103-5112. (10.1109/CVPR42600.2020.00515)
- Zuo, C. et al., 2023. Self-adaptive motion tracking against on-body displacement of flexible sensors. Presented at: Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023) New Orleans, United States 10 - 16 December 2023. Published in: Oh, A. et al., Proceedings of the 37th Conference on Neural Information Processing Systems. Vol. 36.Neural information processing systems foundation. , pp.198465-198465.
- Zuo, C. et al., 2025. Transformer IMU calibrator: Dynamic on-body IMU calibration for inertial motion capture. Presented at: SIGGRAPH 2025 Vancouver, Canada 10-14 August 2025. Vol. 44.Vol. 4. New York, NY, USA: Association for Computing Machinery. , pp.45-45. (10.1145/3730937)
- Zuo, C. et al., 2024. Loose inertial poser: Motion capture with IMU-attached loose-wear jacket. Presented at: The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 Seattle, USA 17-21 June 2024. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. , pp.2209-2219. (10.1109/CVPR52733.2024.00215)
Research
My research interests are centered around machine learning (ML) and its applications in computer vision (CV), computer graphics (CG), human-computer interactions (HCI), etc. My current research revolves around the following three themes:
- Unboxing the deep learning black-box, which aims to understand the training dynamics of various deep neural networks, e.g. Generative Adversarial Networks (GANs).
- Creative Artificial Intelligence (AI), including GAN inversion, region-wise image disentanglement for diverse and controllable image synthesis/manipulation, etc.
- Prototyping wearable devices for motion monitoring and analysis, which aims to model the relationships between sensor signals and human motion using machine learning. This is collaborative research with Prof. Shihui Guo from Xiamen University, China.
I am also interested in other related topics like semantic segmentation, domain adaptation, semi-supervised learning, etc.
Projects
- Taith Research Mobility Funding, £925, PI, Welsh Government, 2025.10 - 2025.10.
- Empathetic Chatbots, £2,790, PI, Cardiff University On-campus Internship Funding, 2025.06 - 2025.08.
- A Natural Language Interface for Directing Virtual Character Performances, £59,981, PI, XR Network+ project (EP/W020602/1) of EPSRC, Sep 2024 - Feb 2025
- Uncovering the “Instincts” of Deep Generative Models for Fair and Unbiased Visual Content Creation, approx. £72,922, Main Supervisor, EPSRC DTP, No. EP/T517951/1 (2599521), Oct 2021 - Mar 2025
- Prototyping Smart Clothes for Population-level Motion Monitoring and Analysis, £12,000, PI, Royal Society International Exchanges 2021 Cost Share (NSFC), No. IEC\NSFC\211022, Mar 2022 - Mar 2025
- CU-XMU Collaboration Funding, £1,880, Co-I, Cardiff University, Nov 2024 - Dec 2024
- Cardiff University 2024 Research Culture Fund, £2,500, Co-I, HEFCW, May 2024 - Aug 2024
- Roadmap Development – Creating Chattable AI Personas for Museums and Archives, £4,000, PI, Digital Transformation Innovation Institute (DTII) Seedcorn Funding, No. DTIIR3SC04, Apr 2024 - Aug 2024
- Revolutionizing Visitor Experiences of Cultural Heritage: Unleashing the Power of AI-Powered Chattable Avatars in Interactive Exhibitions, £15,000, PI, UKRI Harmonised Impact Acceleration Account (IAA), No. 521632 (525436), Nov 2023 - May 2024
- Charting New Frontiers: An Exploratory Expedition and Pilot Study on Chattable Virtual Avatars, Unveiling Ethical and Social Dimensions in Content Delivery, £5,000, PI, GW4 Crucible 2023 Seed Funding, No. Cru23_01, Sep 2023 - Mar 2024
- Sensor Layout Optimization for 3D Body Shape Reconstruction, £3,200, PI, Cardiff-Xiamen University Research Collaboration Funding, Jul 2020 - Aug 2020
Activities/Events
- Dr Jenner’s House Ideation workshop, co-organised with Dr. Deborah Brewis from the University of Bath, October 27, 2025.
- LSW ECR Colloquium 2025 Committee Member, organised by the Learned Society of Wales (LSW), March - June 2025.
- 2024 Festival of Social Science, Artificial Intelligence in culture and heritage, organised by Dr. Jenny Kidd with support from our team, November 7, 2024.
- Conversations with History: bringing historical characters to life through AI-powered avatars, organised by National Trust with support from our team, October 26-27, 2024.
- Roadmap Development Workshops, organised by Dr. Yipeng Qin, May 31 and July 24, 2024.
- [AI UK Fringe 2024 Event] Past Meets Future: Can AI Personas Bring Historic Figures to Life?, organised by Dr. Yipeng Qin, March 27, 2024
- Cardiff University and National Trust Workshop on AI Avatars, co-organised with Dr. Daniel Finnegan, March 13, 2024
- Chattable Virtual Avatars for Museum & Archive Workshop, co-organised with Dr. Barbara Caddick from the University of Bristol, March 11, 2024.
Teaching
Modular Teaching
- 2021/22 - now, CMT307 Applied Machine Learning
- 2021/22 - now, CMT316 Applications of Machine Learning: Natural Language Processing/Computer Vision
- 2019/20 - now, CM1205 Architecture and Operating Systems, Module Lead
Awards
- Champion for Equality, Diversity and Inclusion (Nomination)
- Enriching Student Life Awards (ESLAs) 2025
- BY: Cardiff University and Cardiff Students Union (CUSU)
- Learning and Teaching Collaboration of the Year (Nomination)
- Enriching Student Life Awards (ESLAs) 2024
- BY: Cardiff University and Cardiff Students Union (CUSU)
- Champion for Equality, Diversity and Inclusion (Nomination)
- Enriching Student Life Awards (ESLAs) 2024
- BY: Cardiff University and Cardiff Students Union (CUSU)
External Examiner
- BSc Programme
- BSc Artificial Intelligence (UI4AA), University of Derby, May 2025 - Sep 2030
- BSc AI and Data Science (UI4AD), University of Derby, May 2025 - Sep 2030
Biography
Education and Qualifications
- 2017: PhD in Computer Science, National Centre for Computer Animation, Bournemouth University, UK
- 2013: BEng in Electrical Engineering, Shanghai Jiao Tong University, China
Career overview
- 2023 - present: Senior Lecturer, Cardiff University, UK
- 2019 - 2023: Lecturer, Cardiff University, UK
- 2017 - 2019: Postdoctoral Research Fellow, Visual Computing Center, King Abdullah University of Science and Technology, Saudi Arabia
Administrative Role
- 2024.09 - now, Deputy Director of Research
- 2024.02 - now, Lead of Computer Vision Research Group
- 2024.04 - 2024.09, PGT Admissions Tutor
- 2020.10 - 2024.09, PGT Programme Lead (COMSC) of MSc Data Science and Analytics and MSc Data Analytics for Government
Honours and awards
(Research)
- SIGGRAPH 2025 Best Paper Award
- News Cover Image
- BY: ACM SIGGRAPH Committees
- CVPR 2025 Outstanding Reviewer Award
- 711 out of 12,593 (5.6%)
- BY: The Computer Vision Foundation (CVF)
- CVPR 2024 Best Paper Award Candidate
- BY: The Computer Vision Foundation (CVF)
- CHCI 2024 Best Technology Demonstration Award
- Human Motion Capture System Based on Loose-wear Inertial Sensors
- BY: The 20th Chinese Human-Computer Interaction Conference (CHCI 2024)
(Teaching)
- Champion for Equality, Diversity and Inclusion (Nomination)
- Enriching Student Life Awards (ESLAs) 2025
- BY: Cardiff University and Cardiff Students Union (CUSU)
- Learning and Teaching Collaboration of the Year (Nomination)
- Enriching Student Life Awards (ESLAs) 2024
- BY: Cardiff University and Cardiff Students Union (CUSU)
- Champion for Equality, Diversity and Inclusion (Nomination)
- Enriching Student Life Awards (ESLAs) 2024
- BY: Cardiff University and Cardiff Students Union (CUSU)
Professional memberships
- EPSRC Peer Review College
- Member of ACM SIGGRAPH, Computer Vision Foundation (CVF), AsiaGraphics
- European Laboratory for Learning and Intelligent Systems (ELLIS)
- HEA Fellowship
- Learned Society of Wales - Advisory Group for Researcher Development
- Humanities and Data Science Special Interest Group @ The Alan Turing Institute
- British Academy Early Career Researcher Network
Academic positions
- 2023 - present: Senior Lecturer, Cardiff University, UK
- 2019 - 2023: Lecturer, Cardiff University, UK
- 2017 - 2019: Postdoctoral Research Fellow, Visual Computing Center, King Abdullah University of Science and Technology, Saudi Arabia
Committees and reviewing
- Area Chair: ICML 2026, ICML 2025, BMVC 2025, ICML 2024
- Grant Reviewer:
- Turing AI World-Leading Researcher Fellowships
- UKRI Future Leaders Fellowships (FLF)
- EPSRC Peer Review College
- EPSRC
- EPSRC New Investigator Award (NIA)
- EPSRC Additional Funding Programme for Mathematical Science
- Belgium Concerted Research Actions (CRAs)
- Journal Reviewer:
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- ACM Transactions on Graphics (TOG)
- IEEE Transactions on Image Processing (TIP)
- IEEE Transactions on Visualization and Computer Graphics (TVCG)
- IEEE Transactions on Multimedia (TMM)
- Transactions on Machine Learning Research (TMLR)
- Computer Graphics Forum (CGF)
- Pattern Recognition
- Neurocomputing
- The Visual Computer
- Computer Animation and Virtual Worlds (CAVW)
- Computers & Graphics
- Expert Systems With Applications
- Signal, Image and Video Processing
- Biomedical Signal Processing and Control
- Journal of Open Humanities Data
- IEEE Access
- Visual Informatics
- World Wide Web
- Image and Vision Computing
- Frontiers in Imaging
- Conference Program Committee (PC) Member / Reviewer:
- ACM SIGGRAPH
- ACM SIGGRAPH Asia
- CVPR
- ICCV
- ECCV
- NeurIPS
- NeurIPS Datasets and Benchmarks Track
- ICLR
- ICLR BlogPosts Track
- ICML
- AAAI
- ACM MM
- AISTATS
- Eurographics
- Pacific Graphics
- BMVC
- WACV
- ACCV
- CVM
- CGI
- CASA
- IJCNN
- VRCAI
- ICXR
- GPC
- Book Reviewer: CRC Press/Taylor and Francis Group
Supervisions
I am interested in supervising PhD students in the areas of:
- AI for Generative Modelling
- Image Synthesis and Manipulation
- Interpretable ML/AI for Visual Content Generation
- Bias and Fairness in AI
- 3D Geometry Processing
For CSC applicants: Cardiff University benefits from an official partnership with China Scholarship Council (CSC). If you are interested in doing research with me, please contact me via email as soon as possible as additional deadlines/steps may apply.
Current PhD Supervision
- Stephen Miles (2021/07 - now) - Image Restoration with Deep Learning (Part-time).
- Shuang Song (2021/10 - now) - A Novel Depth Estimation Network For Image Stitching (CSC).
- Jinqi Wang (2022/10 - now) - AI-based Anime Content Creation
- Zhuoling Jiang (2024/01 - now) - AI-driven NPCs for Next-Generation Gaming (School-funded)
- Yuan Wang (2024/10 - now) - AI-powered Photography
Co-supervision
- Hateef Alshewaier (2021/04 - now) - Ensemble Methods For Multimedia Data Classification (co-supervised with Dr. Xianfang Sun).
- Ebtihal Alwadee (2021/10 - now) - Novel Adaptive Down-Sample Neural Network Classification For Detecting Brain Tumour From Mri Brain Images (co-supervised with Dr. Frank Langbein and Dr. Xianfang Sun).
- Nada Saad M Alharbi (2023/10 - now) - Detection of Autism Spectrum Disorder Using Deep Reinforcement Learning (co-supervised with Dr. Xianfang Sun).
- Yang Li (2023/10 - now) - Generative Models for Architectural Facade Design (co-supervised with Dr. Bailin Deng and Prof. Wassim Jabi).
- Zhengwen Chen (2024/10 - now) - Advanced Fine-Tuning of Large-Scale Image Generation Pretrained Models for Enhanced Performance and Control in Specialized Tasks (co-supervised with Prof. Yukun Lai and Dr. Oktay Karakus)
Current supervision
Past projects
- Yuanbang Liang (2021/10 - 2025/09) - Hubness Awareness Sampling for Deep Generative Models in Generation and Evaluation (EPSRC DTP) - Thesis Under Minor Corrections.
- Xin Zhao (2019/10 - 2025/06) - Saliency Modelling for Perceptual Image Processing (co-supervised with Prof. Hantao Liu).
- Thomas Poudevigne-Durance (2019/10 - 2024/06) - Generative Adversarial Networks For Rare Event Augmentation (co-supervised with Prof. Owen Jones, School of Mathematics).
Contact Details
Research themes
Specialisms
- Artificial intelligence
- Computer graphics
- Computer vision
- Machine learning
- Human Computer Interaction