🎓 Google Scholar | 💾 Demo videos | 🔗 Linkedin
I am currently a postdoc at the National University of Singapore, working with Prof. Gim Hee Lee. I was a Postdoctoral Fellow at the Robotics Institute of Carnegie Mellon University (2022--2024), working with Prof. Fernando de la Torre and Dr. Dong Huang. I received my Ph.D. degree from Sun Yat-sen University, advised by Prof. Jian Yin and Prof. Xiaodan Liang.
I mainly focus on Human-centered World Understanding and Generation
3D Hand & Human Reconstruction
Video Generation and Understanding
2D/3D Virtual Try-on Networks
Neural Representations and Rendering for Human
3D Vision
03/2025 Two papers accepted to CVPR 2025.
MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation
Learnable Infinite Taylor Gaussian for Dynamic View Rendering
12/2024 Workshop Proposal accepted to CVPR 2025: Visual Modeling Challenges for 2D-3D Virtual Try-On, https://vto-at-cvpr25.github.io
09/2024 One paper accepted to NeurIPS 2024: Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba
07/2024 One paper accepted to ACM MM 2024: DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models
07/2024 One paper accepted to ECCV 2024: Generalizable Human Guassians
Overview of my past research. Including controllable 2D/3D human image generation, realistic human try-on video synthesis, and accurate 3D human motion/generation using robust regressor/neural rendering.
The big picture of my future research plan. Firstly, building large human models for accurate and robust 3D humans based on a single image. Secondly, understanding and generating 3D humans in the scene. Lastly, leveraging Large Language Models (LLMs) and Large Vision Models (LVMs) to build Human-centered Artificial General Intelligence (HAGI).
Wenbo Gou (2023-Now): 3D Human Reconstruction
Master at CMU
Aviral Chharia (2023-Now): 3D Hand Reconstruction
Master at CMU
Zhenyu Xie (2019-Now): GP-VTON(CVPR23), WAS-VTON(ACM MM21)
Ph.D. at Sun Yat-sen University, visiting Stu. at CMU
Haoyuan Li (2022-Now): Coordinate Transformer (ICCV23, published during the undergraduate)
Master at Sun Yat-sen University
Xujie Zhang (2020-Now): Fashion Editing(CVPR20), WarpDiffusion
Ph.D. at Sun Yat-sen University, Research Intern at ByteDance.
Fuwei Zhao (2019-2022): M3D-VTON(ICCV21)
Researcher at ByteDance.
Very fortunate to meet you all. Welcome more motivated friends to collaborate together.
Organizer for CVPR 2025 Workshop on Visual Modeling Challenges for 2D-3D Virtual Try-On, https://vto-at-cvpr25.github.io
Organizer for CVPR 2020 Workshop on Human-centric Image/Video Synthesis. https://vuhcs.github.io
Organizer for CVPR 2019 Workshop on Augmented Human: Human-centric Understanding. https://vuhcs.github.io/vuhcs-2019/index.html
Reviewer for NeurIPS, CVPR, ICCV, ECCV, ICML, ICLR etc.
donghaoye12 at gmail.com | wechat: humanmodeling
© 2025 Haoye Dong