Hao Wu
Researcher @ Tencent | Department of Computer Science, University of Science and Technology of China | Tsinghua University
I am currently a Research Intern at Tencent. I graduated from the Department of Computer Science at the University of Science and Technology of China (USTC). During my master's studies, I was also a joint training student in the large model training group of the Machine Learning Platform Department at Tencent.
My research lies at the intersection of robot video and world models, multimodal large language models, and agent systems. More broadly, I am interested in building intelligent systems that can understand, predict, and reason about the physical world across modalities.
My recent and future focus spans several closely related directions: agentic reasoning, multimodal large models, robot video generation and embodied world models, and video world models. My work has appeared in venues such as ICLR, NeurIPS, ICML, KDD, AAAI, ICCV, ACM MM, TKDE, and TPAMI, including nearly 30 CCF-A publications.
Current Research Interests
News
Selected Awards & Honors
Experience
Research Intern, Tencent CSIG
Aug. 2025 - May 2026
Continuing research on multimodal foundation models, agent systems, and world models in industrial-scale settings.
Research Intern, Tencent Hunyuan
Aug. 2023 - Jul. 2025
Worked on large models, world models, and multimodal generative modeling in Tencent Hunyuan.
Online Research Intern, UCLA
May 2023 - May 2024
Conducted remote research on multimodal learning, dynamics modeling, and related machine learning problems.
Research Intern, HKUST (Guangzhou)
Mar. 2023 - Aug. 2023
Worked on machine learning and multimodal modeling in the CityMind research environment.
Selected Publications
Full list on Google Scholar
Service
- Reviewer: ICLR, KDD, NeurIPS, ICCV, AAAI, TKDE, ICML, and ACM MM
- Research Areas: Multimodal large models, video world models, and agent systems