
Hao Wu

Researcher @ Tencent; Department of Computer Science, University of Science and Technology of China; Tsinghua University

Feel free to reach out via email if you are interested in discussing research ideas or potential collaborations.

I am currently a Research Intern at Tencent. I graduated from the Department of Computer Science at the University of Science and Technology of China (USTC). During my master's studies, I was also a joint-training student in the large-model training group of the Machine Learning Platform Department at Tencent.

My research lies at the intersection of robot video and world models, multimodal large language models, and agent systems. More broadly, I am interested in building intelligent systems that can understand, predict, and reason about the physical world across modalities.

My recent and future focus includes several closely related directions: agentic reasoning, multimodal large models, robot video generation and embodied world models, and video world models. My work has appeared in venues such as ICLR, NeurIPS, ICML, KDD, AAAI, ICCV, ACM MM, TKDE, and TPAMI, with nearly 30 CCF-A publications.

Current Research Interests

News

2026.04
A paper on the topic of Inference-time Safety Alignment was accepted by ACL 2026.
2026.01
Two papers were accepted to TPAMI. Congrats to all collaborators!
2025.12
Two papers were accepted to AAAI and ICLR. Congrats to all collaborators!
2025.06
As corresponding author, one paper was accepted to ICCV. Congrats to all collaborators!
2025.05
As co-first author, one paper was accepted to ICML. Congrats to all collaborators!
2025.03
As first author, one paper was accepted to KDD. Congrats to all collaborators!
2024.12
As corresponding author, one paper was accepted to ICLR. Congrats to all collaborators!
2024.07
As first author or co-first author, three papers were accepted to NeurIPS. Congrats to all collaborators!
2024.05
As first author, one paper was accepted to KDD. Congrats to all collaborators!
2024.05
As first author, one paper was accepted to ACM MM. Congrats to all collaborators!
2024.05
As first author, one paper was accepted to ICML. Congrats to all collaborators!
2023.12
As first author, two papers were accepted to AAAI. Congrats to all collaborators!
2023.07
As co-first author, one paper was accepted to NeurIPS. Congrats to all collaborators!

Selected Awards & Honors

2025
Outstanding Graduate
USTC × Tencent Joint Training Program
2022
National Scholarship (Top 1% in China)
University of Science and Technology of China
2022
First-Class Scholarship
University of Science and Technology of China

Experience

Research Intern, Tencent CSIG

Aug. 2025 - May 2026

Tencent Jarvis Lab

Continuing research on multimodal foundation models, agent systems, and world models in industrial-scale settings.

Research Intern, Tencent Hunyuan

Aug. 2023 - Jul. 2025

Machine Learning Platform Department

Worked on large models, world models, and multimodal generative modeling in Tencent Hunyuan.

Online Research Intern, UCLA

May 2023 - May 2024

Advisor: Xiao Luo

Conducted remote research on multimodal learning, dynamics modeling, and related machine learning problems.

Research Intern, HKUST (Guangzhou)

Mar. 2023 - Aug. 2023

Advisors: Yuxuan Liang and Kun Wang

Worked on machine learning and multimodal modeling in the CityMind research environment.

Selected Publications

Full list on Google Scholar

Safety-Aware Rollouts with Self-Reflection and Structured Rewards

Huahui Yi, Kun Wang, Haolong Hu, Moayad Aloqaily, Liang Lin, Junhao Dong, Qiankun Li, Xing Fan, Hao Wu, Yang Liu, and Qingsong Wen Corresponding Author
TPAMI, Under Review

Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting

Yuqi Li, Chuanguang Yang, Hansheng Zeng, Zeyu Dong, Zhulin An, Yongjun Xu, Yingli Tian, and Hao Wu (Corresponding Author)
ICCV 2025

Service

  • Reviewer: ICLR, KDD, NeurIPS, ICCV, AAAI, TKDE, ICML, and ACM MM
  • Research Areas: Multimodal large models, video world models, and agent systems