About me

I am an Associate Professor of Computer Science and Engineering at the Chinese University of Hong Kong. I also serve as Chief Scientist at Kunlun Wanwei Technology & Skywork AI, and Lead Scientist/Professor at Shanghai AI Lab and Shanghai Innovation Institute. I received my Ph.D. from Northwestern University in 2015 and my B.S. from Tsinghua University in 2010. From 2023 to 2025, I served as Chief Scientist at Minimax, which went public in 2026 with a valuation of $40 billion. With my industry collaborators, I have helped deliver a range of GenAI models, including Minimax abab6.5/7, M1, Hailuo Video, Skywork SuperAgent, Skyclaw, and Matrix-Game. Prior to that, from 2018 to 2023, I was a Principal Researcher at Microsoft Research Redmond, where I led multiple teams to productize core technologies powering Microsoft–OpenAI models such as Copilot, DALL-E 2, ChatGPT, and GPT-4.

My research interests specialize in efficient/sparse architectures, model compression, and multimodal learning. I regularly serve as a Senior Area Chair for NeurIPS and ICML and as an Area Chair for CVPR, ICLR, ACL, AAAI, and EMNLP. I am the Action Editor for Transactions on Machine Learning Research and ACM Transactions on Intelligent Systems and Technology. My papers have won IEEE 2024 SPS Young Author Best Paper Award, Outstanding Paper Award in NeurIPS 2023, and Best Student Paper Honorable Mention in WACV 2021. I am an affiliate professor/faculty at Tsinghua University, Shanghai Jiao Tong University, Fudan University, Zhejiang University, and University of Science and Technology of China.

Latest News