About me

I am an Associate Professor in Computer Science and Engineering at the Chinese University of Hong Kong. I am also the Chief Scientist at Kunlun Wanwei Technology and Skywork AI. I got my Ph.D. from Northwestern University in 2015 and B.S. from Tsinghua University in 2010. My research interests specialize in efficient/sparse architectures, model compression, and multimodal learning. From 2018 to 2023, I was a Principal Researcher at Microsoft Research Redmond, and led several teams to productize the aforementioned techniques for Microsoft-OpenAI models (Copilot, DALL-E-2, ChatGPT, GPT-4). From 2023 to 2025, I was the Chief Scientist at Minimax. Together with Minimax and Skywork AI, we delivered many GenAI models: Minimax abab6.5/7, M1, Hailuo Video, Skywork SuperAgent, Skyweels and Matrix-Game.

I regularly serve as a Senior Area Chair for NeurIPS and ICML and as an Area Chair for CVPR, ICLR, ACL, AAAI, and EMNLP. I also serve as the Action Editor for Transactions on Machine Learning Research (TMLR) and ACM Transactions on Intelligent Systems and Technology (TIST). My papers have won IEEE 2024 SPS Young Author Best Paper Award, Outstanding Paper Award in NeurIPS 2023, and Best Student Paper Honorable Mention in WACV 2021. I am an affiliate professor/faculty at Tsinghua University, Shanghai Jiao Tong University, Fudan University, Zhejiang University, and University of Science and Technology of China.

Latest News