About me

I am a Professor in Computer Science and Engineering at the Chinese University of Hong Kong. From 2018-2023, I was a Principal Researcher at Microsoft Research Redmond. Before that, I was a Research Staff Member at MIT-IBM Watson AI Lab. I got my Ph.D. from Northwestern University in 2015 and my B.S. degree from Tsinghua University in 2010. My research interests specialize in model compression & efficiency, deep generative models, and large multimodal/language models. From 2021 to 2023, I led several teams to productize these techniques for Microsoft-OpenAI core models (Copilot, DALL-E-2, ChatGPT, GPT-4).

I serve as a Senior Area Chair for NeurIPS and ICML and as an Area Chair for CVPR, ICLR, ACL, NAACL, and EMNLP. My papers have won the Cybersecurity Best Paper 2024, the Outstanding Paper Award in NeurIPS 2023, the Best Student Paper Honorable Mention in WACV 2021, and the Best Paper Finalist in SDM 2015. I am an affiliate professor/faculty at Tsinghua University, Shanghai Jiao Tong University, Fudan University, Zhejiang University, University of Science and Technology of China, and Tongji University.

Latest News

I am seeking two postdocs to join my research team at CUHK. Please drop me an email if you are interested in the position
Serving as Senior Area Chair ICML 2025 and NAACL 2025, Area Chair for CVPR 2025 and ICLR 2025 (Oct. 2024).
“DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models” is the winner of the 2024 Cybersecurity Best Paper Award (Sep. 2024).
I organized the Efficient Natural Language and Speech Processing Workshop at NeurIPS 2024. Please consider submitting your work to the workshop (Jul. 2024)
Invited talk at TTIC Summer Workshop on Multimodal Artificial Intelligence (June 2024).
Serving as Senior Area Chair for NeurIPS 2024, and Area Chair for EMNLP 2024 (Apr. 2024).
Our tutorial “Mixture-of-Experts in the Era of LLMs: A New Odyssey” has been accepted in ICML 2024 (Apr. 2024).
Serving as Senior Area Chair for ICML 2024, Area Chair for NAACL 2024 and ACL 2024 (Jan. 2024).
Invited talk at Tencent about Mixture of Experts in Large Language Models (Jan. 2024)
Our work “DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models” has won the Outstanding Paper Award in NeurIPS 2023 (Dec. 2023)
I organized the Efficient Natural Language and Speech Processing Workshop at NeurIPS 2023. Please consider submitting your work to the workshop (Oct. 2023)
Serving as Area Chair for CVPR 2024, ICLR 2024, and ACMMM 2024 (Aug. 2023)
Invited talk On the Efficiency and Robustness of Foundation Models at the Chinese University of Hong Kong and Tsinghua University (May 2023)
Serving as Senior Area Chair for the Main Track and Datasets & Benchmarks Track of NeurIPS 2023 (Mar. 2023)
I organized the Trustworthy and Reliable Large-Scale Machine Learning Models Workshop at ICLR 2023. Please consider submitting your work to the workshop (Feb. 2023)
Invited panel talk at Efficient Natural Language and Speech Processing Workshop at NeurIPS 2022 (Dec. 2022)
Invited talk at Fudan University (Dec. 2022)
Invited talk Hardware and Algorithms for Learning On-a-chip Workshop at ICCAD 2022 (Nov. 2022)
Invited talk Learning with Limited and Imperfect Data Workshop at ECCV 2022 (Oct. 2022)