About me
I am a Visiting Professor at Rice University. From 2018-2023, I was a Principal Researcher at Microsoft Research Redmond. Before that, I was a Research Staff Member at IBM Research & MIT-IBM Watson AI Lab. I got my Ph.D. from Northwestern University in 2015 and my B.S. degree from Tsinghua University in 2010. My research covers deep learning in general, with specific interests in model compression/efficiency, deep generative models, and large multimodal/language models. From 2021 to 2023, I led several teams to productize these techniques for Microsoft-OpenAI core models (Copilot, DALL-E-2, ChatGPT, GPT-4).
I serve as a Senior Area Chair for NeurIPS and ICML, Area Chair for CVPR, ICLR, ACL, EMNLP, and NAACL, and on the Editorial Board of TACL. My papers have won the Outstanding Paper Award in NeurIPS 2023, the Best Student Paper Honorable Mention in WACV 2021, and the Best Paper Finalist in SDM 2015.
Latest News
- Serving as Senior Area Chair for NeurIPS 2024, and Area Chair for EMNLP 2024 (Apr. 2024).
- Our tutorial “Mixture-of-Experts in the Era of LLMs: A New Odyssey” has been accepted in ICML 2024 (Apr. 2024).
- Serving as Senior Area Chair for ICML 2024, Area Chair for NAACL 2024 and ACL 2024 (Jan. 2024).
- Invited talk at Tencent about Mixture of Experts in Large Language Models (Jan. 2024)
- Our work “DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models” has won the Outstanding Paper Award in NeurIPS 2023 (Dec. 2023)
- I organized the Efficient Natural Language and Speech Processing Workshop at NeurIPS 2023. Please consider submitting your work to the workshop (Oct. 2023)
- Serving as Area Chair for CVPR 2024, ICLR 2024, and ACMMM 2024 (Aug. 2023)
- Invited talk On the Efficiency and Robustness of Foundation Models at the Chinese University of Hong Kong and Tsinghua University (May 2023)
- Serving as Senior Area Chair for the Main Track and Datasets & Benchmarks Track of NeurIPS 2023 (Mar. 2023)
- I organized the Trustworthy and Reliable Large-Scale Machine Learning Models Workshop at ICLR 2023. Please consider submitting your work to the workshop (Feb. 2023)
- Invited panel talk at Efficient Natural Language and Speech Processing Workshop at NeurIPS 2022 (Dec. 2022)
- Invited talk at Fudan University (Dec. 2022)
- Invited talk Hardware and Algorithms for Learning On-a-chip Workshop at ICCAD 2022 (Nov. 2022)
- Invited talk Learning with Limited and Imperfect Data Workshop at ECCV 2022 (Oct. 2022)
- Serving as Area Chair for CVPR 2023, WACV 2023, and ACMMM 2023 (Sep. 2022)