Selected Publications
2024
- Reinforcement Learning with Token-level Feedback for Controllable Text Generation. Wendi Li, Wei Wei, Kaihe Xu, Wenfeng Xie, Dangyang Chen, Yu Cheng. NAACL 2024
- Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy. Pingzhi Li, Zhenyu Zhang, Prateek Yadav, Yi-Lin Sung, Yu Cheng, Mohit Bansal, Tianlong Chen. ICLR 2024
- Sparse MoE with Language Guided Routing for Multilingual Machine Translation. Xinyu Zhao, Xuxi Chen, Yu Cheng, Tianlong Chen. ICLR 2024
- GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions. Woojeong Jin, Subhabrata Mukherjee, Yu Cheng, Yelong Shen, Weizhu Chen, Ahmed Hassan Awadallah, Damien Jose, Xiang Ren. arXiv preprint
- CR-MoE: Consistent Routed Mixture-of-Experts for Scaling Contrastive Learning. Ziyu Jiang, Guoqing Zheng, Yu Cheng, Ahmed Hassan Awadallah, Zhangyang Wang. Transactions on Machine Learning Research (TMLR)
- Enhancing Low-Resource Relation Representations through Multi-View Decoupling. Chenghao Fan, Wei Wei, Xiaoye Qu, Zhenyi Lu, Wenfeng Xie, Yu Cheng. AAAI 2024
- Unsupervised Domain Adaptative Temporal Sentence Localization with Mutual Information Maximization. Daizong Liu, Xiang Fang, Xiaoye Qu, Jianfeng Dong, He Yan, Yang Yang, Yu Cheng. AAAI 2024
- ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation. Xing Di, Yiyu Zheng, Xiaoming Liu, Yu Cheng. WACV 2024
2023
- DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models. Boxin Wang, Weixin Chen, Hengzhi Pei, Chulin Xie, Mintong Kang, Chenhui Zhang, Chejian Xu, Zidi Xiong, Ritik Dutta, Rylan Schaeffer, Sang T Truong, Simran Arora, Mantas Mazeika, Dan Hendrycks, Zinan Lin, Yu Cheng, Sanmi Koyejo, Dawn Song, Bo Li. NeurIPS 2023 (Outstanding Paper Award)
- Annotations Are Not All You Need: A Cross-modal Knowledge Transfer Network for Unsupervised Temporal Sentence Grounding. Xiang Fang, Daizong Liu, Wanlong Fang, Pan Zhou, Yu Cheng, Keke Tang, Kai Zou. EMNLP 2023
- Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling. Yunfan Li, Yiran Wang, Yu Cheng, Lin Yang. ICML 2023
- Local Byte Fusion for Neural Machine Translation. Makesh Narsimhan Sreedhar, Xiangpeng Wan, Yu Cheng, Junjie Hu. ACL 2023
- DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models. Xuxi Chen, Tianlong Chen, Weizhu Chen, Zhangyang Wang, Ahmed Hassan Awadallah, Yu Cheng. ACL 2023
- You Are Catching My Attention: Are Vision Transformers Bad Learners Under Backdoor Attacks? Zenghui Yuan, Pan Zhou, Kai Zou, Yu Cheng. CVPR 2023
- Transform-Equivariant Consistency Learning for Temporal Sentence Grounding. Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Zichuan Xu, Haozhao Wang, Xing Di, Weining Lu, Yu Cheng. CVPR 2023
- What do Compressed Large Language Models Forget? Robustness Challenges in Model Compression. Mengnan Du, Subhabrata Mukherjee, Yu Cheng, Milad Shokouhi, Xia Hu, Ahmed Hassan Awadallah. EACL 2023
- Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning. Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao. ICLR 2023
- Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis. Wan-Cyuan Fan, Yen-Chun Chen, Dongdong Chen, Yu Cheng, Lu Yuan, Yu-Chiang Frank Wang. AAAI 2023
- Hypotheses Tree Building for One-Shot Temporal Sentence Localization. Daizong Liu, Xiang Fang, Pan Zhou, Xing Di, Weining Lu, Yu Cheng. AAAI 2023
- Filling the Information Gap between Video and Query for Language-Driven Moment Retrieval. Daizong Liu, Xiaoye Qu, Jianfeng Dong, Guoshun Nan, Pan Zhou, Zichuan Xu, Lixing Chen, He Yan, Yu Cheng. ACMMM 2023
2022
- RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL. Jiexing Qi, Jingyao Tang, Ziwei He, Xiangpeng Wan, Yu Cheng, Chenghu Zhou, Xinbing Wang, Quanshi Zhang, Zhouhan Lin. EMNLP 2022
- Rethinking the Video Sampling and Reasoning Strategies for Temporal Sentence Grounding. Jiahao Zhu, Daizong Liu, Pan Zhou, Xing Di, Yu Cheng, Song Yang, Wenzheng Xu, Zichuan Xu, Yao Wan, Lichao Sun, Zeyu Xiong. EMNLP 2022
- M3ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design. Hanxue Liang, Zhiwen Fan, Rishov Sarkar, Ziyu Jiang, Tianlong Chen, Kai Zou, Yu Cheng, Cong Hao, Zhangyang Wang. NeurIPS 2022
- Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction. Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang. ECCV 2022
- DnA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment. Ziyu Jiang, Tianlong Chen, Xuxi Chen, Yu Cheng, Luowei Zhou, Lu Yuan, Ahmed Awadallah, Zhangyang Wang. ECCV 2022
- Scalable Learning to Optimize: A Learned Optimizer Can Train Big Models. Xuxi Chen, Tianlong Chen, Yu Cheng, Weizhu Chen, Ahmed Awadallah, Zhangyang Wang. ECCV 2022
- MA-CLIP: Towards Modality-Agnostic Contrastive Language-Image Pre-training. Haoxuan You, Luowei Zhou, Bin Xiao, Noel C Codella, Yu Cheng, Ruochen Xu, Shih-Fu Chang, Lu Yuan. ECCV 2022
- SemAttack: Natural Textual Attacks via Different Semantic Spaces. Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li. NAACL 2022
- A Good Prompt Is Worth Millions of Parameters? Low-resource Prompt-based Learning for Vision-Language Models. Woojeong Jin, Yu Cheng, Yelong Shen, Weizhu Chen, Xiang Ren. ACL 2022
- The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy. Tianlong Chen, Zhenyu Zhang, Yu Cheng, Ahmed Awadallah, Zhangyang Wang. CVPR 2022
- Memory-guided Semantic Learning Network for Temporal Sentence Grounding. Daizong Liu, Xiaoye Qu, Xing Di, Yu Cheng, Zichuan Xu, Pan Zhou. AAAI 2022
- Unsupervised Temporal Video Grounding with Deep Semantic Clustering. Daizong Liu, Xiaoye Qu, Yinzhen Wang, Xing Di, Kai Zou, Yu Cheng, Zichuan Xu, Pan Zhou. AAAI 2022
- Playing Lottery Tickets with Vision and Language. Zhe Gan, Yen-Chun Chen, Linjie Li, Tianlong Chen, Yu Cheng, Shuohang Wang, Jingjing Liu. AAAI 2022
- Efficient Robust Training via Backward Smoothing. Jinghui Chen, Yu Cheng, Zhe Gan, Quanquan Gu, Jingjing Liu. AAAI 2022
- Adversarial Feature Augmentation and Normalization for Visual Recognition. Tianlong Chen, Yu Cheng, Zhe Gan, Jianfeng Wang, Lijuan Wang, Jingjing Liu, Zhangyang Wang. Transactions on Machine Learning Research (TMLR)
2021
- Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models. Boxin Wang, Chejian Xu, Shuohang Wang, Zhe Gan, Yu Cheng, Jianfeng Gao, Ahmed Hassan Awadallah, Bo Li. NeurIPS 2021
- VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation. Linjie Li, Jie Lei, Zhe Gan, Licheng Yu, Yen-Chun Chen, Rohit Pillai, Yu Cheng, Luowei Zhou, Xin Eric Wang, William Yang Wang, Tamara Lee Berg, Mohit Bansal, Jingjing Liu, Lijuan Wang, Zicheng Liu. NeurIPS 2021
- Chasing Sparsity in Vision Transformers: An End-to-End Exploration. Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang. NeurIPS 2021
- Data-Efficient GAN Training Beyond (Just) Augmentations: A Lottery Ticket Perspective. Tianlong Chen, Yu Cheng, Zhe Gan, Jingjing Liu, Zhangyang Wang. NeurIPS 2021
- The Elastic Lottery Ticket Hypothesis. Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Jingjing Liu, Zhangyang Wang. NeurIPS 2021
- MaxVA: Fast Adaptation of Step Sizes by Maximizing Observed Variance of Gradients. Chen Zhu, Yu Cheng, Zhe Gan, Furong Huang, Jingjing Liu, Tom Goldstein. ECML 2021
- EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets. Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang, Jingjing Liu. ACL 2021
- Cluster-Former: Clustering-based Sparse Transformer for Long-Range Dependency Encoding. Shuohang Wang, Luowei Zhou, Zhe Gan, Yen-Chun Chen, Yuwei Fang, Siqi Sun, Yu Cheng, Jingjing Liu. ACL 2021
- Deep Co-Attention Network for Multi-View Subspace Learning. Lecheng Zheng, Yu Cheng, Hongxia Yang, Nan Cao, Jingrui He. WWW 2021
- InfoBERT: Improving Robustness of Language Models from an Information Theoretic Perspective. Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu. ICLR 2021
- APo-VAE: Text Generation in Hyperbolic Space. Shuyang Dai, Zhe Gan, Yu Cheng, Chenyang Tao, Lawrence Carin, Jingjing Liu. NAACL 2021
- Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning. Jason Wei, Chengyu Huang, Soroush Vosoughi, Yu Cheng, Shiqi Xu. NAACL 2021
- Context-aware Biaffine Localizing Network for Temporal Sentence Grounding. Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie. CVPR 2021
- UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training. Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu. CVPR 2021
- EnlightenGAN: Deep Light Enhancement without Paired Supervision. Yifan Jiang, Xinyu Gong, Ding Liu, Yu Cheng, Chen Fang, Xiaohui Shen, Jianchao Yang, Pan Zhou, Zhangyang Wang. IEEE Transactions on Image Processing (TIP)
- Meta Module Network for Compositional Visual Reasoning. Wenhu Chen, Zhe Gan, Linjie Li, Yu Cheng, William Wang, Jingjing Liu. WACV 2021 (Best Student Paper Honorable Mention)
2020
- Contextual Text Style Transfer. Yu Cheng, Zhe Gan, Yizhe Zhang, Oussama Elachqar, Dianqi Li, Jingjing Liu. EMNLP 2020
- HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training. Linjie Li, Yen-Chun Chen, Yu Cheng, Zhe Gan, Licheng Yu, Jingjing Liu. EMNLP 2020
- Multi-Fact Correction in Abstractive Text Summarization. Yue Dong, Shuohang Wang, Zhe Gan, Yu Cheng, Jackie Chi Kit Cheung, Jingjing Liu. EMNLP 2020
- Cross-Thought for Sentence Encoder Pre-training. Shuohang Wang, Yuwei Fang, Siqi Sun, Zhe Gan, Yu Cheng, Jing Jiang, Jingjing Liu. EMNLP 2020
- Contrastive Distillation on Intermediate Representations for Language Model Compression. Siqi Sun, Zhe Gan, Yu Cheng, Yuwei Fang, Shuohang Wang, Jingjing Liu. EMNLP 2020
- UNITER: Universal Image-Text Representation Learning. Yen-Chun Chen, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed, Zhe Gan, Yu Cheng, Jingjing Liu. ECCV 2020
- Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models. Jize Cao, Zhe Gan, Yu Cheng, Licheng Yu, Yen-Chun Chen, Jingjing Liu. ECCV 2020
- Large-Scale Adversarial Training for Vision-and-Language Representation Learning. Zhe Gan, Yen-Chun Chen, Linjie Li, Chen Zhu, Yu Cheng, Jingjing Liu. NeurIPS 2020
- Graph Optimal Transport for Cross-Domain Alignment. Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu. ICML 2020
- FreeLB: Enhanced Adversarial Training for Language Understanding. Chen Zhu, Yu Cheng, Zhe Gan, Siqi Sun, Thomas Goldstein, Jingjing Liu. ICLR 2020
- Sequential Attention GAN for Interactive Image Editing. Yu Cheng, Zhe Gan, Yitong Li, Jingjing Liu, Jianfeng Gao. ACMMM 2020
- Fine-grained Iterative Attention Network for Temporal Language Localization in Videos. Xiaoye Qu, Pengwei Tang, Zhikang Zhou, Yu Cheng, Jianfeng Dong, Pan Zhou. ACMMM 2020
- Distilling the Knowledge of BERT for Text Generation. Yen-Chun Chen, Zhe Gan, Yu Cheng, Jingzhou Liu, Jingjing Liu. ACL 2020
- INSET: Sentence Infilling with Inter-sentential Generative Pre-training. Yichen Huang, Yizhe Zhang, Oussama Elachqar, Yu Cheng. ACL 2020
- Discourse-Aware Neural Extractive Model for Text Summarization. Jiacheng Xu, Zhe Gan, Yu Cheng, Jingjing Liu. ACL 2020
- Adversarial Robustness: From Self-Supervised Pre-Training to Fine-Tuning. Tianlong Chen, Sijia Liu, Shiyu Chang, Yu Cheng, Lisa Amini, Zhangyang Wang. CVPR 2020
- VIOLIN: A Large-Scale Dataset for Video-and-Language Inference. Jingzhou Liu, Wenhu Chen, Yu Cheng, Zhe Gan, Licheng Yu, Yiming Yang, Jingjing Liu. CVPR 2020
- BachGAN: High-Resolution Image Synthesis from Salient Object Layout. Yandong Li, Yu Cheng, Zhe Gan, Licheng Yu, Liqiang Wang, Jingjing Liu. CVPR 2020
- Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation. Shuyang Dai, Yu Cheng, Yizhe Zhang, Zhe Gan, Jingjing Liu, Lawrence Carin. ACCV 2020
- What Makes A Good Story? Designing Composite Rewards for Visual Storytelling. Junjie Hu, Yu Cheng, Zhe Gan, Jingjing Liu, Jianfeng Gao, Graham Neubig. AAAI 2020