Zhiqing Sun

孙之清

avatar.jpg

PhD Student

CMU LTI (GHC 5507)

zhiqings[at]cs.cmu.edu

Hey there, welcome!

I am a final-year Ph.D. candidate at CMU LTI, advised by Prof. Yiming Yang. My research is generously supported by the Google PhD Fellowship in Natural Language Processing (2023) and the OpenAI Superalignment Fast Grants (2024). I received my B.S. in Computer Science from Peking University.

Research Interests

I am generally interested in machine learning and artificial intelligence. My recent research focuses on scalable alignment of foundation models. I am particularly interested in enhancing the reliability of foundation models, including large language models (LLMs) and large multimodal models (LMMs), through minimal human supervision and scalable oversight. This can be achieved using human-defined principles, factual feedback from real-world interactions, or easy-to-hard generalization. A few of my recent projects include:

  1. Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision: Guided by the observation that evaluation is easier than generation, we enabled large language models to excel on hard math problems beyond human evaluation capabilities through the easy-to-hard generalization of evaluators (e.g., process reward models).
  2. SALMON: Self-Alignment with Instructable Reward Models: We developed an Instructable Reward Model that helps RLAIF fully replace RLHF to align language models from scratch (enhancing both their alignment and capabilities)!
  3. Aligning Large Multimodal Models with Factually Augmented RLHF: We proposed Factually Augmented RLHF (Fact-RLHF) that augments the reward model with additional factual information to alleviate the reward hacking phenomenon in RLHF.

News

Education

  Language Technologies Institute, Carnegie Mellon University
  • Aug. 2019 - Present, M.S. / Ph.D. in Language Technologies
  School of Electrical Engineering & Computer Science (EECS), Peking University
  • Sept. 2015 - July 2019, B.S. in Computer Science (Summa Cum Laude)

Experience

  Allen Institute for AI
  MIT-IBM Watson AI Lab
  Google Brain
  Google Brain
  Microsoft Research Asia
  Mila - Quebec Artificial Intelligence Institute & University of Montreal

Selected Publications

For a more complete list or preprints, see the publications page, or my google scholar page.

(*=equal contribution)

2024

  1. ICLR
    SALMON: Self-Alignment with Instructable Reward Models
    Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Daniel Cox, Yiming Yang, and Chuang Gan
    In The Twelfth International Conference on Learning Representations, 2024

2023

  1. NeurIPS (Spotlight)
    Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
    Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David Cox, Yiming Yang, and Chuang Gan
    In Thirty-seventh Conference on Neural Information Processing Systems, 2023
  2. NeurIPS (Spotlight)
    DIFUSCO: Graph-based Diffusion Solvers for Combinatorial Optimization
    Zhiqing Sun, and Yiming Yang
    In Thirty-seventh Conference on Neural Information Processing Systems, 2023
  3. ICML
    A Neural PDE Solver with Temporal Stencil Modeling
    Zhiqing Sun, Yiming Yang, and Shinjae Yoo
    In Proceedings of the 40th International Conference on Machine Learning, 2023
  4. ICLR
    Recitation-Augmented Language Models
    Zhiqing Sun, Xuezhi Wang, Yi Tay, Yiming Yang, and Denny Zhou
    In The Eleventh International Conference on Learning Representations, 2023

2022

  1. NeurIPS
    Dimes: A differentiable meta solver for combinatorial optimization problems
    Ruizhong Qiu*, Zhiqing Sun*, and Yiming Yang
    Advances in Neural Information Processing Systems, 2022
  2. ICLR
    Sparse attention with learning to hash
    Zhiqing Sun, Yiming Yang, and Shinjae Yoo
    In International Conference on Learning Representations, 2022

2021

  1. ICCV
    Rethinking transformer-based set prediction for object detection
    Zhiqing Sun*, Shengcao Cao*, Yiming Yang, and Kris M Kitani
    In Proceedings of the IEEE/CVF international conference on computer vision, 2021

2020

  1. ACL
    MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
    Zhiqing Sun, Hongkun Yu, Xiaodan Song, Renjie Liu, Yiming Yang, and Denny Zhou
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
  2. ICML
    An EM approach to non-autoregressive conditional sequence generation
    Zhiqing Sun, and Yiming Yang
    In International Conference on Machine Learning, 2020

2019

  1. NeurIPS
    Fast structured decoding for sequence models
    Zhiqing Sun*, Zhuohan Li*, Haoqing Wang, Di He, Zi Lin, and Zhihong Deng
    Advances in Neural Information Processing Systems, 2019
  2. ICLR
    RotatE: Knowledge Graph Embedding by Relational Rotation in Complex Space
    Zhiqing Sun, Zhi-Hong Deng, Jian-Yun Nie, and Jian Tang
    In International Conference on Learning Representations, 2019