About Me
I am a senior undergraduate (Class of ’26) at Yuanpei College, Peking University, majoring in Artificial Intelligence. I currently focus on Reinforcement Learning and Large Language Models (LLMs), with a particular interest in designing learning algorithms and LLM reasoning. My research interests also cover AI Alignment and Interpretability. My research is driven by the following question:
How can we close the gap between artificial intelligence and human-level intelligence by designing reliable AI systems and efficient learning algorithms?
I am currently a research intern at the Ant Research Reinforcement Learning Lab (Ant Group), supervised by Professor Yi Wu. In 2026, I will begin my PhD at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University, advised by Professor Yi Wu, while continuing to intern at the Ant Research Reinforcement Learning Lab.
Honors and Awards
- Peking University Freshman Scholarship (2022)
- Dean’s Scholarship, General Artificial Intelligence Experimental Class (2025)
Experiences
- Tong Class, Peking University, Undergraduate Student, September 2022 – Present
- PAIR Lab: PKU Alignment and Interaction Research Lab, Research Intern (advisor: Prof. Yaodong Yang, Institute for AI, Peking University), July 2023 – February 2025
- Tsinghua MARS Lab: Multimedia Computing, Autonomous Driving, Robotics and Sensors, Visiting Student Researcher (advisor: Prof. Hang Zhao, IIIS, Tsinghua University), February 2025 – June 2025
- Ant Research Reinforcement Learning Lab, Research Intern (supervisor: Prof. Yi Wu, Ant Group), June 2025 – Present
Publications
- (In Submission) Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback, Derun Li, Jianwei Ren, Changye Li, Yue Wang, Xin Wen, Pengxiang Li, Leimeng Xu, Kun Zhan, Peng Jia, Xianpeng Lang, Ningyi Xu, Hang Zhao
- (ICML 2025 Poster) SAE-V: Interpreting Multimodal Models for Enhanced Alignment, Hantao Lou, Changye Li, Jiaming Ji, Yaodong Yang
- (ACL 2025 Best Paper) Language Models Resist Alignment: Evidence From Data Compression, Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Jiayi Zhou*, Changye Li, Hantao Lou, Juntao Dai, Yunhuai Liu, Yaodong Yang
- (AAAI 2025) Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning, Wenzhe Fan, Zishun Yu, Chengdong Ma, Changye Li, Yaodong Yang, Xinhua Zhang
Project
- AReaL: Ant Reasoning Reinforcement Learning for LLMs. Contributed to building the training and inference code for Vision-Language Models (VLMs).
Language Proficiency
- Chinese: Native
- English: Advanced
- Hokkien: Intermediate
- Italian, Spanish: Basic