About Me

I am a senior undergraduate (Class of ’26) at Yuanpei College, Peking University, majoring in Artificial Intelligence. I currently focus on Reinforcement Learning and Large Language Models (LLMs), with a particular interest in designing learning algorithms and LLM post-training. My research interests also cover AI Alignment and Interpretability. My research is driven by the following question:

How can we close the gap between artificial intelligence and human-level intelligence by designing reliable AI systems and efficient learning algorithms?

I am currently at the Ant Research Reinforcement Learning Lab (Ant Group), supervised by Professor Yi Wu and Researcher Ligeng Zhu. Starting in 2026, I will begin my PhD at the Institute for Interdisciplinary Information Sciences (IIIS), Tsinghua University.

Honors and Awards

  • Peking University Freshman Scholarship (2022)
  • Dean’s Scholarship, Institute for Artificial Intelligence, Peking University (2025)

Experiences

Publications

Projects

  • AReaL: Ant Reasoning Reinforcement Learning for LLMs

    Contributed to building the training and inference code for Vision-Language Models (VLMs).

  • TVRA: Large-Scale Tool-calling Vision Reasoning Agents

    Leading a research project on multimodal tool-calling agents for vision reasoning under the guidance of Prof. Yi Wu and Researcher Ligeng Zhu, with the goal of preparing the work for academic submission.

Language Proficiency

  • Chinese: Native
  • English: Advanced
  • Hokkien: Intermediate
  • Italian, Spanish: Basic