About me

I am a first-year CS Ph.D. student at Beijing Jiaotong University (BJTU), advised by Prof. Jinan Xu and Prof. Kaiyu Huang. I am broadly interested in Multimodal Large Language Models, Natural Language Generation and Trustworthy Large Language Models.

Before that, I obtained my Bachlor degree from the same Beijing Jiaotong University, majoring in Computer Science.

My research interests include:

  • 🖼️ Fine-grained Visual Perception (e.g. Visual Grounding)
  • 💬 Visual Reasoning in Multimodal Large Language Models
  • 🧩 The Reliability and Safety of LLM

Selected Publications

Full List is in Google Scholar

Education & Experience

  • B.S., Computer Science and Technology & Economics, Beijing Jiaotong University, 09/2020-06/2024
  • Ph.D., Computer Science and Technology, Beijing Jiaotong University, 09/2024-now
  • Internship at THUNLP Multimodal research group, 08/2024-07/2025
  • Internship at QiYuan Lab, 09/2025-now