About me
I’m Zongqi Wang, a second-year master’s student at Tsinghua University, supervised by Prof. Yang. I received my bachelor’s degree from Xidian University. My previous research focused on AI safety. My current research interests are primarily large language models, reinforcement learning, reward models, character models, role-playing, LLM-as-a-Judge, and LLM evaluation.
🔥 News
- 2026.01: 🎉🎉 We are excited to release our new paper: “Reward Modeling from Natural Language Human Feedback”.
- 2025.05: 🎉🎉 We are excited to release our new paper: “SCAN: Structured Capability Assessment and Navigation for LLMs”.
- 2025.05: 🎉🎉 Two of our papers on LLM watermarking were accepted to ACL 2025.
📝 Publications
⚖️ Reward Model
- Reward Modeling from Natural Language Human Feedback
- Zongqi Wang, Rui Wang, Yuchuan Wu, Yiyao Yu, Pinyi Zhang, Shaoning Sun, Yujiu Yang, Yongbin Li
- arXiv 2026.01 [pdf]
- S2J: Bridging the Gap Between Solving and Judging Ability in Generative Reward Models
- Shaoning Sun*, Jiachen Yu*, Zongqi Wang*, Xuewei Yang, Tianle Gu, Yujiu Yang
- arXiv 2025.09 [pdf]
- SCAN: Structured Capability Assessment and Navigation for LLMs
🛡️ AI Safety
- Invisible Entropy: Towards Safe and Efficient Low-Entropy LLM Watermarking
- MorphMark: Flexible Adaptive Watermarking for Large Language Models
- Robust and Minimally Invasive Watermarking for EaaS
- Thinking Racial Bias in Fair Forgery Detection: Models, Datasets and Evaluations
💻 Internships
- 2025.06 - present, Alibaba Tongyi Lab (通义实验室).
- 2024.12 - 2025.05, Baidu ERNIE Bot (文心一言).
📖 Education
- 2024.09 - 2027.06, Master’s degree (Expected), Tsinghua University
- 2020.09 - 2024.06, Bachelor’s degree, Xidian University