I am Zihan Wang (王子涵), an incoming M.S. student in Artificial Intelligence at Tsinghua University (shenzhen). My advisor is Prof. Yujiu Yang.

I received my B.Eng. in Computer Science and Technology from Xi’an Jiaotong University.

My research interests center on NLP / Large Language Models (LLMs), with a focus on building intrinsically motivated, self-evolving, and reasoning-capable agents, particularly for coding, graphical user interface (GUI) interaction.

📖 Education

Sep. 2026 (incoming), Tsinghua University, M.S. in AI (Shenzhen)
Sep. 2022 – Jul. 2026, Xi’an Jiaotong University, B.Eng. in Computer Science and Technology
GPA 3.97/4.30, Rank 5/193

🔥 News

2026.03: 📄 Our paper “CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges” was released on arXiv.
2026.09: 🎓 Incoming M.S. student at Tsinghua University (Shenzhen).
2025: 🎉 Our paper “Alignment for Efficient Tool Calling of Large Language Models” was accepted to EMNLP 2025 (Main).
2025: 🎉 Our paper “Reducing Tool Hallucination via Reliability Alignment” was accepted to ICML 2025.

📝 Publications

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges
Zi-Han Wang, Lam Nguyen, Zhengyang Zhao, Mengyue Yang, Chengwei Qin, Yujiu Yang, Linyi Yang
[ArXiv] [Homepage] [Code]

arXiv 2026

Introduces CreativeBench, a benchmark for machine creativity in code generation, covering both combinatorial and exploratory creativity with executable evaluation and an inference-time enhancement strategy, EvoRePE.

Alignment for Efficient Tool Calling of Large Language Models
Hongshen Xu*, Zihan Wang*, Zichen Zhu, Lei Pan, Xingyu Chen, Lu Chen, Kai Yu
[ArXiv]

EMNLP 2025 (Main)

Proposes a multi-objective alignment framework combining probabilistic knowledge boundary estimation with dynamic decision making to reduce unnecessary tool calls while preserving performance.

Reducing Tool Hallucination via Reliability Alignment
Hongshen Xu, Su Zhu, Zihan Wang, Hang Zheng, Da Ma, Ruisheng Cao, Shuai Fan, Lu Chen, Kai Yu
[ArXiv]

ICML 2025

Defines and categorizes tool hallucinations (tool-selection vs tool-usage) and introduces reliability-oriented alignment to improve robust tool interaction and efficiency.

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Huichi Zhou, Yihang Chen, Siyuan Guo, Xue Yan, Kin Hei Lee, Zihan Wang, Ka Yiu Lee, Guchun Zhang, Kun Shao, Linyi Yang†, Jun Wang†
[ArXiv] [Code]

Technical Report

A memory-based continual improvement framework enabling agents to learn from experience without updating model weights.

Delusions of Large Language Models
Hongshen Xu, Zixv Yang, Zichen Zhu, Kunyao Lan, Zihan Wang, Mengyue Wu, Ziwei Ji, Lu Chen, Pascale Fung, Kai Yu
[ArXiv]

Manuscript

Investigates high-confidence hallucination phenomena (“delusions”) and analyzes underlying causes and behaviors.

Repository

AgentFly

🎖 Honors and Awards

2022–2023: National Scholarship
2023–2024: Outstanding Award — American College Students Mathematical Modeling Competition
2023–2024: Provincial First Prize — National College Students’ Mathematical Competition
2023–2024: Outstanding Volunteer Award
2024–2025: Baidu Artificial Intelligence and Big Data Elite Class

Zihan Wang（王子涵）

📖 Education

🔥 News

📝 Publications

Repository

🎖 Honors and Awards