I am Zihan Wang (王子涵), an incoming M.S. student in Artificial Intelligence at Tsinghua University (shenzhen). My advisor is Prof. Yujiu Yang.

I received my B.Eng. in Computer Science and Technology from Xi’an Jiaotong University.

My research interests center on NLP / Large Language Models (LLMs), with a focus on building intrinsically motivated, self-evolving, and reasoning-capable agents, particularly for coding, graphical user interface (GUI) interaction.


📖 Education

  • Sep. 2026 (incoming), Tsinghua University, M.S. in AI (Shenzhen)
  • Sep. 2022 – Jul. 2026, Xi’an Jiaotong University, B.Eng. in Computer Science and Technology
    GPA 3.97/4.30, Rank 5/193

🔥 News

  • 2026.03: 📄 Our paper “CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges” was released on arXiv.
  • 2026.09: 🎓 Incoming M.S. student at Tsinghua University (Shenzhen).
  • 2025: 🎉 Our paper “Alignment for Efficient Tool Calling of Large Language Models” was accepted to EMNLP 2025 (Main).
  • 2025: 🎉 Our paper “Reducing Tool Hallucination via Reliability Alignment” was accepted to ICML 2025.

📝 Publications

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges
Zi-Han Wang, Lam Nguyen, Zhengyang Zhao, Mengyue Yang, Chengwei Qin, Yujiu Yang, Linyi Yang
[ArXiv] [Homepage] [Code]

arXiv 2026
  • Introduces CreativeBench, a benchmark for machine creativity in code generation, covering both combinatorial and exploratory creativity with executable evaluation and an inference-time enhancement strategy, EvoRePE.

Alignment for Efficient Tool Calling of Large Language Models
Hongshen Xu*, Zihan Wang*, Zichen Zhu, Lei Pan, Xingyu Chen, Lu Chen, Kai Yu
[ArXiv]

EMNLP 2025 (Main)
  • Proposes a multi-objective alignment framework combining probabilistic knowledge boundary estimation with dynamic decision making to reduce unnecessary tool calls while preserving performance.

Reducing Tool Hallucination via Reliability Alignment
Hongshen Xu, Su Zhu, Zihan Wang, Hang Zheng, Da Ma, Ruisheng Cao, Shuai Fan, Lu Chen, Kai Yu
[ArXiv]

ICML 2025
  • Defines and categorizes tool hallucinations (tool-selection vs tool-usage) and introduces reliability-oriented alignment to improve robust tool interaction and efficiency.

Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
Huichi Zhou, Yihang Chen, Siyuan Guo, Xue Yan, Kin Hei Lee, Zihan Wang, Ka Yiu Lee, Guchun Zhang, Kun Shao, Linyi Yang†, Jun Wang†
[ArXiv] [Code] GitHub stars

Technical Report
  • A memory-based continual improvement framework enabling agents to learn from experience without updating model weights.

Delusions of Large Language Models
Hongshen Xu, Zixv Yang, Zichen Zhu, Kunyao Lan, Zihan Wang, Mengyue Wu, Ziwei Ji, Lu Chen, Pascale Fung, Kai Yu
[ArXiv]

Manuscript
  • Investigates high-confidence hallucination phenomena (“delusions”) and analyzes underlying causes and behaviors.

Repository

AgentFly GitHub stars


🎖 Honors and Awards

  • 2022–2023: National Scholarship
  • 2023–2024: Outstanding Award — American College Students Mathematical Modeling Competition
  • 2023–2024: Provincial First Prize — National College Students’ Mathematical Competition
  • 2023–2024: Outstanding Volunteer Award
  • 2024–2025: Baidu Artificial Intelligence and Big Data Elite Class