Ziwei Chai (柴子炜)

I am currently working at Zhipu AI (since Feb. 2025), where I co-lead the pre-training data pipeline for high-quality math and science data. My work covers end-to-end data processing (cleaning, parsing, synthesis, and evaluation) for the GLM model family, from GLM-4.5 through GLM-5. I also contributed to reasoning post-training (SFT & RL), where the GLM series achieves open-source SOTA performance.

Before joining Zhipu AI, I was a Research Intern at ByteDance Seed (Nov. 2023 - Feb. 2025), working on multi-agent LLM systems. I am a final-year Ph.D. student at Zhejiang University, advised by Prof. Yang Yang; I also received my B.E. degree in Software Engineering there in 2021.

I am currently open to industry opportunities on LLM teams. If you have a suitable position, please feel free to contact me via email or WeChat.

"...Die Bedeutung eines Wortes ist sein Gebrauch in der Sprache."

"...The meaning of a word is its use in the language."

— Ludwig Wittgenstein, Philosophische Untersuchungen, §43

News

Dec 2025 🚀 GLM-4.7 is released! Blog
Sep 2025 🚀 GLM-4.6 is released! Blog
Jul 2025 🚀 GLM-4.5 is released! Technical Report
Feb 2025 💼 Joined Zhipu AI (GLM Team) as a Technical Intern
May 2024 🎉 Two papers accepted: ETR (Multi-Agent Collaboration) to ACL 2024 and InfiAgent (Agent Evaluation) to ICML 2024.
Jan 2024 🚀 Released InfiAgent-DABench, a comprehensive benchmark for Agentic Data Analysis.
Nov 2023 💼 Joined ByteDance Seed as a Research Intern

Selected Publications

Full Paper List | Citations: 714
  1. Technical Report
    GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
    Contributor to the GLM Team
    arXiv preprint arXiv:2508.06471, 2025
  2. NeurIPS
    Cypher-RI: Reinforcement Learning for Integrating Schema Selection into Cypher Generation
    Hanchen Su, Xuyuan Li, Yan Zhou, Ziwei Chai, Haozheng Wang, Chen Zhang, and 1 more author
    In Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025
  3. ACL
    An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
    Ziwei Chai, Guoyin Wang, Jing Su, Tianjie Zhang, Xuanwen Huang, Xuwu Wang, and 5 more authors
    In Annual Meeting of the Association for Computational Linguistics, 2024
  4. WWW
    Can GNN be Good Adapter for LLMs?
    Xuanwen Huang, Kaiqiao Han, Yang Yang, Dezheng Bao, Quanjin Tao, Ziwei Chai, and 1 more author
    In Proceedings of the ACM Web Conference, 2024
  5. ICML
    InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
    Xueyu Hu, Ziyu Zhao, Shuang Wei, Ziwei Chai, Guoyin Wang, Xuwu Wang, and 9 more authors
    In International Conference on Machine Learning, 2024
  6. TBD
    GraphLLM: Boosting Graph Reasoning Ability of Large Language Model
    Ziwei Chai, Tianjie Zhang, Liang Wu, Kaiqiao Han, Xiaohai Hu, Xuanwen Huang, and 1 more author
    IEEE Transactions on Big Data, 2025
  7. AAAI
    Towards Learning to Discover Money Laundering Sub-network in Massive Transaction Network
    Ziwei Chai, Yang Yang, Jiawang Dan, Sheng Tian, Changhua Meng, Weiqiang Wang, and 1 more author
    In Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
  8. IJCAI
    Can Abnormality be Detected by Graph Neural Networks?
    Ziwei Chai, Siqi You, Yang Yang, Shiliang Pu, Jiarong Xu, Haoyang Cai, and 1 more author
    In International Joint Conferences on Artificial Intelligence, 2022