Ziyan Wang

Email: ziyan.wang[at]kcl[dot]ac[dot]uk

I am currently the second-year Ph.D. student at Cooperative AI Lab, King’s College London. I am supervised by Dr Yali Du and Prof. Sanjay Modgil. I received my M.S. degree from the University College London, where I was supervised by Prof. Jun Wang. I have also been fortunate to work closely with Prof. Fei Fang at the Carnegie Mellon University.

My research interests lie in the intersection of Multi-agent Reinforcement Learning, Large Language Models, and Robotics. Current research themes include:

  • Multi-agent Reinforcement Learning: Developing algorithms that enable agents to learn to collaborate and compete in complex environments.
  • Human Robot Cooperation: Bridging the gap between human and robot communication, enabling robots to understand huamn’s free-form instructions.
  • Safe Reinforcement Learning: Ensuring that agents learn policies that satisfy given constraints while accomplishing tasks.
  • Large Language Models: Exploring the capabilities of large language models in multi-agent settings.

News

May 2025 MACCA has been accepted to TMLR!
May 2025 M³HF has been accepted to ICML 2025! Looking forward to seeing everyone again in Vancouver this July! 🍁
Feb 2025 Starting a visiting PhD at Carnegie Mellon University under supervised by Prof. Fei Fang. See you in Pittsburgh!
Dec 2024 Gave a talk at Machine Learning Reading Group at Imperial College London (organized by Zijing, Ou) about our recent work on LLM and RL.
Nov 2024 Gave a talk at AISOC Lab at Carnegie Mellon University (hosted by Zhicheng Zhang) about our recent work on LLM and RL.
Oct 2024 🎉 One paper is accepted to NeurIPS 2024 Causal Representation Learning Workshop. Also happy to share that I got the NeurIPS 2024 Scholar Award! 🏆

Selected Publications

*: Equal contribution, : Corresponding author
  1. M³HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality ICML'25
    Ziyan Wang, Zhicheng Zhang, Fei Fang, and Yali Du
    In Forty-Second International Conference on Machine Learning (ICML) , 2025
  2. MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment TMLR
    In Transactions on Machine Learning Research (TMLR) , 2025
  3. Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting Oral NeurIPS'24
    Xiong-Hui Chen*, Ziyan Wang*, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, and Jun Wang
    In The Thirty-Eight Annual Conference on Neural Information Processing Systems (NeruIPS) , 2024
  4. Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf NeurIPS'24
    Xuanfa Jin*, Ziyan Wang*, Yali Du, Meng Fang, Haifeng Zhang, and Jun Wang
    In The Thirty-Eight Annual Conference on Neural Information Processing Systems (NeruIPS) , 2024
  5. Chessgpt: Bridging policy learning and language modeling NeurIPS'23
    Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, and Jun Wang
    In The Thirty-Seventh Annual Conference on Neural Information Processing Systems (NeruIPS) , 2023

Professional Services

  • Conference reviewer for ICML 2024/25, NeurIPS 2024/25, ICLR 2024/25, AISTATS 2025
  • Journal reviewer for IEEE Transactions on Knowledge and Data Engineering (TKDE), IEEE Transactions on Artificial Intelligence (TAI)