Publications

*: Equal contribution, : Corresponding author

2024

  1. Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting Oral NeurIPS'24
    Xiong-Hui Chen*, Ziyan Wang*, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, and Jun Wang
    In The Thirty-Eight Annual Conference on Neural Information Processing Systems (NeruIPS) , 2024
  2. Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf NeurIPS'24
    Xuanfa Jin*, Ziyan Wang*, Yali Du, Meng Fang, Haifeng Zhang, and Jun Wang
    In The Thirty-Eight Annual Conference on Neural Information Processing Systems (NeruIPS) , 2024
  3. Safe Multi-agent Reinforcement Learning with Natural Language Constraints ICLR'24 GenAI4DM Workshop
    Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, and Yali Du
    In ICLR 2024 Workshop on Generative Models for Decision Making (ICLR GenAI4DM) , 2024
  4. MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment NeurIPS'24 CRL Workshop
    In NeurIPS 2024 Causal Representation Learning Workshop (NeurIPS CRL) , 2024
  5. Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models AAMAS'24
    Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, and Yali Du
    In The 23rd International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) , 2024
  6. M³HF: Multi-agent Reinforcement Learning from Multi-phase Human Feedback of Mixed Quality Preprint
    Ziyan Wang, Zhicheng Zhang, Fei Fang, and Yali Du
    In Under Review , 2024

2023

  1. Chessgpt: Bridging policy learning and language modeling NeurIPS'23
    Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, and Jun Wang
    In The Thirty-Seventh Annual Conference on Neural Information Processing Systems (NeruIPS) , 2023
  2. Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach NeurIPS'23
    Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, and Mykola Pechenizkiy
    In The Thirty-Seventh Annual Conference on Neural Information Processing Systems (NeruIPS) , 2023

2021

  1. Multi-Agent Constrained Policy Optimisation Preprint
    Shangding Gu, Jakub Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, and Yaodong Yang
    In arXiv preprint , 2021