Muning Wen

2020 – today

2025
[j6]
persistent URL: https://dblp.org/rec/journals/fcsc/WangHWPLL25
Dongzi Wang, Lilan Huang, Muning Wen, Yuanxi Peng, Minglong Li, Teng Li:
RDHNet: addressing rotational and permutational symmetries in continuous multi-agent systems. Frontiers Comput. Sci. 19(11): 1911365 (2025)
[c9]
persistent URL: https://dblp.org/rec/conf/aaai/ShiWZ0LL25
Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu:
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation. AAAI 2025: 738-745
[c8]
persistent URL: https://dblp.org/rec/conf/iclr/GuSW0MCWS25
Shangding Gu, Laixi Shi, Muning Wen, Ming Jin, Eric Mazumdar, Yuejie Chi, Adam Wierman, Costas J. Spanos:
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning. ICLR 2025
[c7]
persistent URL: https://dblp.org/rec/conf/iclr/LinWPNLWMZCZW025
Qiqiang Lin, Muning Wen, Qiuying Peng, Guanyu Nie, Junwei Liao, Jun Wang, Xiaoyun Mo, Jiamu Zhou, Cheng Cheng, Yin Zhao, Jun Wang, Weinan Zhang:
Robust Function-Calling for On-Device Language Model via Function Masking. ICLR 2025
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-08378
Tao Huang, Junli Ren, Huayi Wang, Zirui Wang, Qingwei Ben, Muning Wen, Xiao Chen, Jianan Li, Jiangmiao Pang:
Learning Humanoid Standing-up Control across Diverse Postures. CoRR abs/2502.08378 (2025)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-16496
Kun Hu, Muning Wen, Xihuai Wang, Shao Zhang, Yiwei Shi, Minne Li, Minglong Li, Ying Wen:
PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning. CoRR abs/2502.16496 (2025)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-19652
Shangding Gu, Laixi Shi, Muning Wen, Ming Jin, Eric Mazumdar, Yuejie Chi, Adam Wierman, Costas J. Spanos:
Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning. CoRR abs/2502.19652 (2025)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-16129
Junwei Liao, Muning Wen, Jun Wang, Weinan Zhang:
MARFT: Multi-Agent Reinforcement Fine-Tuning. CoRR abs/2504.16129 (2025)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-16736
Yingxuan Yang, Huacan Chai, Yuanyi Song, Siyuan Qi, Muning Wen, Ning Li, Junwei Liao, Haoyi Hu, Jianghao Lin, Gaowei Chang, Weiwen Liu, Ying Wen, Yong Yu, Weinan Zhang:
A Survey of AI Agent Protocols. CoRR abs/2504.16736 (2025)
2024
[j5]
persistent URL: https://dblp.org/rec/journals/nn/WangZLWPLY24
Dongzi Wang, Fangwei Zhong, Minglong Li, Muning Wen, Yuanxi Peng, Teng Li, Adam Yang:
RoMAT: Role-based multi-agent transformer for generalizable heterogeneous cooperation. Neural Networks 174: 106129 (2024)
[j4]
persistent URL: https://dblp.org/rec/journals/tii/GuHWCK24
Shangding Gu, Dianye Huang, Muning Wen, Guang Chen, Alois Knoll:
Safe Multiagent Learning With Soft Constrained Policy Optimization in Real Robot Control. IEEE Trans. Ind. Informatics 20(9): 10706-10716 (2024)
[c6]
persistent URL: https://dblp.org/rec/conf/icml/WanFWM00024
Ziyu Wan, Xidong Feng, Muning Wen, Stephen Marcus McAleer, Ying Wen, Weinan Zhang, Jun Wang:
AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training. ICML 2024
[c5]
persistent URL: https://dblp.org/rec/conf/nips/WenWWZ024
Muning Wen, Ziyu Wan, Jun Wang, Weinan Zhang, Ying Wen:
Reinforcing LLM Agents via Policy Optimization with Action Decomposition. NeurIPS 2024
[c4]
persistent URL: https://dblp.org/rec/conf/sigir/ZhouYW0WXXY024
Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang:
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision. SIGIR 2024: 3-13
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-06700
Muning Wen, Cheng Deng, Jun Wang, Weinan Zhang, Ying Wen:
Entropy-Regularized Token-Level Policy Optimization for Large Language Models. CoRR abs/2402.06700 (2024)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-06221
Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang:
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision. CoRR abs/2403.06221 (2024)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-15821
Muning Wen, Ziyu Wan, Weinan Zhang, Jun Wang, Ying Wen:
Reinforcing Language Agents via Policy Optimization with Action Decomposition. CoRR abs/2405.15821 (2024)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-05541
Yingxuan Yang, Huayi Wang, Muning Wen, Weinan Zhang:
P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training. CoRR abs/2408.05541 (2024)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-09541
Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu:
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation. CoRR abs/2409.09541 (2024)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2410-04587
Qiqiang Lin, Muning Wen, Qiuying Peng, Guanyu Nie, Junwei Liao, Jun Wang, Xiaoyun Mo, Jiamu Zhou, Cheng Cheng, Yin Zhao, Jun Wang, Weinan Zhang:
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking. CoRR abs/2410.04587 (2024)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2410-09671
Jun Wang, Meng Fang, Ziyu Wan, Muning Wen, Jiachen Zhu, Anjie Liu, Ziqin Gong, Yan Song, Lei Chen, Lionel M. Ni, Linyi Yang, Ying Wen, Weinan Zhang:
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models. CoRR abs/2410.09671 (2024)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-16516
Jun Wang, Jiamu Zhou, Muning Wen, Xiaoyun Mo, Haoyu Zhang, Qiqiang Lin, Cheng Jin, Xihuai Wang, Weinan Zhang, Qiuying Peng, Jun Wang:
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios. CoRR abs/2412.16516 (2024)
2023
[j3]
persistent URL: https://dblp.org/rec/journals/fcsc/WenLWYWMWZZ23
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Hai-Feng Zhang, Weinan Zhang:
Large sequence models for sequential decision-making: a survey. Frontiers Comput. Sci. 17(6): 176349 (2023)
[j2]
persistent URL: https://dblp.org/rec/journals/ijautcomp/MengWLLXZWZWYX23
Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu:
Offline Pre-trained Multi-agent Decision Transformer. Mach. Intell. Res. 20(2): 233-248 (2023)
[j1]
persistent URL: https://dblp.org/rec/journals/jmlr/ZhouWWWW0000023
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. J. Mach. Learn. Res. 24: 150:1-150:12 (2023)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-13945
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang:
Large Sequence Models for Sequential Decision-Making: A Survey. CoRR abs/2306.13945 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-17179
Xidong Feng, Ziyu Wan, Muning Wen, Ying Wen, Weinan Zhang, Jun Wang:
Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training. CoRR abs/2309.17179 (2023)
2022
[c3]
persistent URL: https://dblp.org/rec/conf/iclr/KubaCWWSW022
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. ICLR 2022
[c2]
persistent URL: https://dblp.org/rec/conf/nips/WenKL000022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. NeurIPS 2022
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-14953
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. CoRR abs/2205.14953 (2022)
2021
[c1]
persistent URL: https://dblp.org/rec/conf/nips/KubaWMGZMWY21
Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang:
Settling the Variance of Multi-Agent Policy Gradients. NeurIPS 2021: 13458-13470
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-07551
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. CoRR abs/2106.07551 (2021)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2108-08612
Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang:
Settling the Variance of Multi-Agent Policy Gradients. CoRR abs/2108.08612 (2021)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-11251
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. CoRR abs/2109.11251 (2021)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-02793
Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois C. Knoll, Yaodong Yang:
Multi-Agent Constrained Policy Optimisation. CoRR abs/2110.02793 (2021)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-02845
Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu:
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks. CoRR abs/2112.02845 (2021)
