Youpeng Zhao 0001

Person information

affiliation (former): University of Science and Technology of China (USTC), Department of Electronic Engineering and Information Science, Hefei, China

2020 – today

2025
[j8]
https://dblp.org/rec/journals/ijon/WangJHZD0XZH25
Chenxu Wang, Yonggang Jin, Cheng Hu, Youpeng Zhao, Zipeng Dai, Jian Zhao, Liuyu Xiang, Junge Zhang, Zhaofeng He:
Generalizable agent modeling for agent collaboration-competition adaptation with multi-retrieval and dynamic generation. Neurocomputing 651: 130912 (2025)
[j7]
https://dblp.org/rec/journals/tciaig/ChenLZDZ25
Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao:
CuDA2: An Approach for Incorporating Traitor Agents Into Cooperative Multiagent Systems. IEEE Trans. Games 17(2): 397-407 (2025)
[j6]
https://dblp.org/rec/journals/tciaig/LiuZHCZYMWHLLH25
Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li, Xia Lin, Lanxiao Huang:
Mini Honor of Kings: A Lightweight Environment for Multiagent Reinforcement Learning. IEEE Trans. Games 17(3): 787-796 (2025)
[c5]
https://dblp.org/rec/conf/aaai/HouZ025
Jinbing Hou, Youpeng Zhao, Jian Zhao:
CraftFactory: A Conditioned Control Policy Benchmark for Compositional Generalization. AAAI 2025: 1318-1326
[i8]
https://dblp.org/rec/journals/corr/abs-2506-16718
Chenxu Wang, Yonggang Jin, Cheng Hu, Youpeng Zhao, Zipeng Dai, Jian Zhao, Shiyu Huang, Liuyu Xiang, Junge Zhang, Zhaofeng He:
Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation. CoRR abs/2506.16718 (2025)
2024
[j5]
https://dblp.org/rec/journals/tciaig/ZhaoZHZL24
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li:
Full DouZero+: Improving DouDizhu AI by Opponent Modeling, Coach-Guided Training and Bidding Learning. IEEE Trans. Games 16(3): 518-529 (2024)
[j4]
https://dblp.org/rec/journals/tciaig/ZhaoYZHZL24
Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li:
MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning. IEEE Trans. Games 16(3): 556-565 (2024)
[j3]
https://dblp.org/rec/journals/tciaig/ZhaoLZZL24
Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li:
DanZero+: Dominating the GuanDan Game Through Reinforcement Learning. IEEE Trans. Games 16(4): 914-926 (2024)
[i7]
https://dblp.org/rec/journals/corr/abs-2406-03978
Lin Liu, Jian Zhao, Cheng Hu, Zhengtao Cao, Youpeng Zhao, Zhenbin Ye, Meng Meng, Wenjun Wang, Zhaofeng He, Houqiang Li, Xia Lin, Lanxiao Huang:
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning. CoRR abs/2406.03978 (2024)
[i6]
https://dblp.org/rec/journals/corr/abs-2406-17425
Zhen Chen, Yong Liao, Youpeng Zhao, Zipeng Dai, Jian Zhao:
CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems. CoRR abs/2406.17425 (2024)
[i5]
https://dblp.org/rec/journals/corr/abs-2412-11417
Junjie Lin, Jian Zhao, Lin Liu, Yue Deng, Youpeng Zhao, Lanxiao Huang, Xia Lin, Wengang Zhou, Houqiang Li:
RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement. CoRR abs/2412.11417 (2024)
2023
[j2]
https://dblp.org/rec/journals/tciaig/ZhaoSZZL23
Jian Zhao, Weide Shu, Youpeng Zhao, Wengang Zhou, Houqiang Li:
Improving Deep Reinforcement Learning With Mirror Loss. IEEE Trans. Games 15(3): 337-347 (2023)
[c4]
https://dblp.org/rec/conf/cig/LuZZZL23
Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li:
DanZero: Mastering GuanDan Game with Reinforcement Learning. CoG 2023: 1-8
[c3]
https://dblp.org/rec/conf/ijcnn/HuZZZL23
Xunhan Hu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li:
Q-SAT: Value Factorization with Self-Attention for Deep Multi-Agent Reinforcement Learning. IJCNN 2023: 1-8
[c2]
https://dblp.org/rec/conf/nips/Zhao0LZL23
Youpeng Zhao, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Multi-Agent First Order Constrained Optimization in Policy Space. NeurIPS 2023
[i4]
https://dblp.org/rec/journals/corr/abs-2312-02561
Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, Houqiang Li:
DanZero+: Dominating the GuanDan Game through Reinforcement Learning. CoRR abs/2312.02561 (2023)
2022
[j1]
https://dblp.org/rec/journals/jzusc/ZhaoZWYHZHL22
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents. Frontiers Inf. Technol. Electron. Eng. 23(7): 1032-1042 (2022)
[c1]
https://dblp.org/rec/conf/cig/ZhaoZHZL22
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li:
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning. CoG 2022: 127-134
[i3]
https://dblp.org/rec/journals/corr/abs-2203-08454
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents. CoRR abs/2203.08454 (2022)
[i2]
https://dblp.org/rec/journals/corr/abs-2204-02558
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, Houqiang Li:
DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning. CoRR abs/2204.02558 (2022)
[i1]
https://dblp.org/rec/journals/corr/abs-2210-17087
Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, Houqiang Li:
DanZero: Mastering GuanDan Game with Reinforcement Learning. CoRR abs/2210.17087 (2022)
