


default search action
Yaodong Yang 0002
Person information
- affiliation: Chinese University of Hong Kong, Hong Kong
- affiliation (former): Huawei Noah's Ark Lab, China
- affiliation (former): Tianjin University, Tianjin, China
Other persons with the same name
- Yaodong Yang 0001
(aka: Adam Yang 0001) — Peking University, Institute for AI, Beijing, China (and 3 more)
- Yaodong Yang 0003 — University of Nebraska - Lincoln, USA
- Yaodong Yang 0004 — University of Science and Technology Beijing, Beijing, China
- Yaodong Yang 0005 — Hefei University of Technology, School of Mathematics, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c20]Yaodong Yang, Guangyong Chen, Hongyao Tang, Furui Liu, Danruo Deng, Pheng-Ann Heng:
Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer. AAMAS 2025: 2226-2234 - [i13]Yaodong Yang, Guangyong Chen, Hongyao Tang, Furui Liu, Danruo Deng, Pheng-Ann Heng:
Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer. CoRR abs/2502.02018 (2025) - 2024
- [c19]Yaodong Yang, Guangyong Chen, Jianye Hao, Pheng-Ann Heng:
Sample-Efficient Multiagent Reinforcement Learning with Reset Replay. ICML 2024 - [c18]Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang
, Xiangyun Liao, Qiong Wang, Bian Wu, Guangyong Chen, Pheng-Ann Heng:
DPPMask: Masked Image Modeling with Determinantal Point Processes. WACV 2024: 2255-2265 - 2023
- [c17]Jianye Hao, Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang:
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks. ICLR 2023 - [i12]Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang, Xiangyun Liao, Bian Wu, Guangyong Chen, Pheng-Ann Heng:
DPPMask: Masked Image Modeling with Determinantal Point Processes. CoRR abs/2303.12736 (2023) - 2022
- [c16]Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang:
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator. AAAI 2022: 8441-8449 - [c15]Yaodong Yang, Guangyong Chen, Weixun Wang, Xiaotian Hao, Jianye Hao, Pheng-Ann Heng:
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing. NeurIPS 2022 - [c14]Zongkai Liu, Chao Yu, Yaodong Yang, Peng Sun, Zifan Wu, Yuan Li:
A Unified Diversity Measure for Multiagent Reinforcement Learning. NeurIPS 2022 - [i11]Xiaotian Hao, Weixun Wang
, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao:
API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks. CoRR abs/2203.05285 (2022) - 2021
- [c13]Hongyao Tang, Zhaopeng Meng, Guangyong Chen, Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Jianye Hao:
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction. AAAI 2021: 9834-9842 - [c12]Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou-Ammar, Jun Wang, Matthew E. Taylor:
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems. AAMAS 2021: 51-56 - [i10]Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou-Ammar, Jun Wang, Matthew E. Taylor:
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems. CoRR abs/2102.07659 (2021) - [i9]Hongyao Tang, Jianye Hao, Guangyong Chen, Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Zhaopeng Meng:
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction. CoRR abs/2103.02225 (2021) - [i8]Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment. CoRR abs/2106.00517 (2021) - 2020
- [j3]Leilei Liu, Xianglei Zhu, Yi Ma, Haiyin Piao
, Yaodong Yang, Xiaotian Hao, Yue Fu, Li Wang, Jiajie Peng
:
Combining sequence and network information to enhance protein-protein interaction prediction. BMC Bioinform. 21-S(16): 537 (2020) - [c11]Ming Zhou
, Jun Luo, Julian Villela, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Zhengbang Zhu, Yihan Ni, Nhat M. Nguyen, Mohamed Elsayed, Haitham Ammar, Alexander I. Cowen-Rivers, Sanjeevan Ahilan, Zheng Tian, Daniel Palenicek, Kasra Rezaee, Peyman Yadmellat, Kun Shao, Dong Chen, Baokuan Zhang, Hongbo Zhang, Jianye Hao, Wulong Liu, Jun Wang:
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving. CoRL 2020: 264-285 - [c10]Yaodong Yang, Jianye Hao, Guangyong Chen, Hongyao Tang, Yingfeng Chen, Yujing Hu, Changjie Fan, Zhongyu Wei:
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning. ICML 2020: 10706-10715 - [i7]Yaodong Yang, Jianye Hao, Ben Liao, Kun Shao, Guangyong Chen, Wulong Liu, Hongyao Tang:
Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning. CoRR abs/2002.03939 (2020) - [i6]Yaodong Yang, Jianye Hao, Guangyong Chen, Hongyao Tang, Yingfeng Chen, Yujing Hu, Changjie Fan, Zhongyu Wei:
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning. CoRR abs/2002.03950 (2020) - [i5]Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Wulong Liu, Yaodong Yang:
What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator. CoRR abs/2010.09536 (2020) - [i4]Ming Zhou, Jun Luo, Julian Villela, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat M. Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander I. Cowen-Rivers, Zheng Tian, Daniel Palenicek
, Haitham Bou-Ammar, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang:
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving. CoRR abs/2010.09776 (2020)
2010 – 2019
- 2019
- [j2]Hanxu Hou, Tian Gan, Yaodong Yang, Xianglei Zhu, Sen Liu, Weiming Guo, Jianye Hao:
Using deep reinforcement learning to speed up collective cell migration. BMC Bioinform. 20-S(18): 571:1-571:10 (2019) - [j1]Xianglei Zhu, Bofeng Fu, Yaodong Yang, Yu Ma, Jianye Hao, Siqi Chen, Shuang Liu, Tiegang Li, Sen Liu, Weiming Guo, Zhenyu Liao:
Attention-based recurrent neural network for influenza epidemic prediction. BMC Bioinform. 20-S(18): 575:1-575:10 (2019) - [c9]Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. AAMAS 2019: 1315-1323 - [c8]Yaodong Yang, Jianye Hao, Yan Zheng, Xiaotian Hao, Bofeng Fu:
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Reinforcement Learning Framework. AAMAS 2019: 2285-2287 - [c7]Leilei Liu, Yi Ma, Xianglei Zhu, Yaodong Yang
, Xiaotian Hao, Li Wang, Jiajie Peng:
Integrating Sequence and Network Information to Enhance Protein-Protein Interaction Prediction Using Graph Convolutional Networks. BIBM 2019: 1762-1768 - [c6]Yaodong Yang
, Jianye Hao, Yan Zheng
, Chao Yu:
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework. IJCAI 2019: 630-636 - [i3]Hongyao Tang, Jianye Hao, Guangyong Chen, Pengfei Chen, Zhaopeng Meng, Yaodong Yang, Li Wang:
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction. CoRR abs/1905.11100 (2019) - [i2]Yi Ma, Jianye Hao, Yaodong Yang, Han Li, Junqi Jin, Guangyong Chen:
Spectral-based Graph Convolutional Network for Directed Graphs. CoRR abs/1907.08990 (2019) - [i1]Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. CoRR abs/1909.11468 (2019) - 2018
- [c5]Yaodong Yang, Jianye Hao, Zan Wang, Mingyang Sun, Goran Strbac:
Recurrent Deep Multiagent Q-Learning for Autonomous Agents in Future Smart Grid. AAMAS 2018: 2136-2138 - [c4]Bofeng Fu, Yaodong Yang
, Yu Ma, Jianye Hao, Siqi Chen, Shuang Liu, Tiegang Li, Zhenyu Liao, Xianglei Zhu:
Attention-Based Recurrent Multi-Channel Neural Network for Influenza Epidemic Prediction. BIBM 2018: 1245-1248 - [c3]Tian Gan, Yaodong Yang
, Jianye Hao, Zhenyu Liao, Xianglei Zhu:
Speeding up Collective Cell Migration Using Deep Reinforcement Learning. BIBM 2018: 1277-1280 - [c2]Yaodong Yang
, Jianye Hao, Mingyang Sun, Zan Wang, Changjie Fan, Goran Strbac:
Recurrent Deep Multiagent Q-Learning for Autonomous Brokers in Smart Grid. IJCAI 2018: 569-575 - 2017
- [c1]Xianglei Zhu, Shuai Zhao, Yaodong Yang, Hongyao Tang, Zan Wang, Jianye Hao:
A real-time ensemble classification algorithm for time series data. ICA 2017: 145-150
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-10-23 02:03 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint