default search action

combined dblp search
author search
venue search
publication search

ask others

Weixun Wang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Journal Articles

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/tciaig/ZhouZSDLHWWLLH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tciaig/ZhouZSDLHWWLLH24
Tianze Zhou, Fubiao Zhang, Kun Shao, Zipeng Dai, Kai Li, Wenhan Huang, Weixun Wang, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition. IEEE Trans. Games 16(2): 352-364 (2024)
2023
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/aamas/YangWHTLHHCFRHZG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aamas/YangWHTLHHCFRHZG23
Tianpei Yang, Weixun Wang, Jianye Hao, Matthew E. Taylor, Yong Liu, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Chunxu Ren, Ye Huang, Jiangcheng Zhu, Yang Gao:
ASN: action semantics network for multiagent reinforcement learning. Auton. Agents Multi Agent Syst. 37(2): 45 (2023)
[j8]
- view
  - electronic edition @ jmlr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/HuZGW0L0C023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/HuZGW0L0C023
Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang:
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. J. Mach. Learn. Res. 24: 315:1-315:23 (2023)
2022
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/jzusc/ZhaoZWYHZHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/ZhaoZWYHZHL22
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents. Frontiers Inf. Technol. Electron. Eng. 23(7): 1032-1042 (2022)
2012
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/suscom/WangRM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/suscom/WangRM12
Weixun Wang, Sanjay Ranka, Prabhat Mishra:
Energy-aware dynamic slack allocation for real-time multitasking systems. Sustain. Comput. Informatics Syst. 2(3): 128-137 (2012)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tcad/QinWM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcad/QinWM12
Xiaoke Qin, Weixun Wang, Prabhat Mishra:
TCEC: Temperature and Energy-Constrained Scheduling in Real-Time Multitasking Systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 31(8): 1159-1168 (2012)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/tecs/WangMG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tecs/WangMG12
Weixun Wang, Prabhat Mishra, Ann Gordon-Ross:
Dynamic Cache Reconfiguration for Soft Real-Time Systems. ACM Trans. Embed. Comput. Syst. 11(2): 28:1-28:31 (2012)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tvlsi/WangM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tvlsi/WangM12
Weixun Wang, Prabhat Mishra:
System-Wide Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Multitasking Systems. IEEE Trans. Very Large Scale Integr. Syst. 20(5): 902-910 (2012)
2011
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jolpe/WangM11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jolpe/WangM11
Weixun Wang, Prabhat Mishra:
Dynamic Reconfiguration of Two-Level Cache Hierarchy in Real-Time Embedded Systems. J. Low Power Electron. 7(1): 17-28 (2011)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/suscom/WangRM11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/suscom/WangRM11
Weixun Wang, Sanjay Ranka, Prabhat Mishra:
Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems. Sustain. Comput. Informatics Syst. 1(1): 35-45 (2011)

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WuHYHZWT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WuHYHZWT24
Jizhou Wu, Jianye Hao, Tianpei Yang, Xiaotian Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAAI 2024: 15934-15942
2023
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/0001WW0HORHCF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/0001WW0HORHCF23
Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan:
Off-Beat Multi-Agent Reinforcement Learning. AAMAS 2023: 2424-2426
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/WuYHHZWT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/WuYHHZWT23
Jizhou Wu, Tianpei Yang, Xiaotian Hao, Jianye Hao, Yan Zheng, Weixun Wang, Matthew E. Taylor:
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. AAMAS 2023: 2460-2462
[c22]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/HaoHMW00ZW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/HaoHMW00ZW23
Jianye Hao, Xiaotian Hao, Hangyu Mao, Weixun Wang, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang:
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks. ICLR 2023
2022
[c21]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/WangZHWZGHLF22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangZHWZGHLF22
Li Wang, Yupeng Zhang, Yujing Hu, Weixun Wang, Chongjie Zhang, Yang Gao, Jianye Hao, Tangjie Lv, Changjie Fan:
Individual Reward Assisted Multi-Agent Reinforcement Learning. ICML 2022: 23417-23432
[c20]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/0001CWHHH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001CWHHH22
Yaodong Yang, Guangyong Chen, Weixun Wang, Xiaotian Hao, Jianye Hao, Pheng-Ann Heng:
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing. NeurIPS 2022
2021
[c19]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YangWTHMMLLCHFZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangWTHMMLLCHFZ21
Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang:
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. NeurIPS 2021: 17037-17048
2020
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiuWHHC020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiuWHHC020
Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao:
Multi-Agent Game Abstraction via Graph Attention Neural Network. AAAI 2020: 7211-7218
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangYLHHHCFG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangYLHHHCFG20
Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning. AAAI 2020: 7293-7300
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/YangHMZHCFWWP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/YangHMZHCFWWP20
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. AAMAS 2020: 2053-2055
[c15]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/WangYLHHHCFG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WangYLHHHCFG20
Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. ICLR 2020
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/ZhangHWTMDZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/ZhangHWTMDZ20
Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. IJCAI 2020: 2291-2297
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/YangHMZHCFWLWP20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/YangHMZHCFWLWP20
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Yujing Hu, Yingfeng Chen, Changjie Fan, Weixun Wang, Wulong Liu, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer. IJCAI 2020: 3094-3100
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/HaoJHLWMZLXG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HaoJHLWMZLXG20
Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. IJCAI 2020: 3437-3443
[c11]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/HuWJWCH0F20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuWJWCH0F20
Yujing Hu, Weixun Wang, Hangtian Jia, Yixiang Wang, Yingfeng Chen, Jianye Hao, Feng Wu, Changjie Fan:
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. NeurIPS 2020
2019
[c10]
- view
  - electronic edition @ acm.org
  - no references & citations available
- export record
  dblp key:
  - conf/atal/HaoWHY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/HaoWHY19
Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. AAMAS 2019: 1315-1323
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/cikm/WangJH0YZWHWLXG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cikm/WangJH0YZWHWLXG19
Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Xiaotian Hao, Yixi Wang, Han Li, Jian Xu, Kun Gai:
Learning Adaptive Display Exposure for Real-Time Advertising. CIKM 2019: 2595-2603
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/dai2/WangHWT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dai2/WangHWT19
Weixun Wang, Jianye Hao, Yixi Wang, Matthew E. Taylor:
Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas. DAI 2019: 11:1-11:7
2011
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/WangMR11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/WangMR11
Weixun Wang, Prabhat Mishra, Sanjay Ranka:
Dynamic cache reconfiguration and partitioning for energy optimization in real-time multi-core systems. DAC 2011: 948-953
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/vlsid/WangRM11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vlsid/WangRM11
Weixun Wang, Sanjay Ranka, Prabhat Mishra:
A General Algorithm for Energy-Aware Dynamic Reconfiguration in Multitasking Systems. VLSI Design 2011: 334-339
2010
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/dac/WangM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dac/WangM10
Weixun Wang, Prabhat Mishra:
PreDVS: preemptive dynamic voltage scaling for real-time systems using approximation scheme. DAC 2010: 705-710
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/islped/WangQM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/islped/WangQM10
Weixun Wang, Xiaoke Qin, Prabhat Mishra:
Temperature- and energy-constrained scheduling in multitasking systems: a model checking approach. ISLPED 2010: 85-90
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/vlsid/WangM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vlsid/WangM10
Weixun Wang, Prabhat Mishra:
Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Real-Time Systems. VLSI Design 2010: 357-362
2009
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/isvlsi/WangM09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isvlsi/WangM09
Weixun Wang, Prabhat Mishra:
Dynamic Reconfiguration of Two-Level Caches in Soft Real-Time Embedded Systems. ISVLSI 2009: 145-150
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/vlsid/WangMG09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vlsid/WangMG09
Weixun Wang, Prabhat Mishra, Ann Gordon-Ross:
SACR: Scheduling-Aware Cache Reconfiguration for Real-Time Embedded Systems. VLSI Design 2009: 547-552

Reference Works

see FAQ

What is the meaning of the colors in the publication lists?

2012
[r1]
- view
  authority control:
- export record
  dblp key:
  - reference/crc/WangQM12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/reference/crc/WangQM12
Weixun Wang, Xiaoke Qin, Prabhat Mishra:
Energy-Aware Scheduling and Dynamic Reconfiguration in Real-Time Systems. Handbook of Energy-Aware and Green Computing 2012: 543-572

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-17031
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-17031
Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall:
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization. CoRR abs/2403.17031 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-11143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-11143
Jian Hu, Xibin Wu, Weixun Wang, Xianyu, Dehao Zhang, Yu Cao:
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework. CoRR abs/2405.11143 (2024)
2022
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-04427
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04427
Jian Zhao, Yue Zhang, Xunhan Hu, Weixun Wang, Wengang Zhou, Jianye Hao, Jiangcheng Zhu, Houqiang Li:
Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization. CoRR abs/2202.04427 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-05285
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-05285
Xiaotian Hao, Weixun Wang, Hangyu Mao, Yaodong Yang, Dong Li, Yan Zheng, Zhen Wang, Jianye Hao:
API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks. CoRR abs/2203.05285 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-08454
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-08454
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, Houqiang Li:
Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents. CoRR abs/2203.08454 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-09123
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-09123
Shengyi Huang, Anssi Kanervisto, Antonin Raffin, Weixun Wang, Santiago Ontañón, Rousslan Fernand Julien Dossa:
A2C is a special case of PPO. CoRR abs/2205.09123 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-13718
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-13718
Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan:
Off-Beat Multi-Agent Reinforcement Learning. CoRR abs/2205.13718 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13708
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13708
Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Zhihui Li, Xiaodan Liang, Xiaojun Chang, Yaodong Yang:
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. CoRR abs/2210.13708 (2022)
2021
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-00517
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00517
Tianze Zhou, Fubiao Zhang, Kun Shao, Kai Li, Wenhan Huang, Jun Luo, Weixun Wang, Yaodong Yang, Hangyu Mao, Bin Wang, Dong Li, Wulong Liu, Jianye Hao:
Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment. CoRR abs/2106.00517 (2021)
2020
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-07418
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-07418
Peng Zhang, Jianye Hao, Weixun Wang, Hongyao Tang, Yi Ma, Yihai Duan, Yan Zheng:
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. CoRR abs/2002.07418 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08030
Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Wulong Liu, Yujing Hu, Yingfeng Chen:
Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework. CoRR abs/2002.08030 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-08037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-08037
Tianpei Yang, Jianye Hao, Zhaopeng Meng, Zongzhang Zhang, Weixun Wang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhaodong Wang, Jiajie Peng:
Efficient Deep Reinforcement Learning through Policy Transfer. CoRR abs/2002.08037 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-04355
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-04355
Xiaotian Hao, Junqi Jin, Jianye Hao, Jin Li, Weixun Wang, Yi Ma, Zhenzhe Zheng, Han Li, Jian Xu, Kun Gai:
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. CoRR abs/2005.04355 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02669
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02669
Yujing Hu, Weixun Wang, Hangtian Jia, Yixiang Wang, Yingfeng Chen, Jianye Hao, Feng Wu, Changjie Fan:
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. CoRR abs/2011.02669 (2020)
2019
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1907-11461
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-11461
Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. CoRR abs/1907.11461 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-02790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-02790
Weixun Wang, Tianpei Yang, Yong Liu, Jianye Hao, Xiaotian Hao, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao:
From Few to More: Large-scale Dynamic Multiagent Curriculum Learning. CoRR abs/1909.02790 (2019)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11468
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11468
Xiaotian Hao, Weixun Wang, Jianye Hao, Yaodong Yang:
Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. CoRR abs/1909.11468 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1911-10715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-10715
Yong Liu, Weixun Wang, Yujing Hu, Jianye Hao, Xingguo Chen, Yang Gao:
Multi-Agent Game Abstraction via Graph Attention Neural Network. CoRR abs/1911.10715 (2019)
2018
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1803-00162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-00162
Weixun Wang, Jianye Hao, Yixi Wang, Matthew E. Taylor:
Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach. CoRR abs/1803.00162 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-03149
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-03149
Weixun Wang, Junqi Jin, Jianye Hao, Chunjie Chen, Chuan Yu, Weinan Zhang, Jun Wang, Yixi Wang, Han Li, Jian Xu, Kun Gai:
Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning. CoRR abs/1809.03149 (2018)
2012
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1211-1736
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1211-1736
Kanad Basu, Subrata Mitra, Srishti Mukherjee, Weixun Wang:
A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking. CoRR abs/1211.1736 (2012)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.