Search dblp
Full-text search
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal"
- exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics"
- boolean and: separate words by space
e.g., codd model
- boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search results, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
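The query rules listed above (prefix match by default, `$` for exact words, space for and, `|` for or) can be illustrated with a minimal Python sketch. This is not dblp's implementation: the function names `term_matches` and `query_matches` are hypothetical, and splitting the text into words with a simple regular expression is an assumption made only for illustration.

```python
import re


def term_matches(term: str, word: str) -> bool:
    """Match one query term against one word, ignoring case."""
    term, word = term.lower(), word.lower()
    if term.endswith("$"):
        # Exact word search: the trailing dollar sign disables prefix matching.
        return word == term[:-1]
    # Default: case-insensitive prefix search.
    return word.startswith(term)


def query_matches(query: str, text: str) -> bool:
    """Return True if `text` satisfies every space-separated group in `query`."""
    # Assumption: words are maximal runs of word characters.
    words = re.findall(r"\w+", text)
    for group in query.split():  # boolean and: groups separated by spaces
        alternatives = [t for t in group.split("|") if t]  # boolean or
        if not alternatives:
            continue
        if not any(term_matches(t, w) for t in alternatives for w in words):
            return False
    return True
```

For example, `query_matches("graph$", "graphics")` is false because the exact word "graph" does not occur, while `query_matches("sig", "SIGIR")` is true because "sig" is a case-insensitive prefix of "SIGIR".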
Venue search results
no matches
Publication search results
found 54 matches
- 2024
- Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang:
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8762-8782 (2024)
- Xi Zeng, Xiaotian Hao, Hongyao Tang, Zhentao Tang, Shaoqing Jiao, Dazhi Lu, Jiajie Peng:
Designing Biological Sequences without Prior Knowledge Using Evolutionary Reinforcement Learning. AAAI 2024: 383-391
- Shiqi Wang, Enguang Hou, Hongyao Ma, Jiarui Tang, Zhen Shen, Gang Xiong:
Steel Coil Recognition Neural Networks for Intelligent Cranes. ANZCC 2024: 31-36
- Pengyi Li, Yan Zheng, Hongyao Tang, Xian Fu, Jianye Hao:
EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search. ICML 2024
- Pengyi Li, Jianye Hao, Hongyao Tang, Yan Zheng, Fazl Barez:
Value-Evolutionary-Based Reinforcement Learning. ICML 2024
- Pengyi Li, Jianye Hao, Hongyao Tang, Xian Fu, Yan Zheng, Ke Tang:
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey. CoRR abs/2401.11963 (2024)
- Min Zhang, Jianye Hao, Xian Fu, Peilong Han, Hao Zhang, Lei Shi, Hongyao Tang, Yan Zheng:
MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning. CoRR abs/2407.05047 (2024)
- 2023
- Jianye Hao, Pengyi Li, Hongyao Tang, Yan Zheng, Xian Fu, Zhaopeng Meng:
ERL-Re2: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation. ICLR 2023
- Pengyi Li, Jianye Hao, Hongyao Tang, Yan Zheng, Xian Fu:
RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution. ICML 2023: 19490-19503
- Yi Ma, Hongyao Tang, Dong Li, Zhaopeng Meng:
Reining Generalization in Offline Reinforcement Learning via Representation Distinction. NeurIPS 2023
- Hongyao Tang, Min Zhang, Jianye Hao:
The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting. CoRR abs/2303.01391 (2023)
- 2022
- Xiangyu Meng, Xiaozhou Lü, Yaoguang Shi, Hongyao Tang, Weimin Bao:
Pressure-Controlled Thermochromic Electronic Skin With Adjustable Memory Time During Fabrication for In Situ Pressure Display Application. IEEE Trans. Instrum. Meas. 71: 1-9 (2022)
- Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang:
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator. AAAI 2022: 8441-8449
- Yining Li, Tianpei Yang, Jianye Hao, Yan Zheng, Hongyao Tang:
Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator. DAI 2022: 29-44
- Boyan Li, Hongyao Tang, Yan Zheng, Jianye Hao, Pengyi Li, Zhen Wang, Zhaopeng Meng, Li Wang:
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation. ICLR 2022
- Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Wenyuan Tao, Zhen Wang:
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration. ICML 2022: 12979-12997
- Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang:
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations. IJCAI 2022: 3416-3422
- Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Zhen Wang:
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration. CoRR abs/2203.08553 (2022)
- Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang:
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations. CoRR abs/2204.02877 (2022)
- Min Zhang, Hongyao Tang, Jianye Hao, Yan Zheng:
Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes. CoRR abs/2209.07696 (2022)
- Pengyi Li, Hongyao Tang, Jianye Hao, Yan Zheng, Xian Fu, Zhaopeng Meng:
ERL-Re2: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation. CoRR abs/2210.17375 (2022)
- Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao:
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning. CoRR abs/2211.15065 (2022)
- 2021
- Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng:
Addressing Action Oscillations through Learning Policy Inertia. AAAI 2021: 7020-7027
- Haotian Fu, Hongyao Tang, Jianye Hao, Chen Chen, Xidong Feng, Dong Li, Wulong Liu:
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning. AAAI 2021: 7457-7465
- Hongyao Tang, Zhaopeng Meng, Guangyong Chen, Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Jianye Hao:
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction. AAAI 2021: 9834-9842
- Tong Sang, Hongyao Tang, Jianye Hao, Yan Zheng, Zhaopeng Meng:
Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning. DAI 2021: 21-37
- Tianpei Yang, Weixun Wang, Hongyao Tang, Jianye Hao, Zhaopeng Meng, Hangyu Mao, Dong Li, Wulong Liu, Yingfeng Chen, Yujing Hu, Changjie Fan, Chengwei Zhang:
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. NeurIPS 2021: 17037-17048
- Hongyao Tang, Jianye Hao, Guangyong Chen, Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Zhaopeng Meng:
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction. CoRR abs/2103.02225 (2021)
- Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng:
Addressing Action Oscillations through Learning Policy Inertia. CoRR abs/2103.02287 (2021)
- Boyan Li, Hongyao Tang, Yan Zheng, Jianye Hao, Pengyi Li, Zhen Wang, Zhaopeng Meng, Li Wang:
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation. CoRR abs/2109.05490 (2021)
skipping 24 more matches
retrieved on 2024-10-06 14:27 CEST from data curated by the dblp team
all metadata released as open data under CC0 1.0 license