


default search action
Yuhui Wang 0001
Person information
- affiliation: King Abdullah University of Science and Technology, Saudi Arabia
- affiliation (former): Nanjing University of Aeronautics & Astronautics, College of Automation Engineering, China
Other persons with the same name
- Yuhui Wang — disambiguation page
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j4]Yao Li, Yuhui Wang
, Xiaoyang Tan:
Highly valued subgoal generation for efficient goal-conditioned reinforcement learning. Neural Networks 181: 106825 (2025) - 2024
- [c11]Yuhui Wang, Weida Li, Francesco Faccio, Qingyuan Wu, Jürgen Schmidhuber:
Highway Value Iteration Networks. ICML 2024 - [c10]Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang:
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays. ICML 2024 - [c9]Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang:
Variational Delayed Policy Optimization. NeurIPS 2024 - [i14]Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang:
Variational Delayed Policy Optimization. CoRR abs/2405.14226 (2024) - [i13]Yuhui Wang, Miroslav Strupl, Francesco Faccio, Qingyuan Wu, Haozhe Liu, Michal Grudzien, Xiaoyang Tan, Jürgen Schmidhuber:
Highway Reinforcement Learning. CoRR abs/2405.18289 (2024) - [i12]Yuhui Wang, Weida Li, Francesco Faccio, Qingyuan Wu, Jürgen Schmidhuber:
Highway Value Iteration Networks. CoRR abs/2406.03485 (2024) - [i11]Yuhui Wang, Qingyuan Wu, Weida Li, Dylan R. Ashley, Francesco Faccio, Chao Huang, Jürgen Schmidhuber:
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning. CoRR abs/2406.08404 (2024) - 2023
- [j3]Xinghu Yao
, Chao Wen
, Yuhui Wang
, Xiaoyang Tan
:
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 34(1): 52-63 (2023) - [c8]Haozhe Liu, Mingchen Zhuge, Bing Li, Yuhui Wang, Francesco Faccio, Bernard Ghanem
, Jürgen Schmidhuber:
Learning to Identify Critical States for Reinforcement Learning from Videos. ICCV 2023: 1955-1965 - [i10]Deyao Zhu
, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny:
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining. CoRR abs/2301.12876 (2023) - [i9]Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piekos, Aditya A. Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanic, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-Ping Fan, Bernard Ghanem, Jürgen Schmidhuber:
Mindstorms in Natural Language-Based Societies of Mind. CoRR abs/2305.17066 (2023) - [i8]Haozhe Liu, Mingchen Zhuge, Bing Li, Yuhui Wang, Francesco Faccio, Bernard Ghanem, Jürgen Schmidhuber:
Learning to Identify Critical States for Reinforcement Learning from Videos. CoRR abs/2308.07795 (2023) - 2022
- [j2]Yao Li, Yuhui Wang, Yaozhong Gan, Xiaoyang Tan:
Alleviating the estimation bias of deep deterministic policy gradient via co-regularization. Pattern Recognit. 131: 108872 (2022) - [c7]Chao Wen, Miao Xu, Zhilin Zhang, Zhenzhe Zheng, Yuhui Wang, Xiangyu Liu, Yu Rong, Dong Xie, Xiaoyang Tan, Chuan Yu, Jian Xu, Fan Wu, Guihai Chen, Xiaoqiang Zhu, Bo Zheng:
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising. WSDM 2022: 1129-1139 - 2021
- [c6]Yuhui Wang, Xiaoyang Tan:
Deep Recurrent Belief Propagation Network for POMDPs. AAAI 2021: 10236-10244 - [i7]Yuhui Wang, Pengcheng He, Xiaoyang Tan:
Greedy Multi-step Off-Policy Reinforcement Learning. CoRR abs/2102.11717 (2021) - [i6]Chao Wen, Miao Xu, Zhilin Zhang, Zhenzhe Zheng, Yuhui Wang, Xiangyu Liu, Yu Rong, Dong Xie, Xiaoyang Tan, Chuan Yu, Jian Xu, Fan Wu, Guihai Chen, Xiaoqiang Zhu:
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising. CoRR abs/2106.06224 (2021) - 2020
- [c5]Chao Wen, Xinghu Yao, Yuhui Wang, Xiaoyang Tan:
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. AAAI 2020: 7301-7308 - [c4]Junting Fang, Xiaoyang Tan, Yuhui Wang:
ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection. ICPR 2020: 423-430
2010 – 2019
- 2019
- [j1]Xin Jin, Yuhui Wang, Xiaoyang Tan:
Pornographic Image Recognition via Weighted Multiple Instance Learning. IEEE Trans. Cybern. 49(12): 4412-4420 (2019) - [c3]Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan:
Trust Region-Guided Proximal Policy Optimization. NeurIPS 2019: 624-634 - [c2]Yuhui Wang, Hao He, Xiaoyang Tan:
Truly Proximal Policy Optimization. UAI 2019: 113-122 - [i5]Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan:
Trust Region-Guided Proximal Policy Optimization. CoRR abs/1901.10314 (2019) - [i4]Xin Jin, Yuhui Wang, Xiaoyang Tan:
Pornographic Image Recognition via Weighted Multiple Instance Learning. CoRR abs/1902.03771 (2019) - [i3]Yuhui Wang, Hao He, Xiaoyang Tan:
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations. CoRR abs/1902.05795 (2019) - [i2]Yuhui Wang, Hao He, Xiaoyang Tan:
Truly Proximal Policy Optimization. CoRR abs/1903.07940 (2019) - [i1]Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan:
SMIX($λ$): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/1911.04094 (2019) - 2016
- [c1]Yuhui Wang, Xin Jin, Xiaoyang Tan:
Pornographic image recognition by strongly-supervised deep multiple instance learning. ICIP 2016: 4418-4422
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-04-16 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint