default search action

combined dblp search
author search
venue search
publication search

ask others

Yuhui Wang 0001

> Home > Persons

Person information

affiliation: King Abdullah University of Science and Technology, Saudi Arabia
affiliation (former): Nanjing University of Aeronautics & Astronautics, College of Automation Engineering, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/LiWT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/LiWT25
Yao Li, Yuhui Wang, Xiaoyang Tan:
Highly valued subgoal generation for efficient goal-conditioned reinforcement learning. Neural Networks 181: 106825 (2025)
2024
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WangLFWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangLFWS24
Yuhui Wang, Weida Li, Francesco Faccio, Qingyuan Wu, Jürgen Schmidhuber:
Highway Value Iteration Networks. ICML 2024
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WuZ0WLL0S024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WuZ0WLL0S024
Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang:
Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays. ICML 2024
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/WuZWWLLZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuZWWLLZH24
Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang:
Variational Delayed Policy Optimization. NeurIPS 2024
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-14226
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14226
Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang:
Variational Delayed Policy Optimization. CoRR abs/2405.14226 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18289
Yuhui Wang, Miroslav Strupl, Francesco Faccio, Qingyuan Wu, Haozhe Liu, Michal Grudzien, Xiaoyang Tan, Jürgen Schmidhuber:
Highway Reinforcement Learning. CoRR abs/2405.18289 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-03485
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03485
Yuhui Wang, Weida Li, Francesco Faccio, Qingyuan Wu, Jürgen Schmidhuber:
Highway Value Iteration Networks. CoRR abs/2406.03485 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08404
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08404
Yuhui Wang, Qingyuan Wu, Weida Li, Dylan R. Ashley, Francesco Faccio, Chao Huang, Jürgen Schmidhuber:
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning. CoRR abs/2406.08404 (2024)
2023
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tnn/YaoWWT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnn/YaoWWT23
Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan:
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning. IEEE Trans. Neural Networks Learn. Syst. 34(1): 52-63 (2023)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/LiuZLWFGS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LiuZLWFGS23
Haozhe Liu, Mingchen Zhuge, Bing Li, Yuhui Wang, Francesco Faccio, Bernard Ghanem, Jürgen Schmidhuber:
Learning to Identify Critical States for Reinforcement Learning from Videos. ICCV 2023: 1955-1965
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12876
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12876
Deyao Zhu, Yuhui Wang, Jürgen Schmidhuber, Mohamed Elhoseiny:
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining. CoRR abs/2301.12876 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17066
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17066
Mingchen Zhuge, Haozhe Liu, Francesco Faccio, Dylan R. Ashley, Róbert Csordás, Anand Gopalakrishnan, Abdullah Hamdi, Hasan Abed Al Kader Hammoud, Vincent Herrmann, Kazuki Irie, Louis Kirsch, Bing Li, Guohao Li, Shuming Liu, Jinjie Mai, Piotr Piekos, Aditya A. Ramesh, Imanol Schlag, Weimin Shi, Aleksandar Stanic, Wenyi Wang, Yuhui Wang, Mengmeng Xu, Deng-Ping Fan, Bernard Ghanem, Jürgen Schmidhuber:
Mindstorms in Natural Language-Based Societies of Mind. CoRR abs/2305.17066 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-07795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-07795
Haozhe Liu, Mingchen Zhuge, Bing Li, Yuhui Wang, Francesco Faccio, Bernard Ghanem, Jürgen Schmidhuber:
Learning to Identify Critical States for Reinforcement Learning from Videos. CoRR abs/2308.07795 (2023)
2022
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/LiWGT22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/LiWGT22
Yao Li, Yuhui Wang, Yaozhong Gan, Xiaoyang Tan:
Alleviating the estimation bias of deep deterministic policy gradient via co-regularization. Pattern Recognit. 131: 108872 (2022)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/wsdm/WenXZZWLRXTYXWC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wsdm/WenXZZWLRXTYXWC22
Chao Wen, Miao Xu, Zhilin Zhang, Zhenzhe Zheng, Yuhui Wang, Xiangyu Liu, Yu Rong, Dong Xie, Xiaoyang Tan, Chuan Yu, Jian Xu, Fan Wu, Guihai Chen, Xiaoqiang Zhu, Bo Zheng:
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising. WSDM 2022: 1129-1139
2021
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangT21
Yuhui Wang, Xiaoyang Tan:
Deep Recurrent Belief Propagation Network for POMDPs. AAAI 2021: 10236-10244
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11717
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11717
Yuhui Wang, Pengcheng He, Xiaoyang Tan:
Greedy Multi-step Off-Policy Reinforcement Learning. CoRR abs/2102.11717 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-06224
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06224
Chao Wen, Miao Xu, Zhilin Zhang, Zhenzhe Zheng, Yuhui Wang, Xiangyu Liu, Yu Rong, Dong Xie, Xiaoyang Tan, Chuan Yu, Jian Xu, Fan Wu, Guihai Chen, Xiaoqiang Zhu:
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising. CoRR abs/2106.06224 (2021)
2020
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WenYWT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WenYWT20
Chao Wen, Xinghu Yao, Yuhui Wang, Xiaoyang Tan:
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. AAAI 2020: 7301-7308
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icpr/FangTW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icpr/FangTW20
Junting Fang, Xiaoyang Tan, Yuhui Wang:
ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection. ICPR 2020: 423-430

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tcyb/JinWT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcyb/JinWT19
Xin Jin, Yuhui Wang, Xiaoyang Tan:
Pornographic Image Recognition via Weighted Multiple Instance Learning. IEEE Trans. Cybern. 49(12): 4412-4420 (2019)
[c3]
- view
- export record
  dblp key:
  - conf/nips/WangHTG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangHTG19
Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan:
Trust Region-Guided Proximal Policy Optimization. NeurIPS 2019: 624-634
[c2]
- view
- export record
  dblp key:
  - conf/uai/WangHT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/WangHT19
Yuhui Wang, Hao He, Xiaoyang Tan:
Truly Proximal Policy Optimization. UAI 2019: 113-122
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-10314
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-10314
Yuhui Wang, Hao He, Xiaoyang Tan, Yaozhong Gan:
Trust Region-Guided Proximal Policy Optimization. CoRR abs/1901.10314 (2019)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-03771
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-03771
Xin Jin, Yuhui Wang, Xiaoyang Tan:
Pornographic Image Recognition via Weighted Multiple Instance Learning. CoRR abs/1902.03771 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-05795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-05795
Yuhui Wang, Hao He, Xiaoyang Tan:
Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations. CoRR abs/1902.05795 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-07940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-07940
Yuhui Wang, Hao He, Xiaoyang Tan:
Truly Proximal Policy Optimization. CoRR abs/1903.07940 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-04094
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-04094
Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan:
SMIX($λ$): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/1911.04094 (2019)
2016
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icip/WangJT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icip/WangJT16
Yuhui Wang, Xin Jin, Xiaoyang Tan:
Pornographic image recognition by strongly-supervised deep multiple instance learning. ICIP 2016: 4418-4422

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.