


Остановите войну!
for scientists:


default search action
Yang Yu 0001
Person information

- affiliation (PhD 2011): Nanjing University, State Key Laboratory for Novel Software Technology, China
- affiliation: Pazhou Lab, Guangzhou, China
Other persons with the same name
- Yang Yu — disambiguation page
- Yang Yu 0002
— University of Technology Sydney, Faculty of Engineering and Information Technology, NSW, Australia (and 1 more)
- Yang Yu 0003
— North China Electric Power University, State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, Baoding, China
- Yang Yu 0004
— Rochester Institute of Technology, Saunders College of Business, Rochester, NY, USA (and 1 more)
- Yang Yu 0005
— Jiangsu University of Technology, School of Electric Information Engineering, Changzhou, China
- Yang Yu 0006
— National University of Defense Technology, College of Electrical Science and Engineering, National Key Laboratory of Science and Technology on ATR, Changsha, China
- Yang Yu 0007
— National University of Defense Technology, College of Computer, Changsha, China
- Yang Yu 0008
— Tsinghua University, Department of Computer Science and Technology, Beijing, China (and 1 more)
- Yang Yu 0009 — Motorola Labs, Schaumburg, IL, USA (and 1 more)
- Yang Yu 0010 — Rutgers University, Department of Computer Science, Piscataway, NJ, USA
- Yang Yu 0011
— Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China (and 1 more)
- Yang Yu 0012 — University of Sheffield, UK
- Yang Yu 0013
— University of Toyama, Faculty of Engineering, Toyama, Japan
- Yang Yu 0014
— National University of Defense Technology, College of Intelligence Science and Technology, Changsha, China (and 1 more)
- Yang Yu 0015
— Harbin Institute of Technology, Department of Automatic Test and Control, Harbin, China
- Yang Yu 0016
— Northeastern University, College of Information Science and Engineering, Shenyang, China
- Yang Yu 0017
— Changchun University of Technology, School of Mechatronic Engineering, Changchun, China
- Yang Yu 0018
— Harbin Jiancheng Group Company, Harbin, China
- Yang Yu 0019
— Shanghai Jiao Tong University, School of Mechanical Engineering, State Key Laboratory of Mechanical System and Vibration, Shanghai, China
- Yang Yu 0020
— Tongji University, State Key Laboratory of Marine Geology, Shanghai, China
- Yang Yu 0021
— University of Technology Sydney, School of Civil and Environmental Engineering, Sydney, Australia
- Yang Yu 0022
— Hebei University of Technology, School of Computer Science and Engineering, Tianjin, China
- Yang Yu 0023
— Wuhan University, School of Urban Design, Department of Urban Planning, Wuha, China
- Yang Yu 0024
— Tongji University, Department of Control Science and Engineering, Shanghai, China
- Yang Yu 0025 — Rutgers University, Department of Mathematics, Piscataway, NJ, USA
- Yang Yu 0026
— China Agricultural University, College of Engineering, Beijing, China
- Yang Yu 0027
— Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
- Yang Yu 0028
— Hong Kong University of Science and Technology, Department of Electronic and Computer Engineering, Robotics and Multi-Perception Laborotary, Hong Kong
- Yang Yu 0029 — Google, Mountain View, CA, USA (and 3 more)
- Yang Yu 0030
— Tianjin University, College of Intelligence and Computing, China
- Yang Yu 0031
— Southwest Forestry University, School of Machinery and Transportation, Kunming, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j32]Hua Yang
, Minghao Zhao, Lei Yuan, Yang Yu, Zhenhua Li
, Ming Gu:
Memory-efficient Transformer-based network model for Traveling Salesman Problem. Neural Networks 161: 589-597 (2023) - [j31]Guangda Huzhang
, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen, Wen-Ji Zhou
, Qianying Lin, Qing Da, Anxiang Zeng
, Han Yu
, Yang Yu
, Zhi-Hua Zhou
:
AliExpress Learning-to-Rank: Maximizing Online Model Performance Without Going Online. IEEE Trans. Knowl. Data Eng. 35(2): 1214-1226 (2023) - [c95]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. AAMAS 2023: 476-484 - [c94]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. AAMAS 2023: 1276-1284 - [i67]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. CoRR abs/2301.02083 (2023) - [i66]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Theoretical Analysis of Offline Imitation With Supplementary Dataset. CoRR abs/2301.11687 (2023) - [i65]Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Yang Yu:
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation. CoRR abs/2302.09368 (2023) - [i64]Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning. CoRR abs/2302.09605 (2023) - [i63]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. CoRR abs/2303.02073 (2023) - [i62]Zheng-Mao Zhu, Yu-Ren Liu, Hong-Long Tian, Yang Yu, Kun Zhang:
Beware of Instantaneous Dependence in Reinforcement Learning. CoRR abs/2303.05458 (2023) - [i61]Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. CoRR abs/2305.04832 (2023) - [i60]Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation. CoRR abs/2305.05116 (2023) - [i59]Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers. CoRR abs/2305.05909 (2023) - [i58]Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. CoRR abs/2305.05911 (2023) - [i57]Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu:
Robust Multi-agent Communication via Multi-view Message Certification. CoRR abs/2305.13936 (2023) - [i56]Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu:
Multi-agent Continual Coordination via Progressive Task Contextualization. CoRR abs/2305.13937 (2023) - 2022
- [j30]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Chao Qian, Yang Yu:
ZOOpt: a toolbox for derivative-free optimization. Sci. China Inf. Sci. 65(10) (2022) - [j29]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. J. Artif. Intell. Res. 75: 213-260 (2022) - [j28]Yi-Feng Zhang
, Fan-Ming Luo, Yang Yu:
Improve generated adversarial imitation learning with reward variance regularization. Mach. Learn. 111(3): 977-995 (2022) - [j27]Yi-Qi Hu
, Xu-Hui Liu
, Shu-Qiao Li
, Yang Yu:
Cascaded Algorithm Selection With Extreme-Region UCB Bandit. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6782-6794 (2022) - [j26]Tian Xu
, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments for Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6968-6980 (2022) - [j25]Ruo-Ze Liu
, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zhen-Jia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu
:
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning. IEEE Trans. Games 14(2): 294-307 (2022) - [j24]Xin Jin, Yanping Xie, Xiu-Shen Wei
, Borui Zhao, Yongshun Zhang, Xiaoyang Tan
, Yang Yu:
A Lightweight Encoder-Decoder Path for Deep Residual Networks. IEEE Trans. Neural Networks Learn. Syst. 33(2): 866-878 (2022) - [c93]Fan-Ming Luo, Shengyi Jiang, Yang Yu, Zongzhang Zhang, Yi-Feng Zhang:
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy. AAAI 2022: 7637-7646 - [c92]Zheng-Mao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang:
Invariant Action Effect Model for Reinforcement Learning. AAAI 2022: 9260-9268 - [c91]Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang:
Multi-Agent Incentive Communication via Decentralized Teammate Modeling. AAAI 2022: 9466-9474 - [c90]Yang Yu, Rui Jin, Hao Yin, Keke Gai, Zijian Zhang:
A Searchable Re-encryption-based Scheme for Massive Data Transactions. CSCloud/EdgeCom 2022: 135-140 - [c89]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022 - [c88]Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang:
Active Hierarchical Exploration with Stable Subgoal Representation Learning. ICLR 2022 - [c87]Hang Zhao, Yang Yu, Kai Xu:
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees. ICLR 2022 - [c86]Hong Qian, Xu-Hui Liu, Chen-Xi Su, Aimin Zhou, Yang Yu:
The Teaching Dimension of Regularized Kernel Learners. ICML 2022: 17984-18002 - [c85]Di Xue, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Multi-Agent Communication via Shapley Message Value. IJCAI 2022: 578-584 - [c84]Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Concentrative Coordination with Decentralized Task Representation. IJCAI 2022: 599-605 - [c83]Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. NeurIPS 2022 - [c82]Cong Guan, Feng Chen, Lei Yuan, Chenghe Wang, Hao Yin, Zongzhang Zhang, Yang Yu:
Efficient Multi-agent Communication via Self-supervised Information Aggregation. NeurIPS 2022 - [c81]Rongjun Qin, Xingyuan Zhang, Songyi Gao, Xiong-Hui Chen, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. NeurIPS 2022 - [c80]Chenyang Wu, Tianci Li, Zongzhang Zhang, Yang Yu:
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning. NeurIPS 2022 - [e4]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13280, Springer 2022, ISBN 978-3-031-05932-2 [contents] - [e3]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13281, Springer 2022, ISBN 978-3-031-05935-3 [contents] - [e2]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part III. Lecture Notes in Computer Science 13282, Springer 2022, ISBN 978-3-031-05980-3 [contents] - [i55]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Rethinking ValueDice: Does It Really Improve Performance? CoRR abs/2202.02468 (2022) - [i54]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022) - [i53]Ziniu Li, Tian Xu, Yang Yu:
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle. CoRR abs/2203.11489 (2022) - [i52]Fan-Ming Luo, Xingchen Cao, Yang Yu:
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble. CoRR abs/2206.00238 (2022) - [i51]Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu:
Offline Reinforcement Learning with Causal Structured World Models. CoRR abs/2206.01474 (2022) - [i50]Xue-Kun Jin, Xu-Hui Liu, Shengyi Jiang, Yang Yu:
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning. CoRR abs/2206.02000 (2022) - [i49]Xiong-Hui Chen, Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. CoRR abs/2206.04890 (2022) - [i48]Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A Survey on Model-based Reinforcement Learning. CoRR abs/2206.09328 (2022) - [i47]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis. CoRR abs/2208.01899 (2022) - [i46]Ke Xue, Yutong Wang, Lei Yuan, Cong Guan, Chao Qian, Yang Yu:
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution. CoRR abs/2208.04957 (2022) - [i45]Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu:
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games. CoRR abs/2208.09452 (2022) - [i44]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. CoRR abs/2209.11553 (2022) - [i43]Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. CoRR abs/2210.05662 (2022) - [i42]Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. CoRR abs/2210.06835 (2022) - [i41]Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. CoRR abs/2212.05399 (2022) - 2021
- [j23]Anxiang Zeng, Han Yu, Qing Da, Yusen Zhan, Yang Yu, Jingren Zhou, Chunyan Miao:
Improving Search Engine Efficiency through Contextual Factor Selection. AI Mag. 42(2): 50-58 (2021) - [j22]Chao Qian
, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. Algorithmica 83(4): 940-975 (2021) - [j21]Chao Bian, Chao Qian, Yang Yu, Ke Tang:
On the robustness of median sampling in noisy evolutionary optimization. Sci. China Inf. Sci. 64(5) (2021) - [j20]Lei Bu
, Yongjuan Liang, Zhunyi Xie, Hong Qian, Yi-Qi Hu, Yang Yu, Xin Chen, Xuandong Li:
Machine learning steered symbolic execution framework for complex software code. Formal Aspects Comput. 33(3): 301-323 (2021) - [j19]Wenjie Shang
, Qingyang Li, Zhiwei (Tony) Qin, Yang Yu, Yiping Meng, Jieping Ye:
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Mach. Learn. 110(9): 2603-2640 (2021) - [j18]Hugo Jair Escalante
, Quanming Yao
, Wei-Wei Tu, Nelishia Pillay, Rong Qu
, Yang Yu, Neil Houlsby:
Guest Editorial: Automated Machine Learning. IEEE Trans. Pattern Anal. Mach. Intell. 43(9): 2887-2890 (2021) - [c79]Chenyang Wu, Rui Kong, Guoyu Yang, Xianghan Kong, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu:
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract). AAAI 2021: 15927-15928 - [c78]Feng Xu, Shengyi Jiang, Hao Yin, Zongzhang Zhang, Yang Yu, Ming Li, Dong Li, Wulong Liu:
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract). AAAI 2021: 15937-15938 - [c77]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. ICLR 2021 - [c76]Chao Bian, Chao Qian, Frank Neumann, Yang Yu:
Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints. IJCAI 2021: 2191-2197 - [c75]Weijie Shen, Lei Yuan, Junfu Huang, Songyi Gao, Yuyang Huang, Yang Yu:
Sequential and Dynamic constraint Contrastive Learning for Reinforcement Learning. IJCNN 2021: 1-9 - [c74]Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei (Tony) Qin, Wenjie Shang, Jieping Ye:
Offline Model-based Adaptable Policy Learning. NeurIPS 2021: 8432-8443 - [c73]Xiong-Hui Chen, Shengyi Jiang, Feng Xu, Zongzhang Zhang, Yang Yu:
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning. NeurIPS 2021: 12520-12532 - [c72]Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. NeurIPS 2021: 17604-17615 - [c71]Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao:
Adaptive Online Packing-guided Search for POMDPs. NeurIPS 2021: 28419-28430 - [i40]Rongjun Qin, Songyi Gao, Xingyuan Zhang, Zhen Xu, Shengkai Huang, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. CoRR abs/2102.00714 (2021) - [i39]Hong Qian, Yang Yu:
Derivative-Free Reinforcement Learning: A Review. CoRR abs/2102.05710 (2021) - [i38]Ruo-Ze Liu, Wenhai Wang, Yanjie Shen, Zhiqi Li, Yang Yu, Tong Lu:
An Introduction of mini-AlphaStar. CoRR abs/2104.06890 (2021) - [i37]Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay. CoRR abs/2105.07253 (2021) - [i36]Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu:
Sparsity Prior Regularized Q-learning for Sparse Action Tasks. CoRR abs/2105.08666 (2021) - [i35]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021) - [i34]Tian Xu, Ziniu Li, Yang Yu:
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions. CoRR abs/2106.10424 (2021) - [i33]Yongqing Gao, Guangda Huzhang, Weijie Shen, Yawen Liu, Wen-Ji Zhou, Qing Da, Dan Shen, Yang Yu:
Imitate TheWorld: A Search Engine Simulation Platform. CoRR abs/2107.07693 (2021) - [i32]Zhao-Hua Li, Yang Yu, Yingfeng Chen, Ke Chen, Zhipeng Hu, Changjie Fan:
Neural-to-Tree Policy Distillation with Policy Improvement Criterion. CoRR abs/2108.06898 (2021) - [i31]Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan:
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates. CoRR abs/2109.12508 (2021) - [i30]Qixin Zhang, Wenbing Ye, Zaiyi Chen, Haoyuan Hu, Enhong Chen, Yang Yu:
Online Allocation with Two-sided Resource Constraints. CoRR abs/2112.13964 (2021) - 2020
- [j17]Yi-Qi Hu
, Yang Yu:
A technical view on neural architecture search. Int. J. Mach. Learn. Cybern. 11(4): 795-811 (2020) - [j16]Chao Bian, Chao Qian, Ke Tang, Yang Yu:
Running time analysis of the (1+1)-EA for robust linear optimization. Theor. Comput. Sci. 843: 57-72 (2020) - [c70]Chao Bian, Chao Feng, Chao Qian, Yang Yu:
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints. AAAI 2020: 3267-3274 - [c69]Meng Wang, Yingfeng Chen, Tangjie Lv, Yan Song, Kai Guan, Changjie Fan, Yang Yu:
Reinforcement Learning with Action-Specific Focuses in Video Games. CoG 2020: 9-16 - [c68]Yi-Qi Hu, Zelin Liu, Hua Yang, Yang Yu, Yunfeng Liu:
Derivative-Free Optimization with Adaptive Experience for Efficient Hyper-Parameter Tuning. ECAI 2020: 1207-1214 - [c67]Shengyi Jiang, Jing-Cheng Pang, Yang Yu:
Offline Imitation Learning with a Misspecified Simulator. NeurIPS 2020 - [c66]Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. NeurIPS 2020 - [e1]Matthew E. Taylor, Yang Yu, Edith Elkind, Yang Gao:
Distributed Artificial Intelligence - Second International Conference, DAI 2020, Nanjing, China, October 24-27, 2020, Proceedings. Lecture Notes in Computer Science 12547, Springer 2020, ISBN 978-3-030-64095-8 [contents] - [i29]Wen-Ji Zhou, Yang Yu:
Temporal-adaptive Hierarchical Reinforcement Learning. CoRR abs/2002.02080 (2020) - [i28]Chao Wang, Ruo-Ze Liu, Han-Jia Ye, Yang Yu:
Novelty-Prepared Few-Shot Classification. CoRR abs/2003.00497 (2020) - [i27]Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Wen-Ji Zhou, Qing Da, Anxiang Zeng, Yang Yu:
Validation Set Evaluation can be Wrong: An Evaluator-Generator Approach for Maximizing Online Performance of Ranking in E-commerce. CoRR abs/2003.11941 (2020) - [i26]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. CoRR abs/2008.01062 (2020) - [i25]Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. CoRR abs/2010.11876 (2020)
2010 – 2019
- 2019
- [b1]Zhi-Hua Zhou, Yang Yu, Chao Qian:
Evolutionary Learning: Advances in Theories and Algorithms. Springer 2019, ISBN 978-981-13-5955-2, pp. 3-293 - [j15]Chao Qian, Yang Yu, Ke Tang, Xin Yao, Zhi-Hua Zhou:
Maximizing submodular or monotone approximately submodular functions by multi-objective evolutionary algorithms. Artif. Intell. 275: 279-294 (2019) - [c65]Yi-Qi Hu, Yang Yu, Wei-Wei Tu, Qiang Yang, Yuqiang Chen, Wenyuan Dai:
Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion. AAAI 2019: 3846-3853 - [c64]Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, Tong Lu:
On Reinforcement Learning for Full-Length Game of StarCraft. AAAI 2019: 4691-4698 - [c63]Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng:
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning. AAAI 2019: 4902-4909 - [c62]Xiong-Hui Chen, Yang Yu:
Reinforcement Learning with Derivative-Free Exploration. AAMAS 2019: 1880-1882 - [c61]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu:
Asynchronous classification-based optimization. DAI 2019: 9:1-9:8 - [c60]Songyi Gao, Weijie Shen, Zelin Liu, An Zhu, Yang Yu:
Only Image Cosine Embedding for Few-Shot Learning. ICONIP (2) 2019: 83-94 - [c59]Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. IJCAI 2019: 2528-2534 - [c58]Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. IJCAI 2019: 4447-4453 - [c57]Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. KDD 2019: 566-576 - [c56]Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou:
Bridging Machine Learning and Logical Reasoning by Abductive Learning. NeurIPS 2019: 2811-2822 - [i24]Ruo-Ze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zitai Xiao, Yuzhou Wu, Zhen-Jia Pang, Tong Lu:
Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II. CoRR abs/1903.00715 (2019) - [i23]Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. CoRR abs/1905.13703 (2019) - [i22]Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. CoRR abs/1905.13719 (2019) - [i21]Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. CoRR abs/1907.06584 (2019) - [i20]Jorge G. Madrid, Hugo Jair Escalante, Eduardo F. Morales, Wei-Wei Tu, Yang Yu, Lisheng Sun-Hosoya, Isabelle Guyon, Michèle Sebag:
Towards AutoML in the presence of Drift: first results. CoRR abs/1907.10772 (2019) - [i19]Chao Bian, Chao Qian, Yang Yu:
On the Robustness of Median Sampling in Noisy Evolutionary Optimization. CoRR abs/1907.13100 (2019) - [i18]Tian Xu, Ziniu Li, Yang Yu:
On Value Discrepancy of Imitation Learning. CoRR abs/1911.07027 (2019) - [i17]Rong-Jun Qin, Jing-Cheng Pang, Yang Yu:
Improving Fictitious Play Reinforcement Learning with Expanding Models. CoRR abs/1911.11928 (2019) - 2018
- [j14]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Analyzing Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(1) (2018) - [j13]Chao Qian, Yang Yu, Ke Tang, Yaochu Jin
, Xin Yao
, Zhi-Hua Zhou:
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(2) (2018) - [j12]Yang Yu
, Shi-Yong Chen
, Qing Da, Zhi-Hua Zhou
:
Reusable Reinforcement Learning via Shallow Trails. IEEE Trans. Neural Networks Learn. Syst. 29(6): 2204-2215 (2018) - [c55]Hong Wang, Hong Qian, Yang Yu:
Noisy Derivative-Free Optimization With Value Suppression. AAAI 2018: 1447-1454 - [c54]