default search action
Jinyi Liu 0002
Person information
- affiliation: Tianjin University, College of Intelligence and Computing, China
Other persons with the same name
- Jinyi Liu — disambiguation page
- Jinyi Liu 0001 — Tianjin University, College of Intelligence and Computing, China (and 2 more)
- Jinyi Liu 0003 — Chinese Academy of Sciences, Institute of Semiconductors, Beijing, China
- Jinyi Liu 0004 — China Agricultural University, Beijing, China
- Jinyi Liu 0005 — Loughborough University, UK
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Jianye Hao, Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Zhaopeng Meng, Peng Liu, Zhen Wang:
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain. IEEE Trans. Neural Networks Learn. Syst. 35(7): 8762-8782 (2024) - [c10]Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. AAAI 2024: 13954-13962 - [c9]Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan:
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning. AAMAS 2024: 1229-1237 - [c8]Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles. AAMAS 2024: 2609-2611 - [c7]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms. AAMAS 2024: 2621-2623 - [c6]Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng:
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. ICLR 2024 - [c5]Longxin Kou, Fei Ni, Yan Zheng, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye Hao:
KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations. ICML 2024 - [c4]Kai Zhao, Jianye Hao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles. IJCAI 2024: 5563-5571 - [c3]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement. IJCAI 2024: 5725-5733 - [i12]Yifu Yuan, Jianye Hao, Yi Ma, Zibin Dong, Hebin Liang, Jinyi Liu, Zhixin Feng, Kai Zhao, Yan Zheng:
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback. CoRR abs/2402.02423 (2024) - [i11]Jinyi Liu, Yifu Yuan, Jianye Hao, Fei Ni, Lingzhi Fu, Yibin Chen, Yan Zheng:
Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models. CoRR abs/2402.14245 (2024) - [i10]Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao:
SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models. CoRR abs/2403.03636 (2024) - [i9]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement. CoRR abs/2405.08638 (2024) - [i8]Yihang Xiao, Jinyi Liu, Yan Zheng, Xiaohan Xie, Jianye Hao, Mingzhi Li, Ruitao Wang, Fei Ni, Yuxiao Li, Jintian Luo, Shaoqing Jiao, Jiajie Peng:
CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis. CoRR abs/2407.09811 (2024) - 2023
- [c2]Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan:
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model. ICLR 2023 - [i7]Shixi Lian, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach. CoRR abs/2306.06329 (2023) - [i6]Kai Zhao, Yi Ma, Jinyi Liu, Yan Zheng, Zhaopeng Meng:
Ensemble-based Offline-to-Online Reinforcement Learning: From Pessimistic Learning to Optimistic Exploration. CoRR abs/2306.06871 (2023) - [i5]Jinyi Liu, Yi Ma, Jianye Hao, Yujing Hu, Yan Zheng, Tangjie Lv, Changjie Fan:
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning. CoRR abs/2306.15503 (2023) - [i4]Jinyi Liu, Zhi Wang, Yan Zheng, Jianye Hao, Chenjia Bai, Junjie Ye, Zhen Wang, Haiyin Piao, Yang Sun:
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments. CoRR abs/2312.12145 (2023) - 2022
- [i3]Yifu Yuan, Jianye Hao, Fei Ni, Yao Mu, Yan Zheng, Yujing Hu, Jinyi Liu, Yingfeng Chen, Changjie Fan:
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model. CoRR abs/2210.00498 (2022) - 2021
- [c1]Shaohua Zhang, Shuang Liu, Jun Sun, Yuqi Chen, Wenzhi Huang, Jinyi Liu, Jian Liu, Jianye Hao:
FIGCPS: Effective Failure-inducing Input Generation for Cyber-Physical Systems with Deep Reinforcement Learning. ASE 2021: 555-567 - [i2]Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Jianye Hao, Zhaopeng Meng, Peng Liu:
Exploration in Deep Reinforcement Learning: A Comprehensive Survey. CoRR abs/2109.06668 (2021) - [i1]Cong Wang, Tianpei Yang, Jianye Hao, Yan Zheng, Hongyao Tang, Fazl Barez, Jinyi Liu, Jiajie Peng, Haiyin Piao, Zhixiao Sun:
ED2: An Environment Dynamics Decomposition Framework for World Model Construction. CoRR abs/2112.02817 (2021)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-05 20:45 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint