


default search action
Hui Yuan 0002
Person information
- affiliation: Princeton University, Department of Electrical and Computer Engineering, NJ, USA
- affiliation (former): University of Science and Technology of China, Hefei, China
Other persons with the same name
- Hui Yuan — disambiguation page
- Hui Yuan 0001
— Shandong University, School of Control Science and Engineering, Jinan, China (and 2 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[c10]Hui Yuan, Yifan Zeng, Yue Wu, Huazheng Wang, Mengdi Wang, Liu Leqi:
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement. ICLR 2025
[i15]Kaixuan Huang, Jiacheng Guo, Zihao Li, Xiang Ji, Jiawei Ge, Wenzhe Li, Yingqing Guo, Tianle Cai, Hui Yuan, Runzhe Wang, Yue Wu, Ming Yin, Shange Tang, Yangsibo Huang, Chi Jin, Xinyun Chen, Chiyuan Zhang, Mengdi Wang:
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations. CoRR abs/2502.06453 (2025)
[i14]Quan Xiao, Hui Yuan, A. F. M. Saif, Gaowen Liu, Ramana Kompella, Mengdi Wang, Tianyi Chen:
A First-order Generative Bilevel Optimization Framework for Diffusion Models. CoRR abs/2502.08808 (2025)
[i13]Yingqing Guo, Yukang Yang, Hui Yuan, Mengdi Wang:
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models. CoRR abs/2502.11420 (2025)- 2024
[j1]Zichen Wang, Rishab Balasubramanian, Hui Yuan, Chenyu Song, Mengdi Wang, Huazheng Wang:
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models. Trans. Mach. Learn. Res. 2024 (2024)
[c9]Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang:
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization. AAAI 2024: 14686-14694
[c8]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang:
MaxMin-RLHF: Alignment with Diverse Human Preferences. ICML 2024
[c7]Shuhua Yang
, Hui Yuan
, Xiaoying Zhang
, Mengdi Wang
, Hong Zhang
, Huazheng Wang
:
Conversational Dueling Bandits in Generalized Linear Models. KDD 2024: 3806-3817
[c6]Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang:
Gradient Guidance for Diffusion Models: An Optimization Perspective. NeurIPS 2024
[i12]Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang
:
Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization. CoRR abs/2401.06173 (2024)
[i11]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang
:
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences. CoRR abs/2402.08925 (2024)
[i10]Zihao Li, Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Yinyu Ye, Minshuo Chen, Mengdi Wang:
Diffusion Model for Data-Driven Black-Box Optimization. CoRR abs/2403.13219 (2024)
[i9]Yingqing Guo, Hui Yuan, Yukang Yang, Minshuo Chen, Mengdi Wang:
Gradient Guidance for Diffusion Models: An Optimization Perspective. CoRR abs/2404.14743 (2024)
[i8]Shuhua Yang, Hui Yuan, Xiaoying Zhang, Mengdi Wang, Hong Zhang, Huazheng Wang:
Conversational Dueling Bandits in Generalized Linear Models. CoRR abs/2407.18488 (2024)
[i7]Hui Yuan, Yifan Zeng, Yue Wu, Huazheng Wang, Mengdi Wang, Liu Leqi:
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement. CoRR abs/2410.13828 (2024)- 2023
[c5]Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang:
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement. NeurIPS 2023
[c4]Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang:
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. NeurIPS 2023
[i6]Zichen Wang, Rishab Balasubramanian, Hui Yuan, Chenyu Song, Mengdi Wang, Huazheng Wang:
Adversarial Attacks on Online Learning to Rank with Stochastic Click Models. CoRR abs/2305.19218 (2023)
[i5]Zeyu Zhang, Yi Su, Hui Yuan, Yiran Wu, Rishab Balasubramanian, Qingyun Wu, Huazheng Wang, Mengdi Wang
:
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective. CoRR abs/2306.07528 (2023)
[i4]Hui Yuan, Kaixuan Huang, Chengzhuo Ni, Minshuo Chen, Mengdi Wang:
Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement. CoRR abs/2307.07055 (2023)- 2022
[c3]Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong, Csaba Szepesvári, Mengdi Wang:
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization. NeurIPS 2022
[i3]Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong
, Csaba Szepesvári, Mengdi Wang
:
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization. CoRR abs/2206.02092 (2022)- 2020
[c2]Hui Yuan, Yingyu Liang:
Learning Entangled Single-Sample Distributions via Iterative Trimming. AISTATS 2020: 2666-2676
[c1]Yingyu Liang, Hui Yuan:
Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model. COLT 2020: 2712-2737
[i2]Hui Yuan, Yingyu Liang:
Learning Entangled Single-Sample Distributions via Iterative Trimming. CoRR abs/2004.09563 (2020)
[i1]Yingyu Liang, Hui Yuan:
Learning Entangled Single-Sample Gaussians in the Subset-of-Signals Model. CoRR abs/2007.05557 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-11-13 22:05 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







