default search action

combined dblp search
author search
venue search
publication search

ask others

Yangchen Pan

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jair/PanWXT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jair/PanWXT25
Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip H. S. Torr:
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models. J. Artif. Intell. Res. 83 (2025)
[c21]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/MaPF25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MaPF25
Avery Ma, Yangchen Pan, Amir-massoud Farahmand:
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling. ICML 2025
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-01925
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-01925
Avery Ma, Yangchen Pan, Amir-massoud Farahmand:
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling. CoRR abs/2502.01925 (2025)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-11412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-11412
Yudong Luo, Yangchen Pan, Jiaqi Tan, Pascal Poupart:
Measures of Variability for Risk-averse Policy Gradient. CoRR abs/2504.11412 (2025)
2024
[j4]
- view
  - electronic edition @ umass.edu (open access)
  - details & citations
- export record
  dblp key:
  - conf/rlc/LuoPW0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/rlc/LuoPW0P24
Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart:
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization. RLJ 2: 573-592 (2024)
[j3]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/ImaniZL0PTP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/ImaniZL0PTP24
Ehsan Imani, Guojun Zhang, Runjia Li, Jun Luo, Pascal Poupart, Philip H. S. Torr, Yangchen Pan:
Label Alignment Regularization for Distribution Shift. J. Mach. Learn. Res. 25: 247:1-247:32 (2024)
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/MaFPTG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/MaFPTG24
Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu:
Improving Adversarial Transferability via Model Alignment. ECCV (62) 2024: 74-92
[c19]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LuoPW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LuoPW024
Zhiyao Luo, Yangchen Pan, Peter J. Watkinson, Tingting Zhu:
Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination. ICML 2024
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-11062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-11062
Yudong Luo, Yangchen Pan, Han Wang, Philip Torr, Pascal Poupart:
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization. CoRR abs/2403.11062 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-15518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-15518
Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr:
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models. CoRR abs/2404.15518 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18556
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18556
Zhiyao Luo, Yangchen Pan, Peter J. Watkinson, Tingting Zhu:
Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination. CoRR abs/2405.18556 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-18610
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-18610
Zhiyao Luo, Mingcheng Zhu, Fenglin Liu, Jiali Li, Yangchen Pan, Jiandong Zhou, Tingting Zhu:
DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime. CoRR abs/2405.18610 (2024)
2023
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/LanPLM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/LanPLM23
Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood:
Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation. Trans. Mach. Learn. Res. 2023 (2023)
[j1]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/MaPF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/MaPF23
Avery Ma, Yangchen Pan, Amir-massoud Farahmand:
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods. Trans. Mach. Learn. Res. 2023 (2023)
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/NeumannLJP0W23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NeumannLJP0W23
Samuel Neumann, Sungsu Lim, Ajin George Joseph, Yangchen Pan, Adam White, Martha White:
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement. ICLR 2023
[c17]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/XiaoWP0W23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/XiaoWP0W23
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White:
The In-Sample Softmax for Offline Reinforcement Learning. ICLR 2023
[c16]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LuoLPP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LuoLPP23
Yudong Luo, Guiliang Liu, Pascal Poupart, Yangchen Pan:
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient. NeurIPS 2023
[c15]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/ZhaoPXCR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/ZhaoPXCR23
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran:
Conditionally optimistic exploration for cooperative deep multi-agent reinforcement learning. UAI 2023: 2529-2540
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-14372
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-14372
Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White:
The In-Sample Softmax for Offline Reinforcement Learning. CoRR abs/2302.14372 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-09032
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-09032
Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran:
Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning. CoRR abs/2303.09032 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08873
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08873
Yudong Luo, Guiliang Liu, Pascal Poupart, Yangchen Pan:
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient. CoRR abs/2307.08873 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06703
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06703
Avery Ma, Yangchen Pan, Amir-massoud Farahmand:
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods. CoRR abs/2308.06703 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-18495
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-18495
Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip H. S. Torr, Jindong Gu:
Improving Adversarial Transferability via Model Alignment. CoRR abs/2311.18495 (2023)
2022
[c14]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/aistats/0006TPWM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aistats/0006TPWM22
Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, Rupam Mahmood:
An Alternate Policy Gradient Estimator for Softmax Policies. AISTATS 2022: 6630-6689
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/XuLPJL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/XuLPJL22
Liangliang Xu, Daoming Lyu, Yangchen Pan, Aiwen Jiang, Bo Liu:
TOPS: Transition-Based Volatility-Reduced Policy Search. AAMAS Workshops 2022: 3-47
[c12]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/PanMFWYR022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/PanMFWYR022
Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand, Martha White, Hengshuai Yao, Mohsen Rohani, Jun Luo:
Understanding and mitigating the limitations of prioritized experience replay. UAI 2022: 1561-1571
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-10868
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10868
Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood:
Memory-efficient Reinforcement Learning with Knowledge Consolidation. CoRR abs/2205.10868 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-14960
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-14960
Ehsan Imani, Guojun Zhang, Jun Luo, Pascal Poupart, Yangchen Pan:
Label Alignment Regularization for Distribution Shift. CoRR abs/2211.14960 (2022)
2021
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/PanBW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PanBW21
Yangchen Pan, Kirby Banman, Martha White:
Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online. ICLR 2021
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-11622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-11622
Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, A. Rupam Mahmood:
An Alternate Policy Gradient Estimator for Softmax Policies. CoRR abs/2112.11622 (2021)
2020
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LanPFW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LanPFW20
Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White:
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning. ICLR 2020
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/PanMF20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PanMF20
Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand:
Frequency-based Search-control in Dyna. ICLR 2020
[c8]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PanIFW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PanIFW20
Yangchen Pan, Ehsan Imani, Amir-massoud Farahmand, Martha White:
An implicit function learning approach for parametric modal regression. NeurIPS 2020
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-05822
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-05822
Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand:
Frequency-based Search-control in Dyna. CoRR abs/2002.05822 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06195
Yangchen Pan, Ehsan Imani, Martha White, Amir-massoud Farahmand:
An implicit function learning approach for parametric modal regression. CoRR abs/2002.06195 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06487
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06487
Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White:
Maxmin Q-learning: Controlling the Estimation Bias of Q-learning. CoRR abs/2002.06487 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-09569
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-09569
Jincheng Mei, Yangchen Pan, Martha White, Amir-massoud Farahmand, Hengshuai Yao:
Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities. CoRR abs/2007.09569 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/PanYFW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/PanYFW19
Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White:
Hill Climbing on Value Estimates for Search-control in Dyna. IJCAI 2019: 3209-3215
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07791
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07791
Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White:
Hill Climbing on Value Estimates for Search-control in Dyna. CoRR abs/1906.07791 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-08068
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-08068
Yangchen Pan:
Deep Tile Coder: an Efficient Sparse Representation Learning Approach with applications in Reinforcement Learning. CoRR abs/1911.08068 (2019)
2018
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/PanFWNGN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/PanFWNGN18
Yangchen Pan, Amir-massoud Farahmand, Martha White, Saleh Nabi, Piyush Grover, Daniel Nikovski:
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control. ICML 2018: 3983-3992
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/PanZWPW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/PanZWPW18
Yangchen Pan, Muhammad Zaheer, Adam White, Andrew Patterson, Martha White:
Organizing Experience: a Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains. IJCAI 2018: 4794-4800
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-04624
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-04624
Yangchen Pan, Muhammad Zaheer, Adam White, Andrew Patterson, Martha White:
Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-based Planning in Continuous State Domains. CoRR abs/1806.04624 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1806-06931
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-06931
Yangchen Pan, Amir-massoud Farahmand, Martha White, Saleh Nabi, Piyush Grover, Daniel Nikovski:
Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control. CoRR abs/1806.06931 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-09103
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-09103
Sungsu Lim, Ajin Joseph, Lei Le, Yangchen Pan, Martha White:
Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces. CoRR abs/1810.09103 (2018)
2017
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/PanWW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/PanWW17
Yangchen Pan, Adam White, Martha White:
Accelerated Gradient Temporal Difference Learning. AAAI 2017: 2464-2470
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/SchlegelPCW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/SchlegelPCW17
Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White:
Adapting Kernel Representations Online Using Submodular Maximization. ICML 2017: 3037-3046
[c2]
- view
  - electronic edition @ auai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/PanAW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/PanAW17
Yangchen Pan, Erfan Sadeqi Azer, Martha White:
Effective sketching methods for value function approximation. UAI 2017
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-01298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-01298
Yangchen Pan, Erfan Sadeqi Azer, Martha White:
Effective sketching methods for value function approximation. CoRR abs/1708.01298 (2017)
2016
[c1]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/GehringPW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/GehringPW16
Clement Gehring, Yangchen Pan, Martha White:
Incremental Truncated LSTD. IJCAI 2016: 1505-1511
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/PanWW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/PanWW16
Yangchen Pan, Adam White, Martha White:
Accelerated Gradient Temporal Difference Learning. CoRR abs/1611.09328 (2016)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.