Stop the war!

Остановите войну!

for scientists:

default search action

combined dblp search
author search
venue search
publication search

ask others

Search dblp

> Home

Author search results

Likely matches

Alekh Agarwal

Publication search results

found 176 matches

2024
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/alt/AbernethyAMW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/alt/AbernethyAMW24
Jacob D. Abernethy, Alekh Agarwal, Teodor Vanislavov Marinov, Manfred K. Warmuth:
A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks. ALT 2024: 3-46
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-01879
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-01879
Ahmad Beirami, Alekh Agarwal, Jonathan Berant, Alexander D'Amour, Jacob Eisenstein, Chirag Nagpal, Ananda Theertha Suresh:
Theoretical guarantees on the best-of-n alignment policy. CoRR abs/2401.01879 (2024)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-04056
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-04056
Gokul Swamy, Christoph Dann, Rahul Kidambi, Zhiwei Steven Wu, Alekh Agarwal:
A Minimaximalist Approach to Reinforcement Learning from Human Feedback. CoRR abs/2401.04056 (2024)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-07198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-07198
Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun:
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning. CoRR abs/2402.07198 (2024)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-17235
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17235
Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvári, Dale Schuurmans:
Stochastic Gradient Succeeds for Bandits. CoRR abs/2402.17235 (2024)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-19462
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-19462
Teodor V. Marinov, Alekh Agarwal, Mircea Trofin:
Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization. CoRR abs/2403.19462 (2024)
2023
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/Agarwal00WWZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/Agarwal00WWZ23
Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang:
Provable Benefits of Representational Transfer in Reinforcement Learning. COLT 2023: 2114-2187
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/AgarwalJ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/AgarwalJ023
Alekh Agarwal, Yujia Jin, Tong Zhang:
VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation. COLT 2023: 987-1063
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/0002AD023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/0002AD023
Jonathan Lee, Alekh Agarwal, Christoph Dann, Tong Zhang:
Learning in POMDPs is Sample-Efficient with Hindsight Observability. ICML 2023: 18733-18773
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/MeiZ0ASS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/MeiZ0ASS23
Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvári, Dale Schuurmans:
Stochastic Gradient Succeeds for Bandits. ICML 2023: 24325-24360
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Mei0AGSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Mei0AGSS23
Jincheng Mei, Bo Dai, Alekh Agarwal, Mohammad Ghavamzadeh, Csaba Szepesvári, Dale Schuurmans:
Ordering-based Conditions for Global Convergence of Policy Gradient Methods. NeurIPS 2023
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-13857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-13857
Jonathan N. Lee, Alekh Agarwal, Christoph Dann, Tong Zhang:
Learning in POMDPs is Sample-Efficient with Hindsight Observability. CoRR abs/2301.13857 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03784
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03784
Alekh Agarwal, Claudio Gentile, Teodor V. Marinov:
Leveraging User-Triggered Supervision in Contextual Bandits. CoRR abs/2302.03784 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-10218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-10218
Alekh Agarwal, H. Brendan McMahan, Zheng Xu:
An Empirical Evaluation of Federated Contextual Bandit Algorithms. CoRR abs/2303.10218 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17040
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17040
Jacob D. Abernethy, Alekh Agarwal, Teodor V. Marinov, Manfred K. Warmuth:
A Mechanism for Sample-Efficient In-Context Learning for Sparse Retrieval Tasks. CoRR abs/2305.17040 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-09497
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-09497
Alexander Goldberg, Ivan Stelmakh, Kyunghyun Cho, Alice H. Oh, Alekh Agarwal, Danielle Belgrave, Nihar B. Shah:
Peer Reviews of Peer Reviews: A Randomized Controlled Trial and Other Experiments. CoRR abs/2311.09497 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-09612
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-09612
Wang Zhu, Alekh Agarwal, Mandar Joshi, Robin Jia, Jesse Thomason, Kristina Toutanova:
Efficient End-to-End Visual Document Understanding with Rationale Distillation. CoRR abs/2311.09612 (2023)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09244
Jacob Eisenstein, Chirag Nagpal, Alekh Agarwal, Ahmad Beirami, Alex D'Amour, Dj Dvijotham, Adam Fisch, Katherine A. Heller, Stephen Pfohl, Deepak Ramachandran, Peter Shaw, Jonathan Berant:
Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking. CoRR abs/2312.09244 (2023)
2022
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/AgarwalZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/AgarwalZ22
Alekh Agarwal, Tong Zhang:
Minimax Regret Optimization for Robust Machine Learning under Distribution Shift. COLT 2022: 2704-2729
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/colt/AgarwalZ22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/colt/AgarwalZ22a
Alekh Agarwal, Tong Zhang:
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling. COLT 2022: 2776-2814
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/EfroniMKA022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/EfroniMKA022
Yonathan Efroni, Dipendra Misra, Akshay Krishnamurthy, Alekh Agarwal, John Langford:
Provably Filtering Exogenous Distractors using Multistep Inverse Dynamics. ICLR 2022
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ChengX0A22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChengX0A22
Ching-An Cheng, Tengyang Xie, Nan Jiang, Alekh Agarwal:
Adversarially Trained Actor Critic for Offline Reinforcement Learning. ICML 2022: 3852-3878
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/ZhangSUWAS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZhangSUWAS22
Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun:
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning approach. ICML 2022: 26517-26547
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Agarwal022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Agarwal022
Alekh Agarwal, Tong Zhang:
Model-based RL with Optimistic Posterior Sampling: Structural Conditions and Sample Complexity. NeurIPS 2022
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/Chen0K0A22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Chen0K0A22
Jinglin Chen, Aditya Modi, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal:
On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL. NeurIPS 2022
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-00063
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00063
Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun:
Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach. CoRR abs/2202.00063 (2022)
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-02446
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-02446
Ching-An Cheng, Tengyang Xie, Nan Jiang, Alekh Agarwal:
Adversarially Trained Actor Critic for Offline Reinforcement Learning. CoRR abs/2202.02446 (2022)
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-05436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-05436
Alekh Agarwal, Tong Zhang:
Minimax Regret Optimization for Robust Machine Learning under Distribution Shift. CoRR abs/2202.05436 (2022)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-08248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-08248
Alekh Agarwal, Tong Zhang:
Non-Linear Reinforcement Learning in Large Action Spaces: Structural Conditions and Sample-efficiency of Posterior Sampling. CoRR abs/2203.08248 (2022)
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-14571
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14571
Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang:
Provable Benefits of Representational Transfer in Reinforcement Learning. CoRR abs/2205.14571 (2022)

skipping 146 more matches

a service of

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.

Search dblp

Full-text search

Please enter a search query

Author search results

Venue search results

Refine list

Publication search results