default search action

combined dblp search
author search
venue search
publication search

ask others

Hiteshi Sharma

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - journals/tmlr/ZhangYS0LYWA025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/ZhangYS0LYWA025
Shenao Zhang, Donghan Yu, Hiteshi Sharma, Han Zhong, Zhihan Liu, Ziyi Yang, Shuohang Wang, Hany Hassan Awadalla, Zhaoran Wang:
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment. Trans. Mach. Learn. Res. 2025 (2025)
2024
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/HuangSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/HuangSM24
Baihe Huang, Hiteshi Sharma, Yi Mao:
Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing. EMNLP 2024: 21341-21352
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/FengXHSSZC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/FengXHSSZC24
Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen:
Language Models can be Deductive Solvers. NAACL-HLT (Findings) 2024: 4026-4042
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-19332
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19332
Shenao Zhang, Donghan Yu, Hiteshi Sharma, Ziyi Yang, Shuohang Wang, Hany Hassan, Zhaoran Wang:
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment. CoRR abs/2405.19332 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02119
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02119
Yifang Chen, Shuohang Wang, Ziyi Yang, Hiteshi Sharma, Nikos Karampatziakis, Donghan Yu, Kevin Jamieson, Simon Shaolei Du, Yelong Shen:
Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning. CoRR abs/2407.02119 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-13833
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-13833
Emman Haider, Daniel Perez-Becker, Thomas Portet, Piyush Madan, Amit Garg, David Majercak, Wen Wen, Dongwoo Kim, Ziyi Yang, Jianwen Zhang, Hiteshi Sharma, Blake Bullwinkel, Martin Pouliot, Amanda J. Minnich, Shiven Chawla, Solianna Herrera, Shahed Warreth, Maggie Engler, Gary Lopez, Nina Chikanov, Raja Sekhar Rao Dheekonda, Bolor-Erdene Jagdagdorj, Roman Lutz, Richard Lundeen, Tori Westerhoff, Pete Bryan, Christian Seifert, Ram Shankar Siva Kumar, Andrew Berkley, Alex Kessler:
Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle. CoRR abs/2407.13833 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-24096
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-24096
Nabil Omi, Hosein Hasanbeig, Hiteshi Sharma, Sriram K. Rajamani, Siddhartha Sen:
Progressive Safeguards for Safe and Model-Agnostic Reinforcement Learning. CoRR abs/2410.24096 (2024)
2023
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MomennejadHFSJP23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MomennejadHFSJP23
Ida Momennejad, Hosein Hasanbeig, Felipe Vieira Frujeri, Hiteshi Sharma, Nebojsa Jojic, Hamid Palangi, Robert Osazuwa Ness, Jonathan Larson:
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval. NeurIPS 2023
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-02231
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02231
Banghua Zhu, Hiteshi Sharma, Felipe Vieira Frujeri, Shi Dong, Chenguang Zhu, Michael I. Jordan, Jiantao Jiao:
Fine-Tuning Language Models with Advantage-Induced Policy Alignment. CoRR abs/2306.02231 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13701
Hosein Hasanbeig, Hiteshi Sharma, Leo Betthauser, Felipe Vieira Frujeri, Ida Momennejad:
ALLURE: Auditing and Improving LLM-based Evaluation of Text using Iterative In-Context-Learning. CoRR abs/2309.13701 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15129
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15129
Ida Momennejad, Hosein Hasanbeig, Felipe Vieira Frujeri, Hiteshi Sharma, Robert Osazuwa Ness, Nebojsa Jojic, Hamid Palangi, Jonathan Larson:
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval. CoRR abs/2309.15129 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-06158
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-06158
Jiazhan Feng, Ruochen Xu, Junheng Hao, Hiteshi Sharma, Yelong Shen, Dongyan Zhao, Weizhu Chen:
Language Models can be Logical Solvers. CoRR abs/2311.06158 (2023)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tac/HaskellJSY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tac/HaskellJSY20
William B. Haskell, Rahul Jain, Hiteshi Sharma, Pengqian Yu:
A Universal Empirical Dynamic Programming Algorithm for Continuous State MDPs. IEEE Trans. Autom. Control. 65(1): 115-129 (2020)
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/Sharma020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/Sharma020
Hiteshi Sharma, Rahul Jain:
Finite Time Guarantees for Continuous State MDPs with Generative Model. CDC 2020: 3617-3622
[c8]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/WeiJLS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/WeiJLS020
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain:
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes. ICML 2020: 10170-10180
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2006-04331
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-04331
Hiteshi Sharma, Rahul Jain:
Randomized Policy Learning for Continuous State and Action MDPs. CoRR abs/2006.04331 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/allerton/Sharma019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/allerton/Sharma019
Hiteshi Sharma, Rahul Jain:
An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions. Allerton 2019: 734-740
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/Sharma0H19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/Sharma0H19
Hiteshi Sharma, Rahul Jain, William B. Haskell:
Empirical Algorithms for General Stochastic Systems with Continuous States and Actions. CDC 2019: 6344-6349
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/eucc/Sharma0G19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eucc/Sharma0G19
Hiteshi Sharma, Rahul Jain, Abhishek K. Gupta:
An Empirical Relative Value Learning Algorithm for Non-parametric MDPs with Continuous State Space. ECC 2019: 1368-1373
[c4]
- view
- export record
  dblp key:
  - conf/uai/SharmaJ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/SharmaJ019
Hiteshi Sharma, Mehdi Jafarnia-Jahromi, Rahul Jain:
Approximate Relative Value Learning for Average-reward Continuous State MDPs. UAI 2019: 956-964
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07072
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07072
Chen-Yu Wei, Mehdi Jafarnia-Jahromi, Haipeng Luo, Hiteshi Sharma, Rahul Jain:
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes. CoRR abs/1910.07072 (2019)
2017
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/HaskellYS017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/HaskellYS017
William B. Haskell, Pengqian Yu, Hiteshi Sharma, Rahul Jain:
Randomized function fitting-based empirical value iteration. CDC 2017: 2467-2472
2016
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/Haskell0S16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/Haskell0S16
William B. Haskell, Rahul Jain, Hiteshi Sharma:
A dynamical systems framework for stochastic iterative optimization. CDC 2016: 4504-4509
2014
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/vtc/SharmaPMD14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/vtc/SharmaPMD14
Hiteshi Sharma, Aaqib Patel, S. N. Merchant, Uday B. Desai:
Optimal Spectrum Sensing for Cognitive Radio with Imperfect Detector. VTC Spring 2014: 1-5

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.