Kavosh Asadi

2020 – today

2024
[c15]
  - https://dblp.org/rec/conf/iclr/LiuZA0ZSF24
Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. ICLR 2024
[i20]
  - https://dblp.org/rec/journals/corr/abs-2406-01838
Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor:
Learning the Target Network in Function Space. CoRR abs/2406.01838 (2024)
2023
[c14]
  - https://dblp.org/rec/conf/aistats/GottesmanAAL0L23
Omer Gottesman, Kavosh Asadi, Cameron S. Allen, Samuel Lobel, George Konidaris, Michael Littman:
Coarse-Grained Smoothness for Reinforcement Learning in Metric Spaces. AISTATS 2023: 1390-1410
[c13]
  - https://dblp.org/rec/conf/nips/AsadiFS23
Kavosh Asadi, Rasool Fakoor, Shoham Sabach:
Resetting the Optimizer in Deep RL: An Empirical Study. NeurIPS 2023
[c12]
  - https://dblp.org/rec/conf/nips/AsadiS0GF23
Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. NeurIPS 2023
[i19]
  - https://dblp.org/rec/journals/corr/abs-2306-17750
Kavosh Asadi, Shoham Sabach, Yao Liu, Omer Gottesman, Rasool Fakoor:
TD Convergence: An Optimization Perspective. CoRR abs/2306.17750 (2023)
[i18]
  - https://dblp.org/rec/journals/corr/abs-2306-17833
Kavosh Asadi, Rasool Fakoor, Shoham Sabach:
Resetting the Optimizer in Deep RL: An Empirical Study. CoRR abs/2306.17833 (2023)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2310-05905
Zuxin Liu, Jesse Zhang, Kavosh Asadi, Yao Liu, Ding Zhao, Shoham Sabach, Rasool Fakoor:
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models. CoRR abs/2310.05905 (2023)
2022
[c11]
  - https://dblp.org/rec/conf/nips/AsadiFGKLS22
Kavosh Asadi, Rasool Fakoor, Omer Gottesman, Taesup Kim, Michael L. Littman, Alexander J. Smola:
Faster Deep Reinforcement Learning with Slower Online Network. NeurIPS 2022
[c10]
  - https://dblp.org/rec/conf/nips/KlissarovFMAKS22
Martin Klissarov, Rasool Fakoor, Jonas W. Mueller, Kavosh Asadi, Taesup Kim, Alexander J. Smola:
Adaptive Interest for Emphatic Reinforcement Learning. NeurIPS 2022
[i16]
  - https://dblp.org/rec/journals/corr/abs-2205-05588
Zhiyuan Zhou, Cameron Allen, Kavosh Asadi, George Konidaris:
Characterizing the Action-Generalization Gap in Deep Q-Learning. CoRR abs/2205.05588 (2022)
2021
[b1]
  - https://dblp.org/rec/phd/us/Asadi21
Kavosh Asadi:
Smoothness in Reinforcement Learning with Large State and Action Spaces. Brown University, USA, 2021
[c9]
  - https://dblp.org/rec/conf/aaai/AsadiPPKL21
Kavosh Asadi, Neev Parikh, Ronald E. Parr, George Dimitri Konidaris, Michael L. Littman:
Deep Radial-Basis Value Functions for Continuous Control. AAAI 2021: 6696-6704
[c8]
  - https://dblp.org/rec/conf/aaai/LecarpentierAAJ21
Erwan Lecarpentier, David Abel, Kavosh Asadi, Yuu Jinnai, Emmanuel Rachelson, Michael L. Littman:
Lipschitz Lifelong Reinforcement Learning. AAAI 2021: 8270-8278
[c7]
  - https://dblp.org/rec/conf/nips/FakoorMACS21
Rasool Fakoor, Jonas Mueller, Kavosh Asadi, Pratik Chaudhari, Alexander J. Smola:
Continuous Doubly Constrained Batch Reinforcement Learning. NeurIPS 2021: 11260-11273
[i15]
  - https://dblp.org/rec/journals/corr/abs-2109-07054
Ishaan Shah, David Halpern, Kavosh Asadi, Michael L. Littman:
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback. CoRR abs/2109.07054 (2021)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2110-12276
Omer Gottesman, Kavosh Asadi, Cameron Allen, Sam Lobel, George Konidaris, Michael Littman:
Coarse-Grained Smoothness for RL in Metric Spaces. CoRR abs/2110.12276 (2021)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2112-05848
Kavosh Asadi, Rasool Fakoor, Omer Gottesman, Michael L. Littman, Alexander J. Smola:
Deep Q-Network with Proximal Iteration. CoRR abs/2112.05848 (2021)
2020
[i12]
  - https://dblp.org/rec/journals/corr/abs-2001-05411
Erwan Lecarpentier, David Abel, Kavosh Asadi, Yuu Jinnai, Emmanuel Rachelson, Michael L. Littman:
Lipschitz Lifelong Reinforcement Learning. CoRR abs/2001.05411 (2020)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2002-01883
Kavosh Asadi, Ronald E. Parr, George Dimitri Konidaris, Michael L. Littman:
Deep RBF Value Functions for Continuous Control. CoRR abs/2002.01883 (2020)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2002-05518
Kavosh Asadi, David Abel, Michael Littman:
Learning State Abstractions for Transfer in Continuous Control. CoRR abs/2002.05518 (2020)

2010 – 2019

2019
[c6]
  - https://dblp.org/rec/conf/aaai/AbelAAJLW19
David Abel, Dilip Arumugam, Kavosh Asadi, Yuu Jinnai, Michael L. Littman, Lawson L. S. Wong:
State Abstraction as Compression in Apprenticeship Learning. AAAI 2019: 3134-3142
[c5]
  - https://dblp.org/rec/conf/atal/KimALK19
Seungchan Kim, Kavosh Asadi, Michael L. Littman, George Dimitri Konidaris:
Removing the Target Network from Deep Q-Networks with the Mellowmax Operator. AAMAS 2019: 2060-2062
[c4]
  - https://dblp.org/rec/conf/ijcai/KimALK19
Seungchan Kim, Kavosh Asadi, Michael L. Littman, George Dimitri Konidaris:
DeepMellow: Removing the Need for a Target Network in Deep Q-Learning. IJCAI 2019: 2733-2739
[i9]
  - https://dblp.org/rec/journals/corr/abs-1905-13320
Kavosh Asadi, Dipendra Misra, Seungchan Kim, Michael L. Littman:
Combating the Compounding-Error Problem with a Multi-step Model. CoRR abs/1905.13320 (2019)
2018
[c3]
  - https://dblp.org/rec/conf/icml/AsadiML18
Kavosh Asadi, Dipendra Misra, Michael L. Littman:
Lipschitz Continuity in Model-based Reinforcement Learning. ICML 2018: 264-273
[i8]
  - https://dblp.org/rec/journals/corr/abs-1804-07193
Kavosh Asadi, Dipendra Misra, Michael L. Littman:
Lipschitz Continuity in Model-based Reinforcement Learning. CoRR abs/1804.07193 (2018)
[i7]
  - https://dblp.org/rec/journals/corr/abs-1806-01265
Kavosh Asadi, Evan Cater, Dipendra Misra, Michael L. Littman:
Equivalence Between Wasserstein and Value-Aware Model-based Reinforcement Learning. CoRR abs/1806.01265 (2018)
[i6]
  - https://dblp.org/rec/journals/corr/abs-1811-00128
Kavosh Asadi, Evan Cater, Dipendra Misra, Michael L. Littman:
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning. CoRR abs/1811.00128 (2018)
[i5]
  - https://dblp.org/rec/journals/corr/abs-1812-01129
Dilip Arumugam, David Abel, Kavosh Asadi, Nakul Gopalan, Christopher Grimm, Jun Ki Lee, Lucas Lehnert, Michael L. Littman:
Mitigating Planner Overfitting in Model-Based Reinforcement Learning. CoRR abs/1812.01129 (2018)
2017
[c2]
  - https://dblp.org/rec/conf/acl/WilliamsAZ17
Jason D. Williams, Kavosh Asadi, Geoffrey Zweig:
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. ACL (1) 2017: 665-677
[c1]
  - https://dblp.org/rec/conf/icml/AsadiL17
Kavosh Asadi, Michael L. Littman:
An Alternative Softmax Operator for Reinforcement Learning. ICML 2017: 243-252
[i4]
  - https://dblp.org/rec/journals/corr/WilliamsAZ17
Jason D. Williams, Kavosh Asadi, Geoffrey Zweig:
Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning. CoRR abs/1702.03274 (2017)
[i3]
  - https://dblp.org/rec/journals/corr/abs-1709-00503
Kavosh Asadi, Cameron Allen, Melrose Roderick, Abdel-rahman Mohamed, George Dimitri Konidaris, Michael L. Littman:
Mean Actor Critic. CoRR abs/1709.00503 (2017)
2016
[i2]
  - https://dblp.org/rec/journals/corr/AsadiL16
Kavosh Asadi, Michael L. Littman:
A New Softmax Operator for Reinforcement Learning. CoRR abs/1612.05628 (2016)
[i1]
  - https://dblp.org/rec/journals/corr/AsadiW16
Kavosh Asadi, Jason D. Williams:
Sample-efficient Deep Reinforcement Learning for Dialog Control. CoRR abs/1612.06000 (2016)
