Gerald Tesauro

Joint publications with Miao Liu 0001

Publications

2023
[i28]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-17508
Tyler Malloy, Miao Liu, Matthew D. Riemer, Tim Klinger, Gerald Tesauro, Chris R. Sims:
Learning in Factored Domains with Information-Constrained Visual Representations. CoRR abs/2303.17508 (2023)
2022
[c67]
persistent URL: https://dblp.org/rec/conf/aaai/AbdulhaiKR0TH22
Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How:
Context-Specific Representation Abstraction for Deep Option Learning. AAAI 2022: 5959-5967
[c66]
persistent URL: https://dblp.org/rec/conf/nips/KimRLFESTH22
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How:
Influencing Long-Term Behavior in Multiagent Reinforcement Learning. NeurIPS 2022
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-00669
Junkyu Lee, Michael Katz, Don Joven Agravante, Miao Liu, Tim Klinger, Murray Campbell, Shirin Sohrabi, Gerald Tesauro:
AI Planning Annotation for Sample Efficient Reinforcement Learning. CoRR abs/2203.00669 (2022)
[i26]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-03535
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Michael Everett, Chuangchuang Sun, Gerald Tesauro, Jonathan P. How:
Influencing Long-Term Behavior in Multiagent Reinforcement Learning. CoRR abs/2203.03535 (2022)
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-16175
Dong-Ki Kim, Matthew Riemer, Miao Liu, Jakob N. Foerster, Gerald Tesauro, Jonathan P. How:
Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria. CoRR abs/2210.16175 (2022)
2021
[c64]
persistent URL: https://dblp.org/rec/conf/aaai/MalloyK0TRS21
Tyler Malloy, Tim Klinger, Miao Liu, Gerald Tesauro, Matthew Riemer, Chris R. Sims:
RL Generalization in a Theory of Mind Game Through a Sleep Metaphor (Student Abstract). AAAI 2021: 15841-15842
[c63]
persistent URL: https://dblp.org/rec/conf/cig/MalloySK0RT21
Tyler Malloy, Chris R. Sims, Tim Klinger, Miao Liu, Matthew Riemer, Gerald Tesauro:
Capacity-Limited Decentralized Actor-Critic for Multi-Agent Games. CoG 2021: 1-8
[c62]
persistent URL: https://dblp.org/rec/conf/icml/Kim0RSAHLTH21
Dong-Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How:
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. ICML 2021: 5541-5550
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2109-09876
Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How:
Context-Specific Representation Abstraction for Deep Option Learning. CoRR abs/2109.09876 (2021)
2020
[c60]
persistent URL: https://dblp.org/rec/conf/aaai/RiemerCR0T20
Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro:
On the Role of Weight Sharing During Deep Option Learning. AAAI 2020: 5519-5526
[c59]
persistent URL: https://dblp.org/rec/conf/atal/Kim0OLRHTMCH20
Dong-Ki Kim, Miao Liu, Shayegan Omidshafiei, Sebastian Lopez-Cot, Matthew Riemer, Golnaz Habibi, Gerald Tesauro, Sami Mourad, Murray Campbell, Jonathan P. How:
Learning Hierarchical Teaching Policies for Cooperative Agents. AAMAS 2020: 620-628
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-04646
Tyler Malloy, Chris R. Sims, Tim Klinger, Miao Liu, Matthew Riemer, Gerald Tesauro:
Deep RL With Information Constrained Policies: Generalization in Continuous Control. CoRR abs/2010.04646 (2020)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-00382
Dong-Ki Kim, Miao Liu, Matthew Riemer, Chuangchuang Sun, Marwa Abdulhai, Golnaz Habibi, Sebastian Lopez-Cot, Gerald Tesauro, Jonathan P. How:
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning. CoRR abs/2011.00382 (2020)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-11517
Tyler Malloy, Tim Klinger, Miao Liu, Matthew Riemer, Gerald Tesauro, Chris R. Sims:
Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games. CoRR abs/2011.11517 (2020)
2019
[c56]
persistent URL: https://dblp.org/rec/conf/aaai/OmidshafieiKLTR19
Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How:
Learning to Teach in Cooperative Multiagent Reinforcement Learning. AAAI 2019: 6128-6136
[c55]
persistent URL: https://dblp.org/rec/conf/iclr/RiemerCALRTT19
Matthew Riemer, Ignacio Cases, Robert Ajemian, Miao Liu, Irina Rish, Yuhai Tu, Gerald Tesauro:
Learning to Learn without Forgetting by Maximizing Transfer and Minimizing Interference. ICLR (Poster) 2019
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-1903-03216
Dong-Ki Kim, Miao Liu, Shayegan Omidshafiei, Sebastian Lopez-Cot, Matthew Riemer, Golnaz Habibi, Gerald Tesauro, Sami Mourad, Murray Campbell, Jonathan P. How:
Learning Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning. CoRR abs/1903.03216 (2019)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-13408
Matthew Riemer, Ignacio Cases, Clemens Rosenbaum, Miao Liu, Gerald Tesauro:
On the Role of Weight Sharing During Deep Option Learning. CoRR abs/1912.13408 (2019)
2018
[c53]
persistent URL: https://dblp.org/rec/conf/iclr/MachadoRGLTC18
Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell:
Eigenoption Discovery through the Deep Successor Representation. ICLR (Poster) 2018
[c49]
persistent URL: https://dblp.org/rec/conf/nips/RiemerLT18
Matthew Riemer, Miao Liu, Gerald Tesauro:
Learning Abstract Options. NeurIPS 2018: 10445-10455
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-1805-07830
Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How:
Learning to Teach in Cooperative Multiagent Reinforcement Learning. CoRR abs/1805.07830 (2018)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-11583
Matthew Riemer, Miao Liu, Gerald Tesauro:
Learning Abstract Options. CoRR abs/1810.11583 (2018)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-1810-11910
Matthew Riemer, Ignacio Cases, Robert Ajemian, Miao Liu, Irina Rish, Yuhai Tu, Gerald Tesauro:
Learning to Learn without Forgetting By Maximizing Transfer and Minimizing Interference. CoRR abs/1810.11910 (2018)
2017
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-1710-11089
Marlos C. Machado, Clemens Rosenbaum, Xiaoxiao Guo, Miao Liu, Gerald Tesauro, Murray Campbell:
Eigenoption Discovery through the Deep Successor Representation. CoRR abs/1710.11089 (2017)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-1712-04065
Miao Liu, Marlos C. Machado, Gerald Tesauro, Murray Campbell:
The Eigenoption-Critic Framework. CoRR abs/1712.04065 (2017)