Thiago D. Simão

Person information

affiliation: Radboud University, Nijmegen, The Netherlands

2020 – today

2024
[c21]
Maris F. L. Galesloot, Thiago D. Simão, Sebastian Junges, Nils Jansen:
Factored Online Planning in Many-Agent POMDPs. AAAI 2024: 17407-17415
[c20]
Merlijn Krale, Thiago D. Simão, Jana Tumova, Nils Jansen:
Robust Active Measuring under Model Uncertainty. AAAI 2024: 21276-21284
[c19]
Christoph Schmidl, Thiago D. Simão, Nils Jansen:
A Supervised Learning Approach to Robust Reinforcement Learning for Job Shop Scheduling. ICAART (3) 2024: 1324-1335
2023
[b1]
Thiago D. Simão:
Safe Online and Offline Reinforcement Learning. Delft University of Technology, Netherlands, 2023
[j2]
Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan:
Safety-constrained reinforcement learning with a distributional safety critic. Mach. Learn. 112(3): 859-887 (2023)
[j1]
Thom S. Badings, Thiago D. Simão, Marnix Suilen, Nils Jansen:
Decision-making under uncertainty: beyond probabilities. Int. J. Softw. Tools Technol. Transf. 25(3): 375-391 (2023)
[c18]
Thiago D. Simão, Marnix Suilen, Nils Jansen:
Safe Policy Improvement for POMDPs via Finite-State Controllers. AAAI 2023: 15109-15117
[c17]
Merlijn Krale, Thiago D. Simão, Nils Jansen:
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring. ICAPS 2023: 212-220
[c16]
Qisong Yang, Thiago D. Simão, Nils Jansen, Simon H. Tindemans, Matthijs T. J. Spaan:
Reinforcement Learning by Guided Safe Exploration. ECAI 2023: 2858-2865
[c15]
Dennis Gross, Thiago D. Simão, Nils Jansen, Guillermo A. Pérez:
Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking. ICAART (3) 2023: 501-508
[c14]
Yannick Hogewind, Thiago D. Simão, Tal Kachman, Nils Jansen:
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation. ICLR 2023
[c13]
Alberto Castellini, Federico Bianchi, Edoardo Zorzi, Thiago D. Simão, Alessandro Farinelli, Matthijs T. J. Spaan:
Scalable Safe Policy Improvement via Monte Carlo Tree Search. ICML 2023: 3732-3756
[c12]
Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen:
More for Less: Safe Policy Improvement with Stronger Performance Guarantees. IJCAI 2023: 4406-4415
[c11]
Wietze Koops, Nils Jansen, Sebastian Junges, Thiago D. Simão:
Recursive Small-Step Multi-Agent A* for Dec-POMDPs. IJCAI 2023: 5402-5410
[c10]
Cevahir Köprülü, Thiago D. Simão, Nils Jansen, Ufuk Topcu:
Risk-aware curriculum generation for heavy-tailed task distributions. UAI 2023: 1132-1142
[i11]
Thiago D. Simão, Marnix Suilen, Nils Jansen:
Safe Policy Improvement for POMDPs via Finite-State Controllers. CoRR abs/2301.04939 (2023)
[i10]
Thom S. Badings, Thiago D. Simão, Marnix Suilen, Nils Jansen:
Decision-Making Under Uncertainty: Beyond Probabilities. CoRR abs/2303.05848 (2023)
[i9]
Merlijn Krale, Thiago D. Simão, Nils Jansen:
Act-Then-Measure: Reinforcement Learning for Partially Observable Environments with Active Measuring. CoRR abs/2303.08271 (2023)
[i8]
Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen:
More for Less: Safe Policy Improvement With Stronger Performance Guarantees. CoRR abs/2305.07958 (2023)
[i7]
Qisong Yang, Thiago D. Simão, Nils Jansen, Simon H. Tindemans, Matthijs T. J. Spaan:
Reinforcement Learning by Guided Safe Exploration. CoRR abs/2307.14316 (2023)
[i6]
Merlijn Krale, Thiago D. Simão, Jana Tumova, Nils Jansen:
Robust Active Measuring under Model Uncertainty. CoRR abs/2312.11227 (2023)
[i5]
Maris F. L. Galesloot, Thiago D. Simão, Sebastian Junges, Nils Jansen:
Factored Online Planning in Many-Agent POMDPs. CoRR abs/2312.11434 (2023)
2022
[c9]
Danial Kamran, Thiago D. Simão, Qisong Yang, Canmanie T. Ponnambalam, Johannes Fischer, Matthijs T. J. Spaan, Martin Lauer:
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning. ITSC 2022: 4017-4023
[c8]
Marnix Suilen, Thiago D. Simão, David Parker, Nils Jansen:
Robust Anytime Learning of Markov Decision Processes. NeurIPS 2022
[i4]
Marnix Suilen, Thiago D. Simão, Nils Jansen, David Parker:
Robust Anytime Learning of Markov Decision Processes. CoRR abs/2205.15827 (2022)
[i3]
Yannick Hogewind, Thiago D. Simão, Tal Kachman, Nils Jansen:
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation. CoRR abs/2210.01801 (2022)
[i2]
Dennis Gross, Thiago D. Simão, Nils Jansen, Guillermo A. Pérez:
Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking. CoRR abs/2212.05337 (2022)
2021
[c7]
Qisong Yang, Thiago D. Simão, Simon H. Tindemans, Matthijs T. J. Spaan:
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning. AAAI 2021: 10639-10646
[c6]
Thiago D. Simão, Nils Jansen, Matthijs T. J. Spaan:
AlwaysSafe: Reinforcement Learning without Safety Constraint Violations during Training. AAMAS 2021: 1226-1235
2020
[c5]
Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with an Estimated Baseline Policy. AAMAS 2020: 1269-1277

2010 – 2019

2019
[c4]
Thiago D. Simão, Matthijs T. J. Spaan:
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments. AAAI 2019: 4967-4974
[c3]
Thiago D. Simão, Matthijs T. J. Spaan:
Structure Learning for Safe Policy Improvement. IJCAI 2019: 3453-3459
[c2]
Thiago D. Simão:
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments. IJCAI 2019: 6460-6461
[i1]
Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes:
Safe Policy Improvement with an Estimated Baseline Policy. CoRR abs/1909.05236 (2019)
2018
[c1]
Ignasi Andrés, Leliane Nunes de Barros, Denis Deratani Mauá, Thiago D. Simão:
When a Robot Reaches Out for Human Help. IBERAMIA 2018: 277-289
