Follow
Thiago D. Simão
Title
Cited by
Cited by
Year
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
982021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
442021
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
30*2020
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
Proceedings of the AAAI Conference on Artificial Intelligence 33, 4967-4974, 2019
302019
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
292023
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
Advances in Neural Information Processing Systems 35, 28790-28802, 2022
182022
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
Proceedings of the 28th International Joint Conference on Artificial …, 2019
112019
Safe policy improvement for POMDPs via finite-state controllers
TD Simão, M Suilen, N Jansen
Proceedings of the AAAI Conference on Artificial Intelligence 37 (12), 15109 …, 2023
82023
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives
T Badings, TD Simão, M Suilen, N Jansen
International Journal on Software Tools for Technology Transfer 25 (3), 375-391, 2023
82023
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
arXiv preprint arXiv:2210.01801, 2022
82022
Reinforcement Learning by Guided Safe Exploration
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
arXiv preprint arXiv:2307.14316, 2023
7*2023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
42022
Act-then-measure: reinforcement learning for partially observable environments with active measuring
M Krale, TD Simão, N Jansen
Proceedings of the International Conference on Automated Planning and …, 2023
32023
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
arXiv preprint arXiv:2305.07958, 2023
32023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan
International Conference on Machine Learning, 3732-3756, 2023
32023
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
22017
Utilização de algoritmos genéticos para otimização de soluções para o timetabling escolar
TD SIMÃO
Tese apresentada ao Departamento de Ciência da Computação da Universidade …, 2013
22013
Risk-aware curriculum generation for heavy-tailed task distributions
C Koprulu, TD Simão, N Jansen, U Topcu
Uncertainty in Artificial Intelligence, 1132-1142, 2023
12023
Recursive small-step multi-agent A* for dec-POMDPs
W Koops, N Jansen, S Junges, TD Simão
Sl: IJCAI, 2023
12023
When a Robot Reaches Out for Human Help
I Andrés, LN de Barros, DD Mauá, TD Simão
Ibero-American Conference on Artificial Intelligence, 277-289, 2018
12018
The system can't perform the operation now. Try again later.
Articles 1–20