Bilal Piot

Cited by

	All	Since 2019
Citations	16446	15597
h-index	37	35
i10-index	48	46

4500

2250

1125

3375

2014201520162017201820192020202120222023202448 45 91 130 469 849 1318 2458 3703 4475 2777

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Mohammad Gheshlaghi AzarCohereVerified email at google.com
Zhaohan Daniel GuoDeepMindVerified email at google.com
Rémi MunosGoogle DeepMindVerified email at inria.fr
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Florent AltchéResearch Engineer, DeepMindVerified email at google.com
Jean-bastien GrillVerified email at google.com
Florian STRUBCohereVerified email at cohere.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Corentin TallecDeepMindVerified email at google.com
Pierre RichemondGoogle DeepMindVerified email at deepmind.com
Charles BlundellResearch Scientist at DeepMindVerified email at google.com
Todd HesterWaymoVerified email at waymo.com
Pablo SprechmannResearch Scientist at Google DeepMindVerified email at google.com
Steven KapturowskiDeepMindVerified email at google.com
Mel VecerikDeepMind, University College LondonVerified email at ucl.ac.uk
Dan HorganGoogle DeepMindVerified email at google.com
Adrià Puigdomènech BadiaDeepMindVerified email at google.com
Alex VitvitskyiDeepMindVerified email at google.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLVerified email at google.com

Bilal Piot

Google Deepmind

Verified email at google.com

reinforcement learning inverse reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... arXiv preprint arXiv:2006.07733, 2020	6108	2020
Rainbow: Combining improvements in deep reinforcement learning M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2641	2018
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1227	2018
Noisy Networks for Exploration M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295 2018, 2017	1174*	2017
Leveraging demonstrations for deep reinforcement learning on robotics problems with sparse rewards M Vecerik, T Hester, J Scholz, F Wang, O Pietquin, B Piot, N Heess, ... arXiv preprint arXiv:1707.08817, 2017	785	2017
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020	644	2020
k. kavukcuoglu, R JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020	455*	2020
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	336	2020
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	239	2020
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017	179	2017
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	172	2022
A general theoretical paradigm to understand learning from human preferences MG Azar, ZD Guo, B Piot, R Munos, M Rowland, M Valko, D Calandriello International Conference on Artificial Intelligence and Statistics, 4447-4455, 2024	159	2024
Bootstrap latent-predictive representations for multitask reinforcement learning ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar International Conference on Machine Learning, 3875-3886, 2020	145	2020
Observe and look further: Achieving consistent performance on atari T Pohlen, B Piot, T Hester, MG Azar, D Horgan, D Budden, G Barth-Maron, ... arXiv preprint arXiv:1805.11593, 2018	137	2018
Inverse reinforcement learning through structured classification E Klein, M Geist, B Piot, O Pietquin Advances in neural information processing systems 25, 2012	122	2012
Approximate dynamic programming for two-player zero-sum Markov games J Perolat, B Scherrer, B Piot, O Pietquin International Conference on Machine Learning, 1321-1329, 2015	119	2015
Bridging the gap between imitation learning and inverse reinforcement learning B Piot, M Geist, O Pietquin IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	108	2016
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos arXiv preprint arXiv:1704.04651, 2017	100	2017
Hindsight credit assignment A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ... Advances in neural information processing systems 32, 2019	96	2019
Byol works even without batch statistics PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ... arXiv preprint arXiv:2010.10241, 2020	94	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors