Zhaohan Daniel Guo

Cited by

	All	Since 2019
Citations	7759	7678
h-index	16	16
i10-index	19	18

3000

1500

750

2250

201820192020202120222023202446 72 248 1117 2241 2955 1035

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityVerified email at cs.stanford.edu
Philip ThomasUniversity of Massachusetts AmherstVerified email at cs.umass.edu
Shayan DoroudiAssistant Professor at the University of California, IrvineVerified email at uci.edu
Yao LiuAmazonVerified email at stanford.edu

Zhaohan Daniel Guo

DeepMind

Verified email at google.com - Homepage

Reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Bootstrap your own latent-a new approach to self-supervised learning JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Advances in neural information processing systems 33, 21271-21284, 2020	5874	2020
Agent57: Outperforming the atari human benchmark AP Badia, B Piot, S Kapturowski, P Sprechmann, A Vitvitskyi, ZD Guo, ... International conference on machine learning, 507-517, 2020	600	2020
Never give up: Learning directed exploration strategies AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	312	2020
Joint semantic utterance classification and slot filling with recursive neural networks D Guo, G Tur, W Yih, G Zweig 2014 IEEE Spoken Language Technology Workshop (SLT), 554-559, 2014	241	2014
Bootstrap latent-predictive representations for multitask reinforcement learning ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar International Conference on Machine Learning, 3875-3886, 2020	138	2020
Neural predictive belief representations ZD Guo, MG Azar, B Piot, BA Pires, R Munos arXiv preprint arXiv:1811.06407, 2018	83	2018
k. kavukcuoglu, R JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ... Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020	63	2020
A pac rl algorithm for episodic pomdps ZD Guo, S Doroudi, E Brunskill Artificial Intelligence and Statistics, 510-518, 2016	62	2016
A general theoretical paradigm to understand learning from human preferences MG Azar, ZD Guo, B Piot, R Munos, M Rowland, M Valko, D Calandriello International Conference on Artificial Intelligence and Statistics, 4447-4455, 2024	53	2024
Byol-explore: Exploration by bootstrapped prediction Z Guo, S Thakoor, M Pîslar, B Avila Pires, F Altché, C Tallec, A Saade, ... Advances in neural information processing systems 35, 31855-31870, 2022	53	2022
Using options and covariance testing for long horizon off-policy policy evaluation Z Guo, PS Thomas, E Brunskill Advances in Neural Information Processing Systems 30, 2017	47	2017
Geometric entropic exploration ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ... arXiv preprint arXiv:2101.02055, 2021	38	2021
Bootstrap your own latent: A new approach to self-supervised learning. arXiv JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ... arXiv preprint arXiv:2006.07733, 2020	36	2020
Concurrent pac rl Z Guo, E Brunskill Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015	29	2015
Understanding self-predictive learning for reinforcement learning Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ... International Conference on Machine Learning, 33632-33656, 2023	21	2023
Pac continuous state online multitask reinforcement learning with identification Y Liu, Z Guo, E Brunskill Proceedings of the 2016 International Conference on Autonomous Agents …, 2016	19	2016
Nash learning from human feedback R Munos, M Valko, D Calandriello, MG Azar, M Rowland, ZD Guo, Y Tang, ... arXiv preprint arXiv:2312.00886, 2023	14	2023
Directed exploration for reinforcement learning ZD Guo, E Brunskill arXiv preprint arXiv:1906.07805, 2019	11	2019
Sample efficient feature selection for factored mdps ZD Guo, E Brunskill arXiv preprint arXiv:1703.03454, 2017	11	2017
Never give up: Learning directed exploration strategies. arXiv AP Badia, P Sprechmann, A Vitvitskyi, D Guo, B Piot, S Kapturowski, ... arXiv preprint arXiv:2002.06038, 2020	8	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors