Jonathan Uesato

Cited by

	All	Since 2019
Citations	6109	5955
h-index	25	25
i10-index	29	29

2200

1100

550

1650

2017201820192020202120222023202423 113 340 583 729 1247 2176 866

Public access

View all

1 article

0 articles

available

not available

Based on funding mandates

Jonathan Uesato

Unknown affiliation

Verified email at mit.edu


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Scaling language models: Methods, analysis & insights from training gopher JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ... arXiv preprint arXiv:2112.11446, 2021	758	2021
Adversarial risk and the dangers of evaluating against weak attacks J Uesato, B O’donoghue, P Kohli, A Oord International Conference on Machine Learning, 5025-5034, 2018	619	2018
Ethical and social risks of harm from language models L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ... arXiv preprint arXiv:2112.04359, 2021	608	2021
On the effectiveness of interval bound propagation for training verifiably robust models S Gowal, K Dvijotham, R Stanforth, R Bunel, C Qin, J Uesato, ... arXiv preprint arXiv:1810.12715, 2018	478	2018
Robustfill: Neural program learning under noisy i/o J Devlin, J Uesato, S Bhupatiraju, R Singh, A Mohamed, P Kohli International conference on machine learning, 990-998, 2017	428	2017
Technical report on the cleverhans v2. 1.0 adversarial examples library N Papernot, F Faghri, N Carlini, I Goodfellow, R Feinman, A Kurakin, ... arXiv preprint arXiv:1610.00768, 2016	390	2016
Are labels required for improving adversarial robustness? JB Alayrac, J Uesato, PS Huang, A Fawzi, R Stanforth, P Kohli Advances in Neural Information Processing Systems 32, 2019	338	2019
Robustness via curvature regularization, and vice versa SM Moosavi-Dezfooli, A Fawzi, J Uesato, P Frossard Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	323	2019
Taxonomy of risks posed by language models L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ... Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	315	2022
Improving alignment of dialogue agents via targeted human judgements A Glaese, N McAleese, M Trębacz, J Aslanides, V Firoiu, T Ewalds, ... arXiv preprint arXiv:2209.14375, 2022	295	2022
Uncovering the limits of adversarial training against norm-bounded adversarial examples S Gowal, C Qin, J Uesato, T Mann, P Kohli arXiv preprint arXiv:2010.03593, 2020	295	2020
Training verified learners with learned verifiers K Dvijotham, S Gowal, R Stanforth, R Arandjelovic, B O'Donoghue, ... arXiv preprint arXiv:1805.10265, 2018	170	2018
Scalable verified training for provably robust image classification S Gowal, KD Dvijotham, R Stanforth, R Bunel, C Qin, J Uesato, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	164	2019
Challenges in detoxifying language models J Welbl, A Glaese, J Uesato, S Dathathri, J Mellor, LA Hendricks, ... arXiv preprint arXiv:2109.07445, 2021	150	2021
Enabling certification of verification-agnostic networks via memory-efficient semidefinite programming S Dathathri, K Dvijotham, A Kurakin, A Raghunathan, J Uesato, RR Bunel, ... Advances in Neural Information Processing Systems 33, 5318-5331, 2020	101	2020
Specification gaming: the flip side of AI ingenuity V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... DeepMind Blog 3, 2020	87	2020
Cyprien de Masson d’Autume JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...	80	2021
Rigorous agent evaluation: An adversarial approach to uncover catastrophic failures J Uesato, A Kumar, C Szepesvari, T Erez, A Ruderman, K Anderson, ... arXiv preprint arXiv:1812.01647, 2018	76	2018
An alternative surrogate loss for pgd-based adversarial testing S Gowal, J Uesato, C Qin, PS Huang, T Mann, P Kohli arXiv preprint arXiv:1910.09338, 2019	75	2019
Solving math word problems with process-and outcome-based feedback J Uesato, N Kushman, R Kumar, F Song, N Siegel, L Wang, A Creswell, ... arXiv preprint arXiv:2211.14275, 2022	73	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by