Katherine Lee

Cited by

	All	Since 2019
Citations	27015	26990
h-index	22	22
i10-index	24	24

11000

5500

2750

8250

20202021202220232024735 2058 4437 10059 9586

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Adam RobertsGoogle DeepMindVerified email at google.com
Colin RaffelUniversity of Toronto, Vector Institute and Hugging FaceVerified email at cs.toronto.edu
Nicholas CarliniGoogle DeepMindVerified email at google.com
Florian TramèrAssistant Professor of Computer Science, ETH ZurichVerified email at inf.ethz.ch
Sharan NarangResearch Engineer, Meta AIVerified email at meta.com
Matthew JagielskiGoogle DeepMindVerified email at google.com
Daphne IppolitoGoogle BrainVerified email at google.com
Yanqi ZhouGoogleVerified email at google.com
Noam ShazeerCharacter.aiVerified email at character.ai
Chiyuan ZhangGoogle ResearchVerified email at google.com
Eric WallaceUC BerkeleyVerified email at berkeley.edu
A. Feder CooperCo-founder of The GenLaw Center | ML Postdoc | Incoming CS ProfessorVerified email at microsoft.com
Milad NasrGoogle DeepMindVerified email at srxzr.com
Christopher A. Choquette-ChooGoogle DeepMindVerified email at google.com
James GrimmelmannCornell Tech and Cornell Law SchoolVerified email at cornell.edu
Orhan FiratGoogle AIVerified email at google.com
David SussilloMeta Reality Labs and Adjunct Professor @ StanfordVerified email at stanford.edu

Katherine Lee

Researcher, Google DeepMind

Verified email at google.com - Homepage

natural language processing machine learning privacy security


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Exploring the limits of transfer learning with a unified text-to-text transformer C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ... The Journal of Machine Learning Research 21 (1), 5485-5551, 2020	16731	2020
Palm: Scaling language modeling with pathways A Chowdhery, S Narang, J Devlin, M Bosma, G Mishra, A Roberts, ... Journal of Machine Learning Research 24 (240), 1-113, 2023	4083	2023
Extracting Training Data from Large Language Models. N Carlini, F Tramer, E Wallace, M Jagielski, A Herbert-Voss, K Lee, ... USENIX Security Symposium 6, 2021	1439	2021
PaLM 2 Technical Report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023	1056	2023
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
Quantifying Memorization Across Neural Language Models N Carlini, D Ippolito, M Jagielski, K Lee, F Tramer, C Zhang arXiv preprint arXiv:2202.07646, 2022	478	2022
Deduplicating training data makes language models better K Lee, D Ippolito, A Nystrom, C Zhang, D Eck, C Callison-Burch, N Carlini arXiv preprint arXiv:2107.06499, 2021	415	2021
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	254	2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	196	2024
WT5?! Training Text-to-Text Models to Explain their Predictions S Narang, C Raffel, K Lee, A Roberts, N Fiedel, K Malkan arXiv preprint arXiv:2004.14546, 2020	183	2020
Are aligned neural networks adversarially aligned? N Carlini, M Nasr, CA Choquette-Choo, M Jagielski, I Gao, PWW Koh, ... Advances in Neural Information Processing Systems 36, 2024	159	2024
What Does it Mean for a Language Model to Preserve Privacy? H Brown, K Lee, F Mireshghallah, R Shokri, F Tramèr 2022 ACM Conference on Fairness, Accountability, and Transparency, 2280-2292, 2022	154	2022
Scalable Extraction of Training Data from (Production) Language Models M Nasr, N Carlini, J Hayase, M Jagielski, AF Cooper, D Ippolito, ... arXiv preprint arXiv:2311.17035, 2023	127	2023
Hallucinations in neural machine translation K Lee, O Firat, A Agarwal, C Fannjiang, D Sussillo	123	2018
Counterfactual memorization in neural language models C Zhang, D Ippolito, K Lee, M Jagielski, F Tramèr, N Carlini Advances in Neural Information Processing Systems 36, 39321-39362, 2023	98	2023
Preventing Verbatim Memorization in Language Models Gives a False Sense of Privacy D Ippolito, F Tramèr, M Nasr, C Zhang, M Jagielski, K Lee, ... arXiv preprint arXiv:2210.17546, 2022	92	2022
Propagation of information along the cortical hierarchy as a function of attention while reading and listening to stories M Regev, E Simony, K Lee, KM Tan, J Chen, U Hasson Cerebral Cortex 29 (10), 4017-4034, 2019	72	2019
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity S Longpre, G Yauney, E Reif, K Lee, A Roberts, B Zoph, D Zhou, J Wei, ... arXiv preprint arXiv:2305.13169, 2023	68	2023
Measuring Forgetting of Memorized Training Examples M Jagielski, O Thakkar, F Tramèr, D Ippolito, K Lee, N Carlini, E Wallace, ... arXiv preprint arXiv:2207.00099, 2022	67	2022
Madlad-400: A multilingual and document-level large audited dataset S Kudugunta, I Caswell, B Zhang, X Garcia, D Xin, A Kusupati, R Stella, ... Advances in Neural Information Processing Systems 36, 2024	45	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors