Thomas Wolf

Cited by

	All	Since 2019
Citations	30134	29776
h-index	31	30
i10-index	44	42

Citations per year

Year	Citations
2019	240
2020	2200
2021	4329
2022	6482
2023	9714
2024	6762

Public access

4 articles available
0 articles not available

Based on funding mandates

Co-authors

Victor Sanh	Hugging Face	Verified email at huggingface.co
Julien Chaumond	Hugging Face	Verified email at huggingface.co
Lysandre Debut	Machine Learning Engineer, Hugging Face	Verified email at huggingface.co
Clément Delangue	Hugging Face	Verified email at huggingface.co
Yacine Jernite	Research Scientist, HuggingFace	Verified email at cs.nyu.edu
Quentin Lhoest	Hugging Face	Verified email at huggingface.co
Alexander M. Rush	Associate Professor, Cornell University	Verified email at cornell.edu
Joe Davison	University of Utah	Verified email at utah.edu
Canwen Xu	University of California, San Diego	Verified email at ucsd.edu
Patrick von Platen	Research Engineer at Hugging Face	Verified email at huggingface.co
Julien Plu	Research Scientist, Lettria	Verified email at eurecom.fr
Morgan Funtowicz	Hugging Face	Verified email at huggingface.co
Mariama DRAMÉ	Student	Verified email at edu.em-lyon.com
Sam Shleifer	Facebook AI Research	Verified email at fb.com
Jérôme Lesueur	Professor of Physics, ESPCI Paris, Université PSL, CNRS	Verified email at mines-nancy.org
Lewis Tunstall	Hugging Face	Verified email at itp.unibe.ch
Rémi Louf	🤗 Hugging Face Inc.	Verified email at huggingface.co
Nicolas Bergeal	ESPCI Paris - CNRS - PSL University - Sorbonne Université	Verified email at espci.fr
Teven Le Scao	Hugging Face	Verified email at huggingface.co
Nathan Lambert	Research Scientist, Allen AI	Verified email at allenai.org

Thomas Wolf

Co-founder at HuggingFace

Verified email at polytechnique.edu

machine learning, deep learning, natural language processing, computational linguistics, artificial


Title	Cited by	Year
Transformers: State-of-the-art natural language processing. T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ... Proceedings of the 2020 conference on empirical methods in natural language …, 2020	14448*	2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. V Sanh, L Debut, J Chaumond, T Wolf. arXiv preprint arXiv:1910.01108, 2019	7160	2019
Multitask prompted training enables zero-shot task generalization. V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... arXiv preprint arXiv:2110.08207, 2021	1398	2021
Bloom: A 176b-parameter open-access multilingual language model. T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1350	2023
Transfer learning in natural language processing. S Ruder, ME Peters, S Swayamdipta, T Wolf. Proceedings of the 2019 conference of the North American chapter of the …, 2019	707	2019
Starcoder: may the source be with you! R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... arXiv preprint arXiv:2305.06161, 2023	513*	2023
Transfertransfo: A transfer learning approach for neural network based conversational agents. T Wolf, V Sanh, J Chaumond, C Delangue. arXiv preprint arXiv:1901.08149, 2019	511	2019
Datasets: A community library for natural language processing. Q Lhoest, AV Del Moral, Y Jernite, A Thakur, P Von Platen, S Patil, ... arXiv preprint arXiv:2109.02846, 2021	479*	2021
Movement pruning: Adaptive sparsity by fine-tuning. V Sanh, T Wolf, A Rush. Advances in neural information processing systems 33, 20378-20389, 2020	400	2020
Two-dimensional superconductivity at a Mott insulator/band insulator interface LaTiO₃/SrTiO₃. J Biscaras, N Bergeal, A Kushwaha, T Wolf, A Rastogi, RC Budhani, ... Nature communications 1 (1), 89, 2010	343	2010
Natural language processing with transformers. L Tunstall, L Von Werra, T Wolf. O'Reilly Media, Inc., 2022	305	2022
Diffusers: State-of-the-art diffusion models. P Von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ...	292	2022
A hierarchical multi-task approach for learning embeddings from semantic tasks. V Sanh, T Wolf, S Ruder. Proceedings of the AAAI conference on artificial intelligence 33 (01), 6949-6956, 2019	263	2019
Zephyr: Direct distillation of lm alignment. L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	245	2023
Open llm leaderboard. E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... Hugging Face, 2023	200	2023
The stack: 3 tb of permissively licensed source code. D Kocetkov, R Li, LB Allal, J Li, C Mou, CM Ferrandis, Y Jernite, M Mitchell, ... arXiv preprint arXiv:2211.15533, 2022	179	2022
Scaling data-constrained language models. N Muennighoff, A Rush, B Barak, T Le Scao, N Tazi, A Piktus, S Pyysalo, ... Advances in Neural Information Processing Systems 36, 2024	119	2024
Large-scale transfer learning for natural language generation. S Golovanov, R Kurbanov, S Nikolenko, K Truskovskyi, A Tselousov, ... Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	103	2019
Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. V Sanh, L Debut, J Chaumond, T Wolf. CoRR abs/1910.01108, 2019. URL: http://arxiv.org/abs/1910.01108	102	2019
Grounding large language models in interactive environments with online reinforcement learning. T Carta, C Romac, T Wolf, S Lamprier, O Sigaud, PY Oudeyer. International Conference on Machine Learning, 3676-3713, 2023	91	2023
