Eugene Kharitonov
Eugene Kharitonov
Google DeepMind
Verified email at - Homepage
Cited by
Cited by
Libri-light: A benchmark for asr with limited or no supervision
J Kahn, M Riviere, W Zheng, E Kharitonov, Q Xu, PE Mazaré, J Karadayi, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
AudioLM: a language modeling approach to audio generation
Z Borsos, R Marinier, D Vincent, E Kharitonov, O Pietquin, M Sharifi, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
On generative spoken language modeling from raw audio
K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ...
Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021
Speech resynthesis from discrete disentangled self-supervised representations
A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ...
arXiv preprint arXiv:2104.00355, 2021
Compositionality and generalization in emergent languages
R Chaabouni, E Kharitonov, D Bouchacourt, E Dupoux, M Baroni
arXiv preprint arXiv:2004.09124, 2020
Data augmenting contrastive learning of speech representations in the time domain
E Kharitonov, M Rivière, G Synnaeve, L Wolf, PE Mazaré, M Douze, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 215-222, 2021
Speak, Read and Prompt: High-fidelity Text-to-Speech with Minimal Supervision
E Kharitonov, D Vincent, Z Borsos, R Marinier, S Girgin, O Pietquin, ...
arXiv preprint arXiv:2302.03540, 2023
Anti-efficient encoding in emergent communication
R Chaabouni, E Kharitonov, E Dupoux, M Baroni
Advances in Neural Information Processing Systems 32, 2019
Audiopalm: A large language model that can speak and listen
PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...
arXiv preprint arXiv:2306.12925, 2023
The zero resource speech benchmark 2021: Metrics and baselines for unsupervised spoken language modeling
TA Nguyen, M de Seyssel, P Rozé, M Rivière, E Kharitonov, A Baevski, ...
arXiv preprint arXiv:2011.11588, 2020
Text-free prosody-aware generative spoken language modeling
E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ...
arXiv preprint arXiv:2109.03264, 2021
EGG: a toolkit for research on Emergence of lanGuage in Games
E Kharitonov, R Chaabouni, D Bouchacourt, M Baroni
arXiv preprint arXiv:1907.00852, 2019
Generative spoken dialogue language modeling
TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ...
Transactions of the Association for Computational Linguistics 11, 250-266, 2023
Soundstorm: Efficient parallel audio generation
Z Borsos, M Sharifi, D Vincent, E Kharitonov, N Zeghidour, M Tagliasacchi
arXiv preprint arXiv:2305.09636, 2023
Communicating artificial neural networks develop efficient color-naming systems
R Chaabouni, E Kharitonov, E Dupoux, M Baroni
Proceedings of the National Academy of Sciences 118 (12), e2016569118, 2021
Textless speech emotion conversion using discrete and decomposed representations
F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ...
arXiv preprint arXiv:2111.07402, 2021
The zero resource speech challenge 2021: Spoken language modelling
E Dunbar, M Bernard, N Hamilakis, TA Nguyen, M De Seyssel, P Rozé, ...
arXiv preprint arXiv:2104.14700, 2021
Federated online learning to rank with evolution strategies
E Kharitonov
Proceedings of the Twelfth ACM International Conference on Web Search and …, 2019
Entropy minimization in emergent languages
E Kharitonov, R Chaabouni, D Bouchacourt, M Baroni
arXiv preprint arXiv:1905.13687, 2019
What they do when in doubt: a study of inductive biases in seq2seq learners
E Kharitonov, R Chaabouni
arXiv preprint arXiv:2006.14953, 2020
The system can't perform the operation now. Try again later.
Articles 1–20