Noise estimation for generative diffusion models R San-Roman, E Nachmani, L Wolf arXiv preprint arXiv:2104.02600, 2021 | 117 | 2021 |
Seamless: Multilingual Expressive and Streaming Speech Translation L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ... arXiv preprint arXiv:2312.05187, 2023 | 115 | 2023 |
Non Gaussian Denoising Diffusion Models E Nachmani, RS Roman, L Wolf arXiv preprint arXiv:2106.07582, 2021 | 67 | 2021 |
Proactive detection of voice cloning with localized watermarking RS Roman, P Fernandez, A Défossez, T Furon, T Tran, H Elsahar arXiv preprint arXiv:2401.17264, 2024 | 38 | 2024 |
Denoising diffusion gamma models E Nachmani, RS Roman, L Wolf arXiv preprint arXiv:2110.05948, 2021 | 27 | 2021 |
From discrete tokens to high-fidelity audio using multi-band diffusion R San Roman, Y Adi, A Deleforge, R Serizel, G Synnaeve, A Défossez Advances in Neural Information Processing Systems 36, 1526-1538, 2023 | 24 | 2023 |
Latent watermarking of audio generative models R San Roman, P Fernandez, A Deleforge, Y Adi, R Serizel ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and …, 2025 | 7 | 2025 |
Large Concept Models: Language Modeling in a Sentence Representation Space L Barrault, PA Duquenne, M Elbayad, A Kozhevnikov, B Alastruey, ... arXiv e-prints, arXiv:2412.08821, 2024 | 2 | 2024 |
MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling S Rouard, R San Roman, Y Adi, A Roebel ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and …, 2025 | | 2025 |