Honglie Chen
Meta AI, University of Oxford
Verified email at meta.com
Title
Cited by
Year
VGGSound: A large-scale audio-visual dataset
H Chen, W Xie, A Vedaldi, A Zisserman
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
473 citations, 2020
Localizing Visual Sounds the Hard Way
H Chen, W Xie, T Afouras, A Nagrani, A Vedaldi, A Zisserman
Conference on Computer Vision and Pattern Recognition (CVPR), 2021
177 citations, 2021
Auto-AVSR: Audio-visual speech recognition with automatic labels
P Ma, A Haliassos, A Fernandez-Lopez, H Chen, S Petridis, M Pantic
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
68 citations, 2023
Audio-Visual Synchronisation in the Wild
H Chen, W Xie, T Afouras, A Nagrani, A Vedaldi, A Zisserman
British Machine Vision Conference (BMVC), 2021
36 citations, 2021
AutoCorrect: Deep Inductive Alignment of Noisy Geometric Annotations
H Chen, W Xie, A Vedaldi, A Zisserman
British Machine Vision Conference (BMVC), 2019
17 citations, 2019
SynthVSR: Scaling up visual speech recognition with synthetic supervision
X Liu, E Lakomkin, K Vougioukas, P Ma, H Chen, R Xie, M Doulaty, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
14 citations, 2023
SparseVSR: Lightweight and noise robust visual speech recognition
A Fernandez-Lopez, H Chen, P Ma, A Haliassos, S Petridis, M Pantic
arXiv preprint arXiv:2307.04552, 2023
4 citations, 2023
Localizing visual sounds the hard way
A Vedaldi, H Chen, W Xie, T Afouras, A Nagrani, A Zisserman
Institute of Electrical and Electronics Engineers, 2021
1 citation, 2021
RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement
H Chen, R Mira, S Petridis, M Pantic
arXiv preprint arXiv:2407.07825, 2024
2024
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization
A Fernandez-Lopez, H Chen, P Ma, L Yin, Q Xiao, S Petridis, S Liu, ...
arXiv preprint arXiv:2406.17614, 2024
2024
Learning with multimodal self-supervision
H Chen
University of Oxford, 2021
2021
Supplementary Material: SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
X Liu, E Lakomkin, K Vougioukas, P Ma, H Chen, R Xie, M Doulaty, ...
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition–Extended Abstract
A Fernandez-Lopez, H Chen, P Ma, A Haliassos, S Petridis, M Pantic