Georg Heigold
Research Scientist, Google Inc.
Verified email at google.com
Title
Cited by
Year
An image is worth 16x16 words: Transformers for image recognition at scale
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
Cited by: 6122; Year: 2020
End-to-end text-dependent speaker verification
G Heigold, I Moreno, S Bengio, N Shazeer
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 621; Year: 2016
Small-footprint keyword spotting using deep neural networks
G Chen, C Parada, G Heigold
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 515; Year: 2014
ViViT: A video vision transformer
A Arnab, M Dehghani, G Heigold, C Sun, M Lučić, C Schmid
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 357; Year: 2021
An image is worth 16x16 words: Transformers for image recognition at scale. arXiv 2020
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
Cited by: 346; Year: 2020
Multilingual acoustic models using distributed deep neural networks
G Heigold, V Vanhoucke, A Senior, P Nguyen, MA Ranzato, M Devin, ...
2013 IEEE international conference on acoustics, speech and signal …, 2013
Cited by: 345; Year: 2013
Object-centric learning with slot attention
F Locatello, D Weissenborn, T Unterthiner, A Mahendran, G Heigold, ...
Advances in Neural Information Processing Systems 33, 11525-11538, 2020
Cited by: 236; Year: 2020
An empirical study of learning rates in deep neural networks for speech recognition
A Senior, G Heigold, MA Ranzato, K Yang
2013 IEEE international conference on acoustics, speech and signal …, 2013
Cited by: 165; Year: 2013
Word embeddings for speech recognition
S Bengio, G Heigold
Cited by: 162; Year: 2014
Sequence discriminative distributed training of long short-term memory recurrent neural networks
H Sak, O Vinyals, G Heigold, A Senior, E McDermott, R Monga, M Mao
Cited by: 150; Year: 2014
The RWTH Aachen University open source speech recognition system
D Rybach, C Gollan, G Heigold, B Hoffmeister, J Lööf, R Schlüter, H Ney
Tenth Annual Conference of the International Speech Communication Association, 2009
Cited by: 135; Year: 2009
The RWTH 2007 TC-STAR evaluation system for European English and Spanish
J Lööf, C Gollan, S Hahn, G Heigold, B Hoffmeister, C Plahl, D Rybach, ...
Interspeech, 2145-2148, 2007
Cited by: 78; Year: 2007
Asynchronous stochastic optimization for sequence training of deep neural networks
G Heigold, E McDermott, V Vanhoucke, A Senior, M Bacchiani
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 73; Year: 2014
A linguistic evaluation of rule-based, phrase-based, and neural MT engines
A Burchardt, V Macketanz, J Dehdari, G Heigold, JT Peter, ...
The Prague Bulletin of Mathematical Linguistics 108 (1), 159, 2017
Cited by: 68; Year: 2017
Modified MMI/MPE: A direct evaluation of the margin in speech recognition
G Heigold, T Deselaers, R Schlüter, H Ney
Proceedings of the 25th international conference on Machine learning, 384-391, 2008
Cited by: 66; Year: 2008
Asynchronous optimization for sequence training of neural networks
G Heigold, E McDermott, VO Vanhoucke, AW Senior, MAU Bacchiani
US Patent 10,019,985, 2018
Cited by: 64; Year: 2018
Cross-lingual, character-level neural morphological tagging
R Cotterell, G Heigold
arXiv preprint arXiv:1708.09157, 2017
Cited by: 62; Year: 2017
Multiframe deep neural networks for acoustic modeling
V Vanhoucke, M Devin, G Heigold
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
Cited by: 62; Year: 2013
Discriminative training for automatic speech recognition: Modeling, criteria, optimization, implementation, and performance
G Heigold, H Ney, R Schlüter, S Wiesler
IEEE Signal Processing Magazine 29 (6), 58-69, 2012
Cited by: 58; Year: 2012
A Gaussian mixture model layer jointly optimized with discriminative features within a deep neural network architecture
E Variani, E McDermott, G Heigold
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
Cited by: 51; Year: 2015