Follow
Zihang Dai
Zihang Dai
Unknown affiliation
No verified email - Homepage
Title
Cited by
Cited by
Year
Xlnet: Generalized autoregressive pretraining for language understanding
Z Yang, Z Dai, Y Yang, J Carbonell, RR Salakhutdinov, QV Le
Advances in neural information processing systems 32, 2019
93762019
Transformer-xl: Attentive language models beyond a fixed-length context
Z Dai, Z Yang, Y Yang, J Carbonell, QV Le, R Salakhutdinov
arXiv preprint arXiv:1901.02860, 2019
40142019
Unsupervised data augmentation for consistency training
Q Xie, Z Dai, E Hovy, T Luong, Q Le
Advances in neural information processing systems 33, 6256-6268, 2020
22682020
Coatnet: Marrying convolution and attention for all data sizes
Z Dai, H Liu, QV Le, M Tan
Advances in neural information processing systems 34, 3965-3977, 2021
10962021
Meta pseudo labels
H Pham, Z Dai, Q Xie, QV Le
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
7472021
Simvlm: Simple visual language model pretraining with weak supervision
Z Wang, J Yu, AW Yu, Z Dai, Y Tsvetkov, Y Cao
arXiv preprint arXiv:2108.10904, 2021
6862021
Good semi-supervised learning that requires a bad gan
Z Dai, Z Yang, F Yang, WW Cohen, RR Salakhutdinov
Advances in neural information processing systems 30, 2017
5642017
Pay attention to mlps
H Liu, Z Dai, D So, QV Le
Advances in neural information processing systems 34, 9204-9215, 2021
5202021
Characterizing and avoiding negative transfer
Z Wang, Z Dai, B Póczos, J Carbonell
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
5012019
Breaking the softmax bottleneck: A high-rank RNN language model
Z Yang, Z Dai, R Salakhutdinov, WW Cohen
arXiv preprint arXiv:1711.03953, 2017
4012017
Controllable invariance through adversarial feature learning
Q Xie, Z Dai, Y Du, E Hovy, G Neubig
Advances in neural information processing systems 30, 2017
2852017
Unsupervised data augmentation
Q Xie, Z Dai, E Hovy, MT Luong, QV Le
arXiv preprint arXiv:1904.12848 2 (6), 7, 2019
2612019
SwitchOut: an efficient data augmentation algorithm for neural machine translation
X Wang, H Pham, Z Dai, G Neubig
arXiv preprint arXiv:1808.07512, 2018
2272018
Funnel-transformer: Filtering out sequential redundancy for efficient language processing
Z Dai, G Lai, Y Yang, Q Le
Advances in neural information processing systems 33, 4271-4282, 2020
1942020
Cfo: Conditional focused neural question answering with large-scale knowledge bases
Z Dai, L Li, W Xu
arXiv preprint arXiv:1606.01994, 2016
1832016
Transformer quality in linear time
W Hua, Z Dai, H Liu, Q Le
International conference on machine learning, 9099-9117, 2022
1782022
Combined scaling for zero-shot transfer learning
H Pham, Z Dai, G Ghiasi, K Kawaguchi, H Liu, AW Yu, J Yu, YT Chen, ...
Neurocomputing 555, 126658, 2023
1402023
Searching for efficient transformers for language modeling
D So, W Mańke, H Liu, Z Dai, N Shazeer, QV Le
Advances in neural information processing systems 34, 6010-6022, 2021
1302021
Transformer-xl: Attentive language models beyond a fixed-length context. arXiv 2019
Z Dai, Z Yang, Y Yang, J Carbonell, QV Le, R Salakhutdinov
arXiv preprint arXiv:1901.02860, 0
127
An interpretable knowledge transfer model for knowledge base completion
Q Xie, X Ma, Z Dai, E Hovy
arXiv preprint arXiv:1704.05908, 2017
1242017
The system can't perform the operation now. Try again later.
Articles 1–20