Follow
An Yan
Title
Cited by
Cited by
Year
CosRec: 2D convolutional neural networks for sequential recommendation
A Yan, S Cheng, WC Kang, M Wan, J McAuley
Proceedings of the 28th ACM international conference on information and …, 2019
1032019
PA3D: Pose-action 3D machine for video recognition
A Yan, Y Wang, Z Li, Y Qiao
Proceedings of the ieee/cvf conference on computer vision and pattern …, 2019
902019
RadBERT: adapting transformer-based language models to radiology
A Yan, J McAuley, X Lu, J Du, EY Chang, A Gentili, CN Hsu
Radiology: Artificial Intelligence 4 (4), e210258, 2022
682022
Weakly supervised contrastive learning for chest x-ray report generation
A Yan, Z He, X Lu, J Du, E Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2109.12242, 2021
482021
Multimodal text style transfer for outdoor vision-and-language navigation
W Zhu, XE Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
arXiv preprint arXiv:2007.00229, 2020
252020
Visualize before you write: Imagination-guided open-ended text generation
W Zhu, A Yan, Y Lu, W Xu, XE Wang, M Eckstein, WY Wang
arXiv preprint arXiv:2210.03765, 2022
242022
Personalized complementary product recommendation
A Yan, C Dong, Y Gao, J Fu, T Zhao, Y Sun, J McAuley
The ACM Web Conference, 2022
232022
Gpt-4v in wonderland: Large multimodal models for zero-shot smartphone gui navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
192023
Learning concise and descriptive attributes for visual recognition
A Yan, Y Wang, Y Zhong, C Dong, Z He, Y Lu, WY Wang, J Shang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
182023
Gpt-4v (ision) as a generalist evaluator for vision-language tasks
X Zhang, Y Lu, W Wang, A Yan, J Yan, L Qin, H Wang, X Yan, WY Wang, ...
arXiv preprint arXiv:2311.01361, 2023
142023
Personalized showcases: Generating multi-modal explanations for recommendations
A Yan, Z He, J Li, T Zhang, J McAuley
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
142023
Cross-lingual vision-language navigation
A Yan, XE Wang, J Feng, L Li, WY Wang
arXiv preprint arXiv:1910.11301, 2019
132019
L2c: Describing visual differences needs semantic understanding of individuals
A Yan, XE Wang, TJ Fu, WY Wang
arXiv preprint arXiv:2102.01860, 2021
82021
Imagine: An imagination-based automatic evaluation metric for natural language generation
W Zhu, XE Wang, A Yan, M Eckstein, WY Wang
arXiv preprint arXiv:2106.05970, 2021
72021
Robust and interpretable medical image classifiers via concept bottleneck models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
52023
Medeval: A multi-level, multi-task, and multi-domain medical benchmark for language model evaluation
Z He, Y Wang, A Yan, Y Liu, EY Chang, A Gentili, J McAuley, CN Hsu
arXiv preprint arXiv:2310.14088, 2023
42023
Clip also understands text: Prompting clip for phrase understanding
A Yan, J Li, W Zhu, Y Lu, WY Wang, J McAuley
arXiv preprint arXiv:2210.05836, 2022
42022
“Nothing abnormal”: Disambiguating medical reports via contrastive knowledge infusion
Z He, A Yan, A Gentili, J McAuley, CN Hsu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (12), 14232 …, 2023
22023
Semi-supervised Multi-Label Classification with 3D CBAM Resnet for Tuberculosis Cavern Report.
X Lu, A Yan, EY Chang, CN Hsu, JJ McAuley, J Du, A Gentili
CLEF (Working Notes), 1474-1479, 2022
22022
Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks
J Echterhoff, A Yan, K Han, A Abdelraouf, R Gupta, J McAuley
arXiv preprint arXiv:2310.16639, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20