Follow
Anyi Rao
Title
Cited by
Cited by
Year
Adding Conditional Control to Text-to-Image Diffusion Models
L Zhang, A Rao, M Agrawala
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
12392023
HotFlip: White-Box Adversarial Examples for Text Classification
J Ebrahimi, A Rao, D Lowd, D Dou
Proceedings of Annual Meeting of the Association for Computational Linguistics, 2018
10552018
MovieNet: A Holistic Dataset for Movie Understanding
Q Huang, Y Xiong, A Rao, J Wang, D Lin
European Conference on Computer Vision, 2020
1772020
A Local-to-Global Approach to Multi-modal Movie Scene Segmentation
A Rao, L Xu, Y Xiong, G Xu, Q Huang, B Zhou, D Lin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1282020
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
Y Guo, C Yang, A Rao, Z Liang, Y Wang, Y Qiao, M Agrawala, D Lin, ...
International Conference on Learning Representations, 2024
1222024
BungeeNeRF: Progressive Neural Radiance Field for Extreme Multi-scale Scene Rendering
Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin
European Conference on Computer Vision, 2022
1222022
A Unified Framework for Shot Type Classification Based on Subject Centric Lens
A Rao, J Wang, L Xu, X Jiang, Q Huang, B Zhou, D Lin
European Conference on Computer Vision, 2020
632020
A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language
B Su, D Du, Z Yang, Y Zhou, J Li, A Rao, H Sun, Z Lu, JR Wen
arXiv preprint arXiv:2209.05481, 2022
54*2022
CityNeRF: Building NeRF at City Scale
Y Xiangli, L Xu, X Pan, N Zhao, A Rao, C Theobalt, B Dai, D Lin
arXiv preprint arXiv:2112.05504, 2021
452021
Online Multi-modal Person Search in Videos
J Xia, A Rao*, Q Huang, L Xu, J Wen, D Lin
European Conference on Computer Vision, 2020
312020
White-Box Adversarial Examples for NLP
J Ebrahimi, A Rao, D Lowd, D Dou
arXiv preprint arXiv:1712.06751, 2017
15*2017
Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences
Y Zhou, H Duan, A Rao, B Su, J Wang
Proceedings of the AAAI Conference on Artificial Intelligence, 2023
132023
Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos
X Jiang, L Jin, A Rao*, L Xu, D Lin
IEEE Transactions on Multimedia, 2021
92021
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Y Guo, C Yang, A Rao, M Agrawala, D Lin, B Dai
arXiv preprint arXiv:2311.16933, 2023
82023
Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production
A Rao, X Jiang, Y Guo, L Xu, L Yang, L Jin, D Lin, B Dai
ACM SIGGRAPH Special Interest Group on Computer Graphics and Interactive …, 2023
82023
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation
X Liu, X Xu, A Rao, C Gan, L Yi
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
82022
BlockPlanner: City Block Generation With Vectorized Graph Representation
L Xu, Y Xiangli, A Rao, N Zhao, B Dai, Z Liu, D Lin
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
62021
Computer Vision–ECCV 2022 Workshops: Tel Aviv, Israel, October 23–27, 2022, Proceedings, AI for Creative Video Editing and Understanding
A Rao, F Caba, D Liu, L Xu, A Pardo, Y Xiong, V Escorcia, A Thabet, ...
Springer Nature, 2023
5*2023
A Coarse-to-Fine Framework for Automatic Video Unscreen
A Rao, L Xu, Z Li, Q Huang, Z Kuang, W Zhang, D Lin
IEEE Transactions on Multimedia, 2022
52022
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers
D Shi, C Tao, A Rao, Z Yang, C Yuan, J Wang
arXiv preprint arXiv:2305.17455, 2023
42023
The system can't perform the operation now. Try again later.
Articles 1–20