Supervision exists everywhere: A data efficient contrastive language-image pre-training paradigm Y Li*, F Liang*, L Zhao*, Y Cui, W Ouyang, J Shao, F Yu, J Yan International Conference on Learning Representations (ICLR) 2022, 2021 | 312 | 2021 |
3DVG-Transformer: Relation modeling for visual grounding on point clouds L Zhao, D Cai, L Sheng, D Xu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 87 | 2021 |
Transformer3D-Det: Improving 3D object detection by vote refinement L Zhao, J Guo, D Xu, L Sheng IEEE Transactions on Circuits and Systems for Video Technology 31 (12), 4735 …, 2021 | 50 | 2021 |
3DJCG: A unified framework for joint dense captioning and visual grounding on 3D point clouds D Cai, L Zhao, J Zhang, L Sheng, D Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 49 | 2022 |
Democratizing contrastive language-image pre-training: A CLIP benchmark of data, model, and supervision Y Cui, L Zhao, F Liang, Y Li, J Shao ICML First Workshop on Pre-training 2022, 2022 | 28 | 2022 |
VL-SAT: visual-linguistic semantics assisted training for 3D semantic scene graph prediction in point cloud Z Wang, B Cheng, L Zhao, D Xu, Y Tang, L Sheng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 11 | 2023 |
Towards explainable 3D grounded visual question answering: A new benchmark and strong baseline L Zhao, D Cai, J Zhang, L Sheng, D Xu, R Zheng, Y Zhao, L Wang, X Fan IEEE Transactions on Circuits and Systems for Video Technology, 2022 | 10 | 2022 |
Distortion-aware Transformer in 360° Salient Object Detection Y Zhao, L Zhao, Q Yu, L Sheng, J Zhang, D Xu Proceedings of the 31st ACM International Conference on Multimedia, 499-508, 2023 | | 2023 |