Show, reward and tell: Automatic generation of narrative paragraph from photo stream by adversarial training J Wang, J Fu, J Tang, Z Li, T Mei Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 69 | 2018 |
Multimodal attention with image text spatial relationship for ocr-based image captioning J Wang, J Tang, J Luo Proceedings of the 28th ACM International Conference on Multimedia, 4337-4345, 2020 | 58 | 2020 |
Improving OCR-based image captioning by incorporating geometrical relationship J Wang, J Tang, M Yang, X Bai, J Luo Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 44 | 2021 |
Convolutional auto-encoding of sentence topics for image paragraph generation J Wang, Y Pan, T Yao, J Tang, T Mei Proceedings of the Twenty-Eighth International Joint Conference on …, 2019 | 44 | 2019 |
Show, reward, and tell: Adversarial visual story generation J Tang, J Wang, Z Li, J Fu, T Mei ACM Transactions on Multimedia Computing, Communications, and Applications …, 2019 | 13 | 2019 |
Contextual and Selective Attention Networks for Image Captioning J Wang, Y Li, Y Pan, T Yao, J Tang, T Mei SCIENCE CHINA Information Sciences, 2022 | 8 | 2022 |
A Multiscale Grouping Transformer with CLIP Latents for Remote Sensing Image Captioning L Meng, J Wang, R Meng, Y Yang, L Xiao IEEE Transactions on Geoscience and Remote Sensing, 2024 | 6 | 2024 |
Prior Knowledge-Guided Transformer for Remote Sensing Image Captioning L Meng, J Wang, Y Yang, L Xiao IEEE Transactions on Geoscience and Remote Sensing, 2023 | 3 | 2023 |