Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 2144 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 671* | 2024 |
Search query predictions by a keyboard J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ... US Patent 9,720,955, 2017 | 77 | 2017 |
Screenai: A vision-language model for ui and infographics understanding G Baechler, S Sunkara, M Wang, F Zubach, H Mansoor, V Etter, ... arXiv preprint arXiv:2402.04615, 2024 | 33 | 2024 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 10,489,410, 2019 | 22 | 2019 |
Search query predictions by a keyboard J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ... US Patent 10,305,828, 2019 | 21 | 2019 |
Towards better semantic understanding of mobile interfaces S Sunkara, M Wang, L Liu, G Baechler, YC Hsiao, A Sharma, J Stout arXiv preprint arXiv:2210.02663, 2022 | 20 | 2022 |
On-device image recognition A Sharma, F Zubach, T Binder, L Mach, S El Ghazzal, M Sharifi US Patent 10,769,428, 2020 | 7 | 2020 |
Visual recognition using user tap locations A Sharma, D Petrou, M Sharifi US Patent 10,353,950, 2019 | 7 | 2019 |
Chart-based reasoning: Transferring capabilities from llms to vlms V Carbune, H Mansoor, F Liu, R Aralikatte, G Baechler, J Chen, A Sharma arXiv preprint arXiv:2403.12596, 2024 | 3 | 2024 |
Visual recognition using user tap locations A Sharma, D Petrou, M Sharifi US Patent 10,664,519, 2020 | 2 | 2020 |
Surfacing images of a collection based on device context M Sharifi, K Naliuka, A Sharma | 1 | 2018 |
Visual Recognition Using User Tap Locations A Sharma, D Petrou, M Sharifi US Patent App. 18/741,176, 2024 | | 2024 |
Automated assistant control of non-assistant applications via identification of synonymous term and/or speech processing biasing J Lange, A Sharma, A Coimbra, G Bakir, G Taubman, II Firman, J Chen, ... US Patent App. 18/642,010, 2024 | | 2024 |
Automated assistant control of non-assistant applications via identification of synonymous term and/or speech processing biasing J Lange, A Sharma, A Coimbra, G Bakir, G Taubman, I Firman, J Chen, ... US Patent 11,967,321, 2024 | | 2024 |
Mapping Images to Search Queries M Sharifi, A Sharma, D Petrou US Patent App. 18/344,509, 2023 | | 2023 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 11,734,287, 2023 | | 2023 |
Visual Recognition Using User Tap Locations A Sharma, D Petrou, M Sharifi US Patent App. 17/958,728, 2023 | | 2023 |
Visual recognition using user tap locations A Sharma, D Petrou, M Sharifi US Patent 11,461,386, 2022 | | 2022 |
Mapping images to search queries M Sharifi, D Petrou, A Sharma US Patent 11,269,897, 2022 | | 2022 |