Follow
Abhanshu Sharma
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
21442023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
671*2024
Search query predictions by a keyboard
J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ...
US Patent 9,720,955, 2017
772017
Screenai: A vision-language model for ui and infographics understanding
G Baechler, S Sunkara, M Wang, F Zubach, H Mansoor, V Etter, ...
arXiv preprint arXiv:2402.04615, 2024
332024
Mapping images to search queries
M Sharifi, D Petrou, A Sharma
US Patent 10,489,410, 2019
222019
Search query predictions by a keyboard
J Cao, A Greenberg, A Sharma, Y Su, K Nicholas, M Mohsin, J Jurewicz, ...
US Patent 10,305,828, 2019
212019
Towards better semantic understanding of mobile interfaces
S Sunkara, M Wang, L Liu, G Baechler, YC Hsiao, A Sharma, J Stout
arXiv preprint arXiv:2210.02663, 2022
202022
On-device image recognition
A Sharma, F Zubach, T Binder, L Mach, S El Ghazzal, M Sharifi
US Patent 10,769,428, 2020
72020
Visual recognition using user tap locations
A Sharma, D Petrou, M Sharifi
US Patent 10,353,950, 2019
72019
Chart-based reasoning: Transferring capabilities from llms to vlms
V Carbune, H Mansoor, F Liu, R Aralikatte, G Baechler, J Chen, A Sharma
arXiv preprint arXiv:2403.12596, 2024
32024
Visual recognition using user tap locations
A Sharma, D Petrou, M Sharifi
US Patent 10,664,519, 2020
22020
Surfacing images of a collection based on device context
M Sharifi, K Naliuka, A Sharma
12018
Visual Recognition Using User Tap Locations
A Sharma, D Petrou, M Sharifi
US Patent App. 18/741,176, 2024
2024
Automated assistant control of non-assistant applications via identification of synonymous term and/or speech processing biasing
J Lange, A Sharma, A Coimbra, G Bakir, G Taubman, II Firman, J Chen, ...
US Patent App. 18/642,010, 2024
2024
Automated assistant control of non-assistant applications via identification of synonymous term and/or speech processing biasing
J Lange, A Sharma, A Coimbra, G Bakir, G Taubman, I Firman, J Chen, ...
US Patent 11,967,321, 2024
2024
Mapping Images to Search Queries
M Sharifi, A Sharma, D Petrou
US Patent App. 18/344,509, 2023
2023
Mapping images to search queries
M Sharifi, D Petrou, A Sharma
US Patent 11,734,287, 2023
2023
Visual Recognition Using User Tap Locations
A Sharma, D Petrou, M Sharifi
US Patent App. 17/958,728, 2023
2023
Visual recognition using user tap locations
A Sharma, D Petrou, M Sharifi
US Patent 11,461,386, 2022
2022
Mapping images to search queries
M Sharifi, D Petrou, A Sharma
US Patent 11,269,897, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20