Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
News
Publications
People
Join us!
Collaborators
Evaluating Variance in Visual Question Answering Benchmarks
Multimodal large language models (MLLMs) have emerged as powerful tools for visual question answering (VQA), enabling reasoning and …
Nikitha SR
PDF
HIRE: Lightweight High-Resolution Image Feature Enrichment for Multimodal LLMs
The integration of high-resolution image features in modern multimodal large language models has demonstrated significant improvements …
Nikitha SR
,
Aradhya Neeraj Mathur
,
Tarun Ram Menta
,
Rishabh Jain
,
Mausoom Sarkar
PDF
Cite
×