Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
News
Publications
People
Join us!
Collaborators
Evaluating Variance in Visual Question Answering Benchmarks
Multimodal large language models (MLLMs) have emerged as powerful tools for visual question answering (VQA), enabling reasoning and …
Nikitha SR
PDF
HIRE: Lightweight High-Resolution Image Feature Enrichment for Multimodal LLMs
The integration of high-resolution image features in modern multimodal large language models has demonstrated significant improvements …
Nikitha SR
,
Aradhya Neeraj Mathur
,
Tarun Ram Menta
,
Rishabh Jain
,
Mausoom Sarkar
PDF
Measuring And Improving Engagement of Text-to-Image Generation Models
Recent advances in text-to-image generation have achieved impressive aesthetic quality, making these models usable for both personal …
Varun Khurana
,
Yaman Kumar Singla
,
Jayakumar Subramanian
,
Changyou Chen
,
Rajiv Ratn Shah
,
Zhiqiang Xu
,
Balaji Krishnamurthy
PDF
Cite
DeAR Debiasing vision-language models are with additive residuals
Large pre-trained vision-language models (VLMs) reduce the time for developing predictive models for various vision-grounded language …
Ashish Seth
,
Mayur Hemani
,
Chirag Agarwal.
PDF
Cite
×