Adobe Media and Data Science Research (MDSR) Laboratory
SPRO: Improving Image Generation via Self-Play
Recent advances in diffusion models have dramatically improved image fidelity and diversity. However, aligning these models with …
Ritika Jha, Aanisha Bhattacharyya, Yaman Kumar Singla, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy
Evaluating Variance in Visual Question Answering Benchmarks
Multimodal large language models (MLLMs) have emerged as powerful tools for visual question answering (VQA), enabling reasoning and …
Nikitha SR
HIRE: Lightweight High-Resolution Image Feature Enrichment for Multimodal LLMs
The integration of high-resolution image features in modern multimodal large language models has demonstrated significant improvements …
Nikitha SR, Aradhya Neeraj Mathur, Tarun Ram Menta, Rishabh Jain, Mausoom Sarkar
Learning Together to Perform Better: Teaching Small-Scale LLMs to Collaborate via Preferential Rationale Tuning
LLMs such as GPT-4 have shown a remarkable ability to solve complex questions by generating step-by-step rationales. Prior works have …
Sohan Patnaik, Milan Aggarwal, Sumit Bhatia, Balaji Krishnamurthy
EOPose: Exemplar-based object reposing using Generalized Pose Correspondences
Reposing generic objects without the use of 3D models poses a significant challenge due to the absence of a standardized pose …
Sarthak Mehrotra, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy, Mausoom Sarkar
AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment
Numerous pose-guided human editing methods have been explored by the vision community due to their extensive practical applications. …
Sohan Patnaik, Rishabh Jain, Balaji Krishnamurthy, Mausoom Sarkar
LLaVA Finds Free Lunch: Teaching Human Behavior Improves Content Understanding Abilities Of LLMs
Communication is defined as “Who says what to whom with what effect”. A message from a communicator generates downstream …
Somesh Singh, S I Harini, Yaman Kumar Singla, Veeky Baths, Rajiv Ratn Shah, Changyou Chen, Balaji Krishnamurthy
Measuring and Improving Persuasiveness of Large Language Models
LLMs are increasingly being used in workflows involving generating content to be consumed by humans (e.g., marketing) and also in …
Somesh Singh, Yaman Kumar Singla, S I Harini, Balaji Krishnamurthy
It Helps to Take a Second Opinion: Teaching Smaller LLMs To Deliberate Mutually via Selective Rationale Optimisation
Very large language models (LLMs) such as GPT-4 have shown the ability to handle complex tasks by generating and self-refining …
Sohan Patnaik, Milan Aggarwal, Sumit Bhatia, Balaji Krishnamurthy
Measuring And Improving Engagement of Text-to-Image Generation Models
Recent advances in text-to-image generation have achieved impressive aesthetic quality, making these models usable for both personal …
Varun Khurana, Yaman Kumar Singla, Jayakumar Subramanian, Changyou Chen, Rajiv Ratn Shah, Zhiqiang Xu, Balaji Krishnamurthy