Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
News
Publications
People
Join us!
Collaborators
ShapeVis: High-dimensional Data Visualization at Scale
We present ShapeVis, a scalable visualization technique for point cloud data inspired from topological data analysis. Our method …
Nupur Kumari
,
Siddarth R.
,
Akash Rupela
,
Piyush Gupta
,
Balaji Krishnamurthy
PDF
SieveNet: A Unified Framework for Robust Image-based Virtual Try-On
Image-based virtual try-on for fashion has attracted considerable attention recently. The task requires trying on the desired clothing …
Ayush Chopra
,
Surgan Jandial
,
Kumar Ayush
,
Mayur Hemani
,
Balaji Krishnamurthy
PDF
SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation
Few-shot segmentation (FSS) methods perform image segmentation for a particular object class in a target (query) image, using a small …
Siddhartha Gairola
,
Mayur Hemani
,
Ayush Chopra
,
Balaji Krishnamurthy
PDF
Towards A Unified Framework for Visual Compatibility Prediction
Visual compatibility prediction refers to the task of determining if a set of items go well together. Existing techniques for …
Ayush Chopra
,
Kumar Ayush
,
Anirudh Singhal
,
Utkarsh Patel
,
Balaji Krishnamurthy
PDF
Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models
Neural networks are vulnerable to adversarial attacks – small visually imperceptible crafted noise which when added to the input …
Mayank Singh
,
Abhishek Sinha
,
Nupur Kumari
,
Harshitha Machiraju
,
Balaji Krishnamurthy
,
Vineeth N Balasubramanian
PDF
Hush-Hush Speak: Speech Reconstruction Using Silent Videos
Speech Reconstruction is the task of recreation of speech using silent videos as input. In the literature, it is also referred to as …
Shashwat Uttam
,
Yaman Kumar Singla
,
Dhruva Sharawat
,
Mansi Aggarwal
,
Debanjan Mahata
,
Rajiv Ratn Shah
,
Amanda Stent
PDF
Lipper: Synthesizing Thy Speech using Multi-View Lipreading
Lipreading has a lot of potential applications such as in the domain of surveillance and video conferencing. Despite this, most of the …
Yaman Kumar Singla
,
Rohit Jain
,
Khwaja Mohd. Salik
,
Rajiv Ratn Shah
,
Yifang Yin
,
Roger Zimmerman
PDF
OpticalGAN : Generative Adversarial Networks for Continuous Variable Quantum Computation
We present OpticalGAN, an extension of quantum generative adversarial networks for continuous-variable quantum computation. OpticalGAN …
Nilay Shrivastava
,
Nikaash Puri
,
Piyush Gupta
,
Balaji Krishnamurthy
,
Sukriti Verma
PDF
Powering Robust Fashion Retrieval with Information Rich Feature Embeddings
Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a …
Ayush Chopra
,
Abhishek Sinha
,
Mausoom Sarkar
,
Hiresh Gupta
,
Kumar Ayush
,
Balaji Krishnamurthy
PDF
Harnessing AI for speech reconstruction using multiview silent video feed
Speechreading or lipreading is the technique of understanding and getting phonetic features from a speaker’s visual features such …
Yaman Kumar Singla
,
Mayank Aggarwal
,
Pratham Nawal
,
Shin'ichi Satoh
,
Rajiv Ratn Shah
,
Roger Zimmerman
PDF
«
Cite
×