Adobe Media and Data Science Research (MDSR) Laboratory
Adobe Media and Data Science Research (MDSR) Laboratory
People
Join us!
Publications
Collaborators
Hush-Hush Speak: Speech Reconstruction Using Silent Videos
Speech Reconstruction is the task of recreation of speech using silent videos as input. In the literature, it is also referred to as …
Shashwat Uttam
,
Yaman Kumar Singla
,
Dhruva Sharawat
,
Mansi Aggarwal
,
Debanjan Mahata
,
Rajiv Ratn Shah
,
Amanda Stent
PDF
Lipper: Synthesizing Thy Speech using Multi-View Lipreading
Lipreading has a lot of potential applications such as in the domain of surveillance and video conferencing. Despite this, most of the …
Yaman Kumar Singla
,
Rohit Jain
,
Khwaja Mohd. Salik
,
Rajiv Ratn Shah
,
Yifang Yin
,
Roger Zimmerman
PDF
OpticalGAN : Generative Adversarial Networks for Continuous Variable Quantum Computation
We present OpticalGAN, an extension of quantum generative adversarial networks for continuous-variable quantum computation. OpticalGAN …
Nilay Shrivastava
,
Nikaash Puri
,
Piyush Gupta
,
Balaji Krishnamurthy
,
Sukriti Verma
PDF
Powering Robust Fashion Retrieval with Information Rich Feature Embeddings
Visual content based product retrieval has become increasingly important for e-commerce. Fashion retrieval, in particular, is a …
Ayush Chopra
,
Abhishek Sinha
,
Mausoom Sarkar
,
Hiresh Gupta
,
Kumar Ayush
,
Balaji Krishnamurthy
PDF
Harnessing AI for speech reconstruction using multiview silent video feed
Speechreading or lipreading is the technique of understanding and getting phonetic features from a speaker’s visual features such …
Yaman Kumar Singla
,
Mayank Aggarwal
,
Pratham Nawal
,
Shin'ichi Satoh
,
Rajiv Ratn Shah
,
Roger Zimmerman
PDF
«
Cite
×