Conference | Adobe Media and Data Science Research (MDSR) Laboratory

Document Structure Extraction using Prior based High Resolution Hierarchical Semantic Segmentation

Structure extraction from document images has been a long-standing research topic due to its high impact on a wide range of practical …

Mausoom Sarkar, Milan Aggarwal, Arneh Jain, Hiresh Gupta, Balaji Krishnamurthy

Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned …

Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation …

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition

Visual Speech Recognition (VSR) is the process of recognizing or interpreting speech by watching the lip movements of the speaker. …

Yaman Kumar Singla, Dhruva Sharawat, Shubham Maheshwari, Debanjan Mahata, Rajiv Ratn Shah, Yifang Yin, Roger Zimmermann, Amanda Stent

Keyphrase Extraction as Sequence Labeling Task using Transformers

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where …

Dhruva Sahrawat, Debanjan Mahata, Raymond Zhang, Mayank Kulkarni, Agniv Sharma, Rakesh Gosangi, Amanda Stent, Yaman Kumar Singla, Rajiv Ratn Shah, Roger Zimmermann

Learning based Methods for Code Runtime Complexity Prediction

Predicting the runtime complexity of a programming code is an arduous task. In fact, even for humans, it requires a subtle analysis and …

Jagriti Sikka, Kushal Satya, Yaman Kumar Singla, Shagun Uppal, Rajiv Ratn Shah, Roger Zimmermann

MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging …

Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji Krishnamurthy

Multi-Modal Association based Grouping for Form Structure Extraction

Document structure extraction has been a widely researched area for decades. Recent work in this direction has been deep …

Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta, Balaji Krishnamurthy

Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks

Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains. In this work, we …

Surgan Jandial, Ayush Chopra, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy, Vineeth Balasubramanian

ShapeVis: High-dimensional Data Visualization at Scale

We present ShapeVis, a scalable visualization technique for point cloud data inspired from topological data analysis. Our method …

Nupur Kumari, Siddarth R., Akash Rupela, Piyush Gupta, Balaji Krishnamurthy