1

A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation

This paper presents a comprehensive analysis of the neural audio-visual synchrony evaluation tool SyncNet.

Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video Podcasts

We introduce the Merkel Podcast Corpus, an audio-visual-text corpus in German collected from 16 years of (almost) weekly Internet podcasts of former German chancellor Angela Merkel.

Ensembling strategies for Transformer-based Offensive language Detection

We present an exhaustive exploration of different transformer and fusion models and a genetic algorithm ensembling technique for Offensive Language Identification in Dravidian Languages.

Warehouse Management Using Real-Time QR-Code and Text Detection

The objective of this paper is the development of the computer vision tools for efficient inventory management of packages in a warehouse.