This paper presents a comprehensive analysis of the neural audio-visual synchrony evaluation tool SyncNet.
We introduce the Merkel Podcast Corpus, a German audio-visual-text corpus collected from 16 years of (almost) weekly Internet podcasts by former German chancellor Angela Merkel.
We present an exhaustive exploration of different transformer and fusion models, together with a genetic-algorithm ensembling technique, for Offensive Language Identification in Dravidian Languages.
The objective of this paper is to develop computer vision tools for efficient inventory management of packages in a warehouse.