The following papers related to the challenge have been accepted at Interspeech 2024:
Latest Updates (as of 14/06/2024):
In multilingual communities, the social conversations often involve code-mixed and
code-switched speech. The code-mixing refers to the scenario where words or
morphemes from one language (secondary) are used within a sentence of another
language (primary). However, the switching of languages at the sentence or phrase level
is known as code-switching, where the conversational language is itself shifted. In such
cases, the extraction of various analytics for speech-based systems, such as speaker
and language information or automatic speech recognition (ASR) to generate rich
transcriptions, becomes highly challenging. The current speaker diarization systems
are simply not equipped to deal with multilingual conversations, where the same talker
speaks in multiple code-mixed languages.
Focusing on the Interspeech-2024 theme, i.e., Speech and Beyond, the DISPLACE-2024 challenge aims to address research issues related to speaker and language diarization along with Automatic Speech Recognition (ASR) in an inclusive manner. The goal of the challenge is to establish new benchmarks for speaker diarization (SD) in multilingual settings, language diarization (LD) in multi-speaker settings, and ASR in multi-accent settings, using the same underlying dataset. The previous works have addressed speaker and language diarization, & ASR but in isolation. A collective effort from worldwide researchers is required to address associated research issues. We look forward to your participation in reaching a new milestone in the speaker and language diarization & ASR areas. We also encourage general submissions in the field of speaker and/or language diarization and/or ASR under DISPLACE-2024 challenge/special session.
We also encourage general submissions related to speaker and language diarization & ASR in DISPLACE-2024 challenge / special session in Interspeech-2024.
Summary of the DISPLACE Challenge 2023 -- DIarization of SPeaker and LAnguage in Conversational Environments (Click here for full paper) has been accepted in Speech Communication.
This challenge organizes three tracks and you can participate in one, two, or even all of them.
Track-1 is dedicated to speaker diarization (SD).
Track-2 focuses on language diarization (LD).
Track-3 is exclusive for Automatic Speech Recognition (ASR)