Accéder directement au contenu Accéder directement à la navigation
Pré-publication, Document de travail

The Speed Submission to DIHARD II: Contributions & Lessons Learned

Abstract : This paper describes the speaker diarization systems developed for the Second DIHARD Speech Diarization Challenge (DIHARD II) by the Speed team. Besides describing the system, which considerably outperformed the challenge baselines, we also focus on the lessons learned from numerous approaches that we tried for single and multi-channel systems. We present several components of our diarization system, including categorization of domains, speech enhancement, speech activity detection, speaker embeddings, clustering methods, resegmentation, and system fusion. We analyze and discuss the effect of each such component on the overall diarization performance within the realistic settings of the challenge.
Liste complète des métadonnées

Littérature citée [37 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-02352840
Contributeur : Md Sahidullah <>
Soumis le : jeudi 7 novembre 2019 - 09:18:23
Dernière modification le : jeudi 2 juillet 2020 - 03:46:23
Document(s) archivé(s) le : samedi 8 février 2020 - 23:12:37

Fichier

DIHARDSpeed.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-02352840, version 1

Collections

Citation

Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, et al.. The Speed Submission to DIHARD II: Contributions & Lessons Learned. 2019. ⟨hal-02352840v1⟩

Partager

Métriques

Consultations de la notice

117

Téléchargements de fichiers

584