Global speaker clustering towards optimal stopping criterion in binary key speaker diarization

Héctor Delgado, Javier Serrano, Xavier Anguera, Corinne Fredouille

Research output: Contribution to journal › Article › Research › peer-review

5 Citations (Scopus)

Abstract

© Springer International Publishing Switzerland 2014. The recently proposed speaker diarization technique based on binary keys provides a very fast alternative to state-of-the-art systems with little increase in Diarization Error Rate (DER). Although the approach shows great potential, it also presents issues, mainly in its stopping criterion, so alternative clustering and stopping-criterion approaches need to be explored. Some recent works have addressed speaker clustering as a global optimization problem in order to tackle the intrinsic issues of Agglomerative Hierarchical Clustering (AHC), mainly its decision making based on local maxima. This paper aims to adapt and apply this new framework to the binary key diarization system. In addition, cluster purity is analyzed across the AHC iterations, using reference (ground-truth) speaker labels, in order to select a purer clustering as input to the global framework. Experiments on the REPERE phase 1 test database show improvements of around 6% absolute DER compared to the baseline system output.
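As an illustration of the cluster purity analysis mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes a frame-level, majority-share definition of purity (each cluster is credited with the frames of its dominant reference speaker), which may differ from the definition used in the paper. The function name `cluster_purity` and the toy labels are hypothetical.

```python
from collections import Counter


def cluster_purity(cluster_labels, speaker_labels):
    """Frame-weighted purity of one clustering (assumed majority-share definition).

    cluster_labels[i] is the cluster assigned to frame i by the system;
    speaker_labels[i] is the reference speaker of frame i.
    """
    if len(cluster_labels) != len(speaker_labels):
        raise ValueError("label sequences must have the same length")
    if not cluster_labels:
        raise ValueError("label sequences must not be empty")

    # Count, for every cluster, how many frames belong to each reference speaker.
    per_cluster = {}
    for cluster, speaker in zip(cluster_labels, speaker_labels):
        per_cluster.setdefault(cluster, Counter())[speaker] += 1

    # Credit each cluster with the frames of its dominant speaker only.
    dominant_frames = sum(max(counts.values()) for counts in per_cluster.values())
    return dominant_frames / len(cluster_labels)


# Hypothetical toy data: three successive AHC iterations over six frames.
reference = ["A", "A", "B", "B", "B", "B"]
iterations = [
    [0, 0, 1, 1, 2, 2],  # three clusters, each containing a single speaker
    [0, 0, 1, 1, 1, 1],  # clusters 1 and 2 merged, still one speaker each
    [0, 0, 0, 0, 0, 0],  # everything merged, speakers A and B are mixed
]
purities = [cluster_purity(labels, reference) for labels in iterations]
```

Under this definition purity can only stay level or drop as clusters are merged, so tracking it across iterations shows the point at which AHC starts mixing speakers.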
Original language: English
Pages (from-to): 59-68
Journal: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 8854
Publication status: Published - 1 Jan 2014

Keywords

  • Binary key
  • Cluster purity
  • ILP
  • Speaker diarization
