Improved binary key speaker diarization system

Hector Delgado, Xavier Anguera, Corinne Fredouille, Javier Serrano

Research output: Contribution to journalArticleResearchpeer-review

6 Citations (Scopus)

Abstract

The recently proposed speaker diarization technique based on binary keys provides a very fast alternative to state-of-the-art systems. However, this speed up has the cost of a little increase in Diarization Error Rate (DER). This paper proposes a series of improvements to the original algorithm with the aim to get closer to state-of-the-art performance. First, several alternative similarity measures between binary key speaker/segment models are introduced. Second, we perform a first attempt at applying Intra-Session and IntraSpeaker Variability (ISISV) compensation within the binary diarization approach through the Nuisance Attribute Projection. Experimental results show the benefits of the newly introduced similarity metrics, as well as the potential of the Nuisance Attribute Projection for ISISV compensation in the binary key speaker diarization framework.

Original languageAmerican English
Pages (from-to)2087-2091
Number of pages5
Journal2015 23rd European Signal Processing Conference, EUSIPCO 2015
DOIs
Publication statusPublished - 22 Dec 2015

Keywords

  • binary key
  • chi-square distance
  • cosine distance
  • nuisance attribute projection
  • session variability compensation
  • Speaker diarization

Fingerprint

Dive into the research topics of 'Improved binary key speaker diarization system'. Together they form a unique fingerprint.

Cite this