What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering

Authors

Summary, in English

Mel-frequency cepstral coefficients (MFCCs) are usually derived from a Hamming-windowed DFT spectrum. In this paper, we advocate using a so-called multitaper method instead. Multitaper methods form a spectrum estimate using multiple window functions and frequency-domain averaging. Multitapers provide a robust spectrum estimate but have not received much attention in speech processing. Our speaker recognition experiment on NIST 2002 yields equal error rates (EERs) of 9.66 % (clean data) and 16.41 % (-10 dB SNR) for the conventional Hamming method, and 8.13 % (clean data) and 14.63 % (-10 dB SNR) using multitapers. Multitapering is a simple and robust alternative to the Hamming window method.
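
As a rough illustration of the idea in the summary (several window functions, then averaging in the frequency domain), the following is a minimal sketch for a single speech frame. It is not the authors' implementation: the summary does not say which taper family or weighting the paper uses, so the sine tapers, the uniform weights, and the names `sine_tapers`, `multitaper_spectrum`, and `hamming_spectrum` are all assumptions made for illustration only.

```python
import numpy as np

def sine_tapers(frame_len, num_tapers):
    """Return an orthonormal set of sine tapers, shape (num_tapers, frame_len).

    Assumption: sine tapers are used here only because they have a simple
    closed form; the summary above does not name a taper family.
    """
    n = np.arange(1, frame_len + 1)
    k = np.arange(1, num_tapers + 1)[:, None]
    return np.sqrt(2.0 / (frame_len + 1)) * np.sin(np.pi * k * n / (frame_len + 1))

def multitaper_spectrum(frame, num_tapers=6, nfft=512):
    """Average the power spectra obtained with each taper (uniform weights)."""
    tapers = sine_tapers(len(frame), num_tapers)
    # One windowed DFT per taper, then frequency-domain averaging.
    spectra = np.abs(np.fft.rfft(tapers * frame, n=nfft, axis=1)) ** 2
    return spectra.mean(axis=0)

def hamming_spectrum(frame, nfft=512):
    """Conventional single-window estimate, with a unit-energy Hamming window."""
    window = np.hamming(len(frame))
    window = window / np.linalg.norm(window)
    return np.abs(np.fft.rfft(window * frame, n=nfft)) ** 2

# Example: compare both estimates on a white-noise frame, whose true
# spectrum is flat, so fluctuations reflect estimator variance.
rng = np.random.default_rng(0)
frame = rng.standard_normal(240)
print("Hamming variance:   ", np.var(hamming_spectrum(frame)))
print("Multitaper variance:", np.var(multitaper_spectrum(frame)))
```

In an MFCC front end, either spectrum estimate would then be passed through the usual mel filterbank, logarithm, and DCT; only the spectrum estimation step differs between the two methods.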

Department(s)

Publication year

2010

Language

English

Pages

2734-2737

Publication/Journal/Series

Interspeech 2010

Document type

Conference paper

Subject

  • Probability Theory and Statistics

Keywords

  • speaker verification
  • multiple window method

Conference name

Interspeech 2010

Conference date

0001-01-02

Conference place

Makuhari, Japan

Status

Published

Research group

  • Statistical Signal Processing Group