What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering
Författare
Summary, in English
Usually the mel-frequency cepstral coefficients (MFCCs) are derived via Hamming windowed DFT spectrum. In this paper, we advocate to use a so-called multitaper method instead. Multitaper methods form a spectrum estimate using multiple window functions and frequency-domain averaging. Multitapers provide a robust spectrum estimate but have not received much attention in speech processing. Our speaker recognition experiment on NIST 2002 yields equal error rates (EERs) of 9.66 % (clean data) and 16.41 % (-10 dB SNR) for the conventional Hamming method and 8.13 % (clean data) and 14.63 % (-10 dB SNR) using multitapers. Multitapering is a simple and robust alternative to the Hamming window method.
Avdelning/ar
- Matematisk statistik
- Statistical Signal Processing Group
Publiceringsår
2010
Språk
Engelska
Sidor
2734-2737
Publikation/Tidskrift/Serie
InterSpecch 2010
Länkar
Dokumenttyp
Konferensbidrag
Ämne
- Probability Theory and Statistics
Nyckelord
- speaker verification
- multiple window method
Conference name
Interspeech 2010
Conference date
0001-01-02
Conference place
Makuhari, Japan
Status
Published
Forskningsgrupp
- Statistical Signal Processing Group