Du är här

What Else is New Than the Hamming Window? Robust MFCCs for Speaker Recognition via Multitapering

Författare:
Publiceringsår: 2010
Språk: Engelska
Sidor: 2734-2737
Publikation/Tidskrift/Serie: InterSpecch 2010
Dokumenttyp: Konferensbidrag

Sammanfattning

Usually the mel-frequency cepstral coefficients (MFCCs) are derived via Hamming windowed DFT spectrum. In this paper, we advocate to use a so-called multitaper method instead. Multitaper methods form a spectrum estimate using multiple window functions and frequency-domain averaging. Multitapers provide a robust spectrum estimate but have not received much attention in speech processing. Our speaker recognition experiment on NIST 2002 yields equal error rates (EERs) of 9.66 % (clean data) and 16.41 % (-10 dB SNR) for the conventional Hamming method and 8.13 % (clean data) and 14.63 % (-10 dB SNR) using multitapers. Multitapering is a simple and robust alternative to the Hamming window method.

Disputation

Nyckelord

  • Mathematics and Statistics
  • speaker verification
  • multiple window method

Övrigt

Interspeech 2010
2010-09-01
Makuhari, Japan
Published
Yes

Box 117, 221 00 LUND
Telefon 046-222 00 00 (växel)
Telefax 046-222 47 20
lu [at] lu [dot] se

Fakturaadress: Box 188, 221 00 LUND
Organisationsnummer: 202100-3211
Om webbplatsen

LERU logo U21 logo