Using cepstral coefficients for inhalation pause detection in spontaneous speech

Author

Editor

  • G. Kokkinakis
  • N. Fakotakis
  • E. Dermatas
  • R. Potapova

Summary, in English

A method for recognizing inhalations in spontaneous speech is presented. It is similar to the template matching technique: a distance measure is calculated between a reference sound and an equally long portion of the sound being tracked. The feature representation consists of the standard Mel Frequency Cepstral Coefficients (MFCCs), obtained by performing a discrete cosine transform of the mel-scaled filterbank spectrum. MFCCs are calculated every 5 ms. The comparison is then done by computing the Euclidean distance between the cepstral coefficients of each frame of the two sounds. A low distance value means that the two compared inhalations are likely to be similar. The method can detect inhalations in both male and female spontaneous speech. It is best suited to signals with low noise and high average intensity (studio recordings) but can also be used on noisier recordings with lower average intensity, albeit with poorer results.
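The comparison step described in the summary can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the MFCC frames have already been extracted (one coefficient vector per 5 ms frame) by some other tool, and it assumes the per-frame Euclidean distances are averaged over the window, which the summary does not specify. The function names, the toy data, and the threshold value are hypothetical.

```python
import math


def frame_distance(a, b):
    """Euclidean distance between two cepstral coefficient vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def sliding_distances(reference, signal):
    """Score every window of `signal` that is as long as `reference`.

    Both arguments are lists of MFCC frames (lists of floats).
    The score is the mean frame-wise Euclidean distance; averaging
    is an assumption made for this sketch.
    """
    n = len(reference)
    if n == 0 or len(signal) < n:
        return []
    scores = []
    for start in range(len(signal) - n + 1):
        window = signal[start:start + n]
        total = sum(frame_distance(r, w) for r, w in zip(reference, window))
        scores.append(total / n)
    return scores


def detect(reference, signal, threshold, frame_step_ms=5):
    """Start times (ms) of windows whose score falls below `threshold`.

    `threshold` is a hypothetical tuning parameter; the 5 ms step
    matches the frame rate stated in the summary.
    """
    scores = sliding_distances(reference, signal)
    return [i * frame_step_ms for i, s in enumerate(scores) if s < threshold]


# Toy data: two-coefficient "MFCC" frames, purely illustrative.
reference = [[1.0, 2.0], [1.5, 2.5]]
signal = [[0.0, 0.0], [1.0, 2.0], [1.5, 2.5], [4.0, 4.0]]

scores = sliding_distances(reference, signal)
hits = detect(reference, signal, threshold=0.1)
print(scores)
print(hits)
```

In this toy input the reference occurs verbatim starting at the second frame, so that window scores zero and is the only one reported. On real recordings the threshold would have to be tuned, and the summary indicates that results degrade as noise increases and average intensity drops.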

Publication year

2005

Language

English

Pages

143-146

Publication/Journal/Series

Proceedings of SPECOM 2005

Volume

1

Document type

Conference paper

Publisher

University of Patras

Subject

  • General Language Studies and Linguistics

Keywords

  • breathing pauses
  • inhalations
  • inhalation pause
  • cepstral coefficient
  • pause
  • spontaneous speech

Conference name

SPECOM 2005

Conference place

Patras, Greece

Status

Published

Project

  • The role of function words in spontaneous speech processing

ISBN/ISSN/Other

  • ISBN: 5-7452-0110-x