Predictive value and discriminant capacity of cepstral- and spectral-based measures during continuous speech

Soren Y. Lowell, Raymond H. Colton, Richard T. Kelley, Sarah A. Mizia

Research output: Contribution to journalArticlepeer-review

60 Scopus citations

Abstract

Objectives/Hypothesis: The purpose of this study was to determine the relative strength of various cepstral- and spectral-based measures for predicting dysphonia severity and differentiating voice quality types. Study Design: Prospective, quasi-experimental research design. Methods: Twenty-eight dysphonic speakers and 14 normal speakers were included in this study. Among the dysphonic speakers, 14 had a predominant voice quality of breathiness and 14 had a predominant voice quality of roughness. Cepstral and spectral analyses of the first and second sentences of the Rainbow passage were performed, along with perceptual ratings of overall dysphonia severity. Linear regression was performed to determine the predictive capacity of each variable for dysphonia severity, and discriminant analysis determined the combination of variables that optimally differentiated the three voice quality types. Results: A four-factor model that incorporated the cepstral- and spectral-based measures produced an R value of 0.899, explaining 81% of the variance in auditory-perceptual dysphonia severity. Cepstral peak prominence (CPP) showed the greatest predictive contribution to dysphonia severity in the regression model. The discriminant analysis produced two discriminant functions that included both CPP and its standard deviation (CPP SD) as significant contributors (P < 0.001), with an overall classification accuracy for the combined functions of 79%. Conclusions: Acoustic measures reflecting the distribution of harmonic energy and low- to high-frequency energy in continuous speech, along with the variability (standard deviations) of each, were highly predictive of dysphonia severity when combined in a multivariate linear model. Cepstral-based measures showed the highest capacity to discriminate voice quality types, with better classification accuracy for normal and dysphonic-breathy than for dysphonic-rough voices.

Original languageEnglish (US)
Pages (from-to)393-400
Number of pages8
JournalJournal of Voice
Volume27
Issue number4
DOIs
StatePublished - Jul 2013

Keywords

  • Acoustic
  • Cepstral
  • Cepstral peak prominence
  • Cepstrum
  • Dysphonia
  • Low-high frequency
  • Spectral
  • Spectrum
  • Voice
  • Voice disorder

ASJC Scopus subject areas

  • Speech and Hearing
  • LPN and LVN
  • Otorhinolaryngology

Fingerprint

Dive into the research topics of 'Predictive value and discriminant capacity of cepstral- and spectral-based measures during continuous speech'. Together they form a unique fingerprint.

Cite this