Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality

Soren Lowell, Richard T. Kelley, Shaheen N. Awan, Raymond H. Colton, Natalie H. Chan

Research output: Contribution to journal (Article)

30 Citations (Scopus)

Abstract

Objectives: We sought to determine whether spectral- and cepstral-based acoustic measures were effective in distinguishing dysphonic-strained voice quality from normal voice quality and whether these measures were related to auditory-perceptual ratings of strain severity.

Methods: Voice samples from 23 speakers with dysphonia characterized predominantly by strained voice quality and 23 speakers with normal voice were acoustically analyzed. Measures related to the prominence of the cepstral peak and the ratio of low- to high-frequency spectral energies, as well as the variation of each, were computed from continuous speech and a sustained vowel. Correlations to perceptually rated strain severity were determined.

Results: Measures related to the cepstrum were the strongest discriminators between dysphonic-strained voice and normal voice. Variation in the ratio of low- to high-frequency spectral energies also significantly differentiated the two speaker groups. All measures were significantly correlated with perceptually rated strain severity, including an acoustic severity index that incorporated both cepstral- and spectral-based measures.

Conclusions: Cepstral- and spectral-based measures that have been previously studied in dysphonia characterized by breathiness and roughness are effective in distinguishing strained dysphonia from normal voice quality. The utility of these acoustic measures is supported by their moderate-to-high relationship with perceptually rated strain severity.
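
As a rough illustration of the two measure families named in the Methods (prominence of the cepstral peak, and the ratio of low- to high-frequency spectral energy), the Python sketch below computes both for a single frame of audio. It is not the authors' analysis procedure: the abstract does not state the software, frame length, or frequency split that were used. The function names, the 80-300 Hz search range for the cepstral peak, the 1 ms start of the regression fit, and the 4 kHz cutoff are all assumptions made for illustration.

```python
import numpy as np


def cepstral_peak_prominence(frame, sr, f0_min=80.0, f0_max=300.0):
    """Return (CPP in dB, F0 estimate in Hz) for one frame (simplified sketch)."""
    windowed = frame * np.hanning(len(frame))
    # Log-magnitude spectrum in dB; the small constant avoids log(0).
    log_spectrum_db = 20.0 * np.log10(np.abs(np.fft.rfft(windowed)) + 1e-12)
    # Cepstrum: magnitude of the inverse transform of the log spectrum, in dB.
    cepstrum = np.abs(np.fft.irfft(log_spectrum_db))
    cepstrum_db = 20.0 * np.log10(cepstrum + 1e-12)
    quefrency = np.arange(len(cepstrum)) / sr  # seconds

    # Search for the peak only where a speaking F0 is plausible (assumed range).
    lo = int(sr / f0_max)
    hi = int(sr / f0_min) + 1
    peak = lo + int(np.argmax(cepstrum_db[lo:hi]))

    # Straight-line baseline fitted to the cepstrum from 1 ms upward (assumed).
    fit = slice(int(0.001 * sr), len(cepstrum) // 2)
    slope, intercept = np.polyfit(quefrency[fit], cepstrum_db[fit], 1)
    baseline = slope * quefrency[peak] + intercept

    # Prominence is the peak height above the baseline at the same quefrency.
    return cepstrum_db[peak] - baseline, sr / peak


def low_high_ratio_db(frame, sr, cutoff_hz=4000.0):
    """Return the low/high spectral energy ratio in dB (cutoff is an assumption)."""
    power = np.abs(np.fft.rfft(frame * np.hanning(len(frame)))) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sr)
    low = power[freqs < cutoff_hz].sum()
    high = power[freqs >= cutoff_hz].sum()
    return 10.0 * np.log10((low + 1e-12) / (high + 1e-12))
```

Applied frame by frame to a recording, the mean of each measure and its standard deviation across frames would give one plausible reading of "the variation of each" mentioned in the Methods, though the abstract does not define that variation precisely.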

Original language: English (US)
Pages (from-to): 539-548
Number of pages: 10
Journal: Annals of Otology, Rhinology and Laryngology
Volume: 121
Issue number: 8
State: Published - Aug 2012

Fingerprint

Voice Quality
Acoustics
Dysphonia

Keywords

  • Acoustic measures
  • Cepstral measures
  • Dysphonia
  • Spectral measures
  • Strain
  • Voice disorder

ASJC Scopus subject areas

  • Otorhinolaryngology

Cite this

Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality. / Lowell, Soren; Kelley, Richard T.; Awan, Shaheen N.; Colton, Raymond H.; Chan, Natalie H.

In: Annals of Otology, Rhinology and Laryngology, Vol. 121, No. 8, 08.2012, p. 539-548.

@article{c70d4f2bc64e4aae883fff449ecee147,
title = "Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality",
abstract = "Objectives: We sought to determine whether spectral- and cepstral-based acoustic measures were effective in distinguishing dysphonic-strained voice quality from normal voice quality and whether these measures were related to auditory-perceptual ratings of strain severity. Methods: Voice samples from 23 speakers with dysphonia characterized predominantly by strained voice quality and 23 speakers with normal voice were acoustically analyzed. Measures related to the prominence of the cepstral peak and the ratio of low- to high-frequency spectral energies, as well as the variation of each, were computed from continuous speech and a sustained vowel. Correlations to perceptually rated strain severity were determined. Results: Measures related to the cepstrum were the strongest discriminators between dysphonic-strained voice and normal voice. Variation in the ratio of low- to high-frequency spectral energies also significantly differentiated the two speaker groups. All measures were significantly correlated with perceptually rated strain severity, including an acoustic severity index that incorporated both cepstral- and spectral-based measures. Conclusions: Cepstral- and spectral-based measures that have been previously studied in dysphonia characterized by breathiness and roughness are effective in distinguishing strained dysphonia from normal voice quality. The utility of these acoustic measures is supported by their moderate-to-high relationship with perceptually rated strain severity.",
keywords = "Acoustic measures, Cepstral measures, Dysphonia, Spectral measures, Strain, Voice disorder",
author = "Lowell, Soren and Kelley, {Richard T.} and Awan, {Shaheen N.} and Colton, {Raymond H.} and Chan, {Natalie H.}",
year = "2012",
month = "8",
language = "English (US)",
volume = "121",
pages = "539--548",
journal = "Annals of Otology, Rhinology and Laryngology",
issn = "0003-4894",
publisher = "Annals Publishing Company",
number = "8"
}

TY - JOUR

T1 - Spectral- and cepstral-based acoustic features of dysphonic, strained voice quality

AU - Lowell, Soren

AU - Kelley, Richard T.

AU - Awan, Shaheen N.

AU - Colton, Raymond H.

AU - Chan, Natalie H.

PY - 2012/8

Y1 - 2012/8

N2 - Objectives: We sought to determine whether spectral- and cepstral-based acoustic measures were effective in distinguishing dysphonic-strained voice quality from normal voice quality and whether these measures were related to auditory-perceptual ratings of strain severity. Methods: Voice samples from 23 speakers with dysphonia characterized predominantly by strained voice quality and 23 speakers with normal voice were acoustically analyzed. Measures related to the prominence of the cepstral peak and the ratio of low- to high-frequency spectral energies, as well as the variation of each, were computed from continuous speech and a sustained vowel. Correlations to perceptually rated strain severity were determined. Results: Measures related to the cepstrum were the strongest discriminators between dysphonic-strained voice and normal voice. Variation in the ratio of low- to high-frequency spectral energies also significantly differentiated the two speaker groups. All measures were significantly correlated with perceptually rated strain severity, including an acoustic severity index that incorporated both cepstral- and spectral-based measures. Conclusions: Cepstral- and spectral-based measures that have been previously studied in dysphonia characterized by breathiness and roughness are effective in distinguishing strained dysphonia from normal voice quality. The utility of these acoustic measures is supported by their moderate-to-high relationship with perceptually rated strain severity.

KW - Acoustic measures

KW - Cepstral measures

KW - Dysphonia

KW - Spectral measures

KW - Strain

KW - Voice disorder

UR - http://www.scopus.com/inward/record.url?scp=84865298999&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84865298999&partnerID=8YFLogxK

M3 - Article

C2 - 22953661

AN - SCOPUS:84865298999

VL - 121

SP - 539

EP - 548

JO - Annals of Otology, Rhinology and Laryngology

JF - Annals of Otology, Rhinology and Laryngology

SN - 0003-4894

IS - 8

ER -