TY - JOUR
T1 - The Reliability of Expert Diagnosis of Childhood Apraxia of Speech
AU - Murray, Elizabeth
AU - Velleman, Shelley
AU - Preston, Jonathan L.
AU - Heard, Robert
AU - Shibu, Akhila
AU - McCabe, Patricia
N1 - Publisher Copyright: © 2023 American Speech-Language-Hearing Association.
PY - 2024/9
Y1 - 2024/9
AB - Purpose: The current standard for clinical diagnosis of childhood apraxia of speech (CAS) is expert clinician judgment. The psychometric properties of this standard are not well understood; however, they are important for improving clinical diagnosis. The purpose of this study is to determine the extent to which experts agree on the clinical diagnosis of CAS using two cohorts of children with mixed speech sound disorders (SSDs). Method: Speech samples of children with SSDs were obtained from previous and ongoing research from video recordings of children aged 3–8 years (n = 36) and audio recordings of children aged 8–17 years (n = 56). A total of 23 expert, English-speaking clinicians were recruited internationally. Three of these experts rated each speech sample to provide a description of the observed features and a diagnosis. Intrarater reliability was acceptable at 85% agreement. Results: Interrater reliability on the presence or absence of CAS among experts was poor both as a categorical diagnosis (κ = .187, 95% confidence interval [CI] [0.089, 0.286]) and on a continuous “likelihood of CAS” scale (0–100; intraclass correlation = .183, 95% CI [.037, .347]). Reliability was similar across the video-recorded and audio-only samples. There was greater agreement on other diagnoses (such as articulation disorder) than on the diagnosis of CAS, although these too did not meet the predetermined standard. Likelihood of CAS was greater in children who presented with more American Speech-Language-Hearing Association CAS consensus features. Conclusions: Different expert raters had different thresholds for applying the diagnosis of CAS. If expert clinician judgment is to be used for diagnosis of CAS or other SSDs, further standardization and calibration is needed to increase interrater reliability. Diagnosis may require operationalized checklists or reliable measures that operate along a diagnostic continuum.
UR - http://www.scopus.com/inward/record.url?scp=85205083671&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85205083671&partnerID=8YFLogxK
U2 - 10.1044/2023_JSLHR-22-00677
DO - 10.1044/2023_JSLHR-22-00677
M3 - Article
C2 - 37642523
AN - SCOPUS:85205083671
SN - 1092-4388
VL - 67
SP - 3309
EP - 3326
JO - Journal of Speech, Language, and Hearing Research
JF - Journal of Speech, Language, and Hearing Research
IS - 9s
ER -