Abstract
Mispronunciation detection tools could increase treatment access for speech sound disorders impacting, e.g.,/ɹ/. We show age-and-sex normalized formant estimation outperforms cepstral representation for detection of fully rhotic vs. derhotic/ɹ/in the PERCEPT-R Corpus. Gated recurrent neural networks trained on this feature set achieve a mean test participant-specific F1-score = .81 (σx = .10, med = .83, n = 48), with post hoc modeling showing no significant effect of child age or sex.
Original language | English (US) |
---|---|
Pages (from-to) | 4563-4567 |
Number of pages | 5 |
Journal | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
Volume | 2023-August |
DOIs | |
State | Published - 2023 |
Event | 24th International Speech Communication Association, Interspeech 2023 - Dublin, Ireland Duration: Aug 20 2023 → Aug 24 2023 |
Keywords
- clinical
- mispronunciation detection
- rhotics
ASJC Scopus subject areas
- Language and Linguistics
- Human-Computer Interaction
- Signal Processing
- Software
- Modeling and Simulation