TY - JOUR
T1 - Use of a correlational method to estimate a listener's weighting function for speech
AU - Doherty, Karen A.
AU - Turner, Christopher W.
PY - 1996/12
Y1 - 1996/12
N2 - The purpose of this study was to determine if it is feasible to use the correlational method (Lutfi, 1995; Richards and Zhu, 1994) to estimate how listeners use or weight the information contained within various frequency bands of speech. Three naturally spoken vowel-consonant-vowel (VCV) syllables (/aba/, /aga/, and /ada/) were presented monaurally to listeners. Each of the VCV waveforms were filtered into three separate frequency bands (i.e., low, mid, and high). Each band was then independently and randomly degraded at various signal-to-noise (S/N) levels (-7, -5, -3, 1, or +1). On each trial, listeners were asked to identify the VCV that was presented to them. For each trial, the S/N level of each frequency band, the stimulus that was presented, and the listener's responses were all recorded and stored in a file. From this trial-by-trial data, a point biserial correlation was computed between the listener's response (correct or incorrect identification) and the degradation within each frequency band. The stronger the correlation, the greater influence that given frequency band had on the listener's performance on the task. From these relations it was shown that it is possible to obtain a listener's weighting function for speech. Results showed that although most listeners weighted the mid-frequency band the greatest, several of the listeners used different weighting strategies to perform the task. Several methodological issues are discussed in regard to improving the future application of the correlational method to speech.
AB - The purpose of this study was to determine if it is feasible to use the correlational method (Lutfi, 1995; Richards and Zhu, 1994) to estimate how listeners use or weight the information contained within various frequency bands of speech. Three naturally spoken vowel-consonant-vowel (VCV) syllables (/aba/, /aga/, and /ada/) were presented monaurally to listeners. Each of the VCV waveforms were filtered into three separate frequency bands (i.e., low, mid, and high). Each band was then independently and randomly degraded at various signal-to-noise (S/N) levels (-7, -5, -3, 1, or +1). On each trial, listeners were asked to identify the VCV that was presented to them. For each trial, the S/N level of each frequency band, the stimulus that was presented, and the listener's responses were all recorded and stored in a file. From this trial-by-trial data, a point biserial correlation was computed between the listener's response (correct or incorrect identification) and the degradation within each frequency band. The stronger the correlation, the greater influence that given frequency band had on the listener's performance on the task. From these relations it was shown that it is possible to obtain a listener's weighting function for speech. Results showed that although most listeners weighted the mid-frequency band the greatest, several of the listeners used different weighting strategies to perform the task. Several methodological issues are discussed in regard to improving the future application of the correlational method to speech.
UR - http://www.scopus.com/inward/record.url?scp=0029776607&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029776607&partnerID=8YFLogxK
U2 - 10.1121/1.417336
DO - 10.1121/1.417336
M3 - Article
C2 - 8969478
AN - SCOPUS:0029776607
SN - 0001-4966
VL - 100
SP - 3769
EP - 3773
JO - Journal of the Acoustical Society of America
JF - Journal of the Acoustical Society of America
IS - 6
ER -