TY - GEN
T1 - Augmented Tension Detection in Communication
T2 - Thematic Area on Human Computer Interaction, HCI 2020, held as part of the 22nd International Conference on Human-Computer Interaction, HCII 2020
AU - Zhang, Bo
AU - Xiao, Lu
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Tension in communication often prevents effective flows of information among members in the conversation and thus can negatively influence practices in teams and learning efficiency at school. Interested in developing a computational technique that automatically detects tension in communication, we explore features that signal the existence of tension in human-human communications and investigate the potential of a supervised learning approach. While there is no tension-annotated dataset available, there are language resources that have distress annotated. Although tension may occur during the communication as a result of various factors, distress creates discomfort and tension. Leveraging an interview dataset that has marked the presence/absence of distress, we investigated the prosodic features and LIWC features that indicate tension. Specifically, we compare 23 prosodic features and LIWC features extracted from 186 interviews in terms of how effective they are to indicate the speaker’s distress in a one-to-one conversation. Our analysis shows that there are seven prosodic features and one LIWC features that differ between distress and non-distress interviews. The seven prosodic features are mean intensity, jitter, shimmer, longest silence duration, longest silence position, standard deviation of interviewee speaking rate, and hesitation. And the one effective LIWC feature is health.
AB - Tension in communication often prevents effective flows of information among members in the conversation and thus can negatively influence practices in teams and learning efficiency at school. Interested in developing a computational technique that automatically detects tension in communication, we explore features that signal the existence of tension in human-human communications and investigate the potential of a supervised learning approach. While there is no tension-annotated dataset available, there are language resources that have distress annotated. Although tension may occur during the communication as a result of various factors, distress creates discomfort and tension. Leveraging an interview dataset that has marked the presence/absence of distress, we investigated the prosodic features and LIWC features that indicate tension. Specifically, we compare 23 prosodic features and LIWC features extracted from 186 interviews in terms of how effective they are to indicate the speaker’s distress in a one-to-one conversation. Our analysis shows that there are seven prosodic features and one LIWC features that differ between distress and non-distress interviews. The seven prosodic features are mean intensity, jitter, shimmer, longest silence duration, longest silence position, standard deviation of interviewee speaking rate, and hesitation. And the one effective LIWC feature is health.
KW - Distress
KW - Machine learning
KW - Prosodic features
KW - Tension
UR - http://www.scopus.com/inward/record.url?scp=85088741972&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85088741972&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-49062-1_20
DO - 10.1007/978-3-030-49062-1_20
M3 - Conference contribution
AN - SCOPUS:85088741972
SN - 9783030490614
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 290
EP - 301
BT - Human-Computer Interaction. Multimodal and Natural Interaction - Thematic Area, HCI 2020, Held as Part of the 22nd International Conference, HCII 2020, Proceedings
A2 - Kurosu, Masaaki
PB - Springer
Y2 - 19 July 2020 through 24 July 2020
ER -