TY - JOUR
T1 - Coding and Classifying Knowledge Exchange on Social Media
T2 - A Comparative Analysis of the #Twitterstorians and AskHistorians Communities
AU - Gruzd, Anatoliy
AU - Kumar, Priya
AU - Abul-Fottouh, Deena
AU - Haythornthwaite, Caroline
N1 - © The Author(s) 2020.
PY - 2020
Y1 - 2020
AB - As social media become a staple for knowledge discovery and sharing, questions arise about how self-organizing communities manage learning outside the domain of organized, authority-led institutions. Yet examination of such communities is challenged by the quantity of posts and variety of media now used for learning. This paper addresses the challenges of identifying (1) what information, communication, and discursive practices support successful online communities, (2) whether such practices are similar on Twitter and Reddit, and (3) whether machine learning classifiers can be successfully used to analyze larger datasets of learning exchanges. This paper builds on earlier work that used manual coding of learning and exchange in Reddit 'Ask' communities to derive a coding schema we refer to as 'learning in the wild'. This schema comprises eight categories: explanation with disagreement, agreement, or neutral presentation; socializing with negative or positive intent; information seeking; providing resources; and comments about forum rules and norms. To compare across media, results from coding Reddit's AskHistorians are compared to results from coding a sample of #Twitterstorians tweets (n = 594). High agreement between coders affirmed the applicability of the coding schema to this different medium. LIWC lexicon-based text analysis was used to build machine learning classifiers and apply these to code a larger dataset of tweets (n = 69,101). This research shows that the 'learning in the wild' coding schema holds across at least two different platforms, and is partially scalable to study larger online learning communities.
U2 - 10.1007/s10606-020-09376-y
DO - 10.1007/s10606-020-09376-y
M3 - Article
C2 - 33343085
SN - 0925-9724
VL - 29
SP - 629
EP - 656
JO - Computer Supported Cooperative Work
JF - Computer Supported Cooperative Work
IS - 6
ER -