TY - GEN
T1 - Taming a menagerie of heavy tails with skew path analysis
AU - Introne, Josh
AU - Goggins, Sean
N1 - Publisher Copyright:
© 2015 ACM.
PY - 2015/6/28
Y1 - 2015/6/28
N2 - The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets. In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.
AB - The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets. In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.
KW - Diversity
KW - Dynamics
KW - Power Law
KW - Social media
UR - http://www.scopus.com/inward/record.url?scp=84978044475&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84978044475&partnerID=8YFLogxK
U2 - 10.1145/2786451.2786484
DO - 10.1145/2786451.2786484
M3 - Conference contribution
AN - SCOPUS:84978044475
T3 - Proceedings of the 2015 ACM Web Science Conference
BT - Proceedings of the 2015 ACM Web Science Conference
PB - Association for Computing Machinery, Inc
T2 - 7th ACM Web Science Conference, WebSci 2015
Y2 - 28 June 2015 through 1 July 2015
ER -