Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank

Mark R. Costa, Jian Qin, Sarah Bratt

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

The advent of large data repositories and the necessity of distributed skillsets have led to a need to study the scientific collaboration network emerging around cyber-infrastructure-enabled repositories. To explore the impact of scientific collaboration and large-scale repositories in the field of genomics, we analyze coauthorship patterns in NCBIs big data repository GenBank using trace metadata from coauthorship of traditional publications and coauthorship of datasets. We demonstrate that using complex network analysis to explore both networks independently and jointly provides a much richer description of the community, and addresses some of the methodological concerns discussed in previous literature regarding the use of coauthorship data to study scientific collaboration.

Original languageEnglish (US)
Pages (from-to)21-40
Number of pages20
JournalScientometrics
Volume108
Issue number1
DOIs
StatePublished - Jul 1 2016

Keywords

  • Big data repository
  • Complex network analysis
  • Cyber-infrastructure enabled science
  • Scientific collaboration
  • Team science

ASJC Scopus subject areas

  • General Social Sciences
  • Computer Science Applications
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank'. Together they form a unique fingerprint.

Cite this