Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank

Mark R. Costa, Jian Qin, Sarah Bratt

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

The advent of large data repositories and the necessity of distributed skillsets have led to a need to study the scientific collaboration network emerging around cyber-infrastructure-enabled repositories. To explore the impact of scientific collaboration and large-scale repositories in the field of genomics, we analyze coauthorship patterns in NCBIs big data repository GenBank using trace metadata from coauthorship of traditional publications and coauthorship of datasets. We demonstrate that using complex network analysis to explore both networks independently and jointly provides a much richer description of the community, and addresses some of the methodological concerns discussed in previous literature regarding the use of coauthorship data to study scientific collaboration.

Original languageEnglish (US)
Pages (from-to)1-20
Number of pages20
JournalScientometrics
DOIs
StateAccepted/In press - May 9 2016

Keywords

  • Big data repository
  • Complex network analysis
  • Cyber-infrastructure enabled science
  • Scientific collaboration
  • Team science

ASJC Scopus subject areas

  • Computer Science Applications
  • Social Sciences(all)
  • Library and Information Sciences
  • Law

Fingerprint Dive into the research topics of 'Emergence of collaboration networks around large scale data repositories: a study of the genomics community using GenBank'. Together they form a unique fingerprint.

Cite this