Collaboration Networks and Career Trajectories: What Do Metadata from Data Repositories Tell Us?

Jeff Hemsley, Jian Qin, Sarah Bratt, Alexander Smith

Research output: Contribution to journalArticlepeer-review

2 Scopus citations


Science is increasingly carried out through scientific collaborations, allowing researchers pool their experience, knowledge, and skills. In this work we identify factors related to a scientist’s collaboration capacity, their ability accumulate new collaborations over their career. To do this offer a new collaboration capacity framework and begin the work of validating it empirically by testing a number of hypotheses. We use data from GenBank, a cyberinfrastructure (CI)-enabled data repository that stores and manages scientific data. The data allow us to construct longitudinal networks, thereby giving us yearly scientific collaboration maps. We find that a scientist’s network position at an early stage is related to their capacity to build new collaborations and that researchers who manage an upward trend in productivity tend to have higher collaboration capacity. Our work makes a contribution to science of science studies by offering a collaboration capacity framework and providing partial empirical support for it.

Original languageEnglish (US)
Pages (from-to)100-110
Number of pages11
JournalProceedings of the Association for Information Science and Technology
Issue number1
StatePublished - 2022


  • Assessment metrics
  • Collaboration capacity
  • Collaboration networks
  • Metadata analytics
  • Research performance assessment

ASJC Scopus subject areas

  • General Computer Science
  • Library and Information Sciences


Dive into the research topics of 'Collaboration Networks and Career Trajectories: What Do Metadata from Data Repositories Tell Us?'. Together they form a unique fingerprint.

Cite this