Predicting prokaryotic ecological niches using genome sequence analysis

Garret Suen, Barry S. Goldman, Roy D. Welch

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

Automated DNA sequencing technology is so rapid that analysis has become the rate-limiting step. Hundreds of prokaryotic genome sequences are publicly available, with new genomes uploaded at the rate of approximately 20 per month. As a result, this growing body of genome sequences will include microorganisms not previously identified, isolated, or observed. We hypothesize that evolutionary pressure exerted by an ecological niche selects for a similar genetic repertoire in those prokaryotes that occupy the same niche, and that this is due to both vertical and horizontal transmission. To test this, we have developed a novel method to classify prokaryotes, by calculating their Pfam protein domain distributions and clustering them with all other sequenced prokaryotic species. Clusters of organisms are visualized in two dimensions as 'mountains' on a topological map. When compared to a phylogenetic map constructed using 16S rRNA, this map more accurately clusters prokaryotes according to functional and environmental attributes. We demonstrate the ability of this map, which we term a "niche map", to cluster according to ecological niche both quantitatively and qualitatively, and propose that this method be used to associate uncharacterized prokaryotes with their ecological niche as a means of predicting their functional role directly from their genome sequence.

Original languageEnglish (US)
Article numbere743
JournalPloS one
Volume2
Issue number8
DOIs
StatePublished - Aug 15 2007

ASJC Scopus subject areas

  • General Biochemistry, Genetics and Molecular Biology
  • General Agricultural and Biological Sciences
  • General

Fingerprint

Dive into the research topics of 'Predicting prokaryotic ecological niches using genome sequence analysis'. Together they form a unique fingerprint.

Cite this