TY - JOUR
T1 - Bioinformatic mining of type I microsatellites from expressed sequence tags of channel catfish (Ictalurus punctatus)
AU - Serapion, Jerry
AU - Kucuktas, Huseyin
AU - Feng, Jinian
AU - Liu, Zhanjiang
PY - 2004/8
Y1 - 2004/8
N2 - Gene-derived markers are pivotal to the analysis of genome structure, organization, and evolution and necessary for comparative genomics. However, gene-derived markers are relatively difficult to develop. This project utilized the genomic resources of channel catfish expressed sequence tags (ESTs) to identify simple sequence repeats (SSRs), or microsatellites. It took the advantage of ESTs for the establishment of gene identities, and of microsatellites for the acquisition of high polymorphism. When microsatellites are tagged to genes, the microsatellites can then be used as gene markers. A bioinformatic analysis of 43,033 ESTs identified 4855 ESTs containing microsatellites. Cluster analysis indicated that 1312 of these ESTs fell into 569 contigs, and the remaining 3534 ESTs were singletons. A total of 4103 unique microsatellite-containing genes were identified. The dinucleotide CA/TG and GA/TC pairs were the most abundant microsatellites. AT-rich microsatellite types were predominant among trinucleotide and tetranucleotide microsatellites, consistent with our earlier estimation that the catfish genome is highly AT-rich. Our preliminary results indicated that the majority of the identified microsatellites were polymorphic and, therefore, useful for genetic linkage mapping of catfish. Mapping of these gene-derived markers is under way, which will set the foundation for comparative genome analysis in catfish.
AB - Gene-derived markers are pivotal to the analysis of genome structure, organization, and evolution and necessary for comparative genomics. However, gene-derived markers are relatively difficult to develop. This project utilized the genomic resources of channel catfish expressed sequence tags (ESTs) to identify simple sequence repeats (SSRs), or microsatellites. It took the advantage of ESTs for the establishment of gene identities, and of microsatellites for the acquisition of high polymorphism. When microsatellites are tagged to genes, the microsatellites can then be used as gene markers. A bioinformatic analysis of 43,033 ESTs identified 4855 ESTs containing microsatellites. Cluster analysis indicated that 1312 of these ESTs fell into 569 contigs, and the remaining 3534 ESTs were singletons. A total of 4103 unique microsatellite-containing genes were identified. The dinucleotide CA/TG and GA/TC pairs were the most abundant microsatellites. AT-rich microsatellite types were predominant among trinucleotide and tetranucleotide microsatellites, consistent with our earlier estimation that the catfish genome is highly AT-rich. Our preliminary results indicated that the majority of the identified microsatellites were polymorphic and, therefore, useful for genetic linkage mapping of catfish. Mapping of these gene-derived markers is under way, which will set the foundation for comparative genome analysis in catfish.
KW - Bioinformatics
KW - Data mining
KW - EST
KW - Fish
KW - Marker
KW - Microsatellite
UR - http://www.scopus.com/inward/record.url?scp=4344695759&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=4344695759&partnerID=8YFLogxK
U2 - 10.1007/s10126-003-0039-z
DO - 10.1007/s10126-003-0039-z
M3 - Article
C2 - 15136916
AN - SCOPUS:4344695759
SN - 1436-2228
VL - 6
SP - 364
EP - 377
JO - Marine Biotechnology
JF - Marine Biotechnology
IS - 4
ER -