Comparative Analysis of CpG Islands in Four Fish Genomes

Comparative and Functional Genomics - Tập 2008 - Trang 1-6 - 2008
Leng Han1,2,3, Zhongming Zhao4,5,3
1Graduate School, Chinese Academy of Sciences, Beijing 100039, China
2State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, 650223, China
3Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA 23298, USA
4Center for the Study of Biological Complexity, Virginia Commonwealth University, Richmond, VA 23284, USA
5Department of Human Genetics, Virginia Commonwealth University, Richmond, VA 23298, USA

Tóm tắt

There has been much interest in CpG islands (CGIs), clusters of CpG dinucleotides in GC-rich regions, because they are considered gene markers and involved in gene regulation. To date, there has been no genome-wide analysis of CGIs in the fish genome. We first evaluated the performance of three popular CGI identification algorithms in four fish genomes (tetraodon, stickleback, medaka, and zebrafish). Our results suggest that Takai and Jones' (2002) algorithm is most suitable for comparative analysis of CGIs in the fish genome. Then, we performed a systematic analysis of CGIs in the four fish genomes using Takai and Jones' algorithm, compared to other vertebrate genomes. We found that both the number of CGIs and the CGI density vary greatly among these genomes. Remarkably, each fish genome presents a distinct distribution of CGI density with some genomic factors (e.g., chromosome size and chromosome GC content). These findings are helpful for understanding evolution of fish genomes and the features of fish CGIs.

Từ khóa


Tài liệu tham khảo

10.1038/35057062

10.1007/s00018-003-3088-6

10.1093/nar/8.7.1499

10.1038/287560a0

10.1073/pnas.87.12.4692

10.1038/ng1990

10.1073/pnas.052410099

10.1158/0008-5472.CAN-05-1980

10.1073/pnas.90.24.11995

10.1007/BF01233381

10.1093/molbev/msm128

10.1038/ng0396-321

10.1038/nature03154

10.1126/science.1072104

10.1016/0022-2836(87)90689-9

10.1186/1471-2105-7-446

10.1093/bioinformatics/18.4.631

10.1093/bioinformatics/bth059

10.1093/bib/3.1.87

10.1038/nature01262

2003, In Silico Biology, 3, 235

10.1038/sj.hdy.6800635

10.1016/j.gene.2005.08.024

10.1016/j.ygeno.2005.09.012