The Human Genome Browser at UCSC

Genome Research - Volume 12, Issue 6 - Pages 996-1006 - 2002
W. James Kent1, Charles W. Sugnet2, Terrence S. Furey2, Krishna M. Roskin2, Tom H. Pringle3, Alan M. Zahler1, and David Haussler4
1Department of Molecular, Cellular, and Developmental Biology, and Center for Molecular Biology of RNA, University of California, Santa Cruz, California 95064, USA;
2Department of Computer Science, University of California, Santa Cruz, California 95064, USA;
3Sperling Biomedical Foundation; Eugene, Oregon, 97405, USA;
4Howard Hughes Medical Institute and Department of Computer Science, University of California, Santa Cruz, California 95064, USA

Abstract

As vertebrate genome sequences near completion and research refocuses to their analysis, the issue of effective genome annotation display becomes critical. A mature web tool for rapid and reliable display of any requested portion of the genome at any scale, together with several dozen aligned annotation tracks, is provided at http://genome.ucsc.edu. This browser displays assembly contigs and gaps, mRNA and expressed sequence tag alignments, multiple gene predictions, cross-species homologies, single nucleotide polymorphisms, sequence-tagged sites, radiation hybrid data, transposon repeats, and more as a stack of coregistered tracks. Text and sequence-based searches provide quick and precise access to any region of specific interest. Secondary links from individual features lead to sequence details and supplementary off-site databases. One-half of the annotation tracks are computed at the University of California, Santa Cruz from publicly available sequence data; collaborators worldwide provide the rest. Users can stably add their own custom tracks to the browser for educational or research purposes. The conceptual and technical framework of the browser, its underlying MySQL database, and overall use are described. The web site currently serves over 50,000 pages per day to over 3000 different users.
