Genome Research

  1549-5469

  1088-9051

  Mỹ

Cơ quản chủ quản:  COLD SPRING HARBOR LAB PRESS, PUBLICATIONS DEPT , Cold Spring Harbor Laboratory Press

Lĩnh vực:
Genetics (clinical)Genetics

Phân tích ảnh hưởng

Thông tin về tạp chí

 

Launched in 1995, Genome Research is an international, continuously published, peer-reviewed journal that publishes studies on the structure, function, biology, and evolution of genomes of all organisms, with a long-standing emphasis on genome-scale analyses, chromatin structure and function, epigenomics, systems genetics, and proteomics. We also feature gene discoveries that provide novel insights and reports of high-throughput experimental and computational approaches. In addition to these core areas, the journal is now particularly interested in the following areas: comprehensive characterization of genome sequence variation; understanding the basis of phenotypic complexity; network biology and the functional effects of genetic variation; evolutionary mechanisms of species origin, adaptation, and genomic diversity, including the study of ancient DNA; genomic medicine, including mechanisms of penetrance, expressivity, and modifier effects; understanding genomic and biological processes through single-cell analyses; novel computational algorithms that provide insight into biological processes; genome editing and engineering.

Các bài báo tiêu biểu

Transcription factor and microRNA motif discovery: The Amadeus platform and a compendium of metazoan target sets
Tập 18 Số 7 - Trang 1180-1189 - 2008
Chaim Linhart, Yonit Halperin, Ron Shamir
We present a threefold contribution to the computational task of motif discovery, a key component in the effort of delineating the regulatory map of a genome: (1) We constructed a comprehensive large-scale, publicly-available compendium of transcription factor and microRNA target gene sets derived from diverse high-throughput experiments in several metazoans. We used the compendium as a benchmark for motif discovery tools. (2) We developed Amadeus, a highly efficient, user-friendly software platform for genome-scale detection of novel motifs, applicable to a wide range of motif discovery tasks. Amadeus improves upon extant tools in terms of accuracy, running time, output information, and ease of use and is the only program that attained a high success rate on the metazoan compendium. (3) We demonstrate that by searching for motifs based on their genome-wide localization or chromosomal distributions (without using a predefined target set), Amadeus uncovers diverse known phenomena, as well as novel regulatory motifs.
Sex-biased genetic effects on gene regulation in humans
Tập 22 Số 12 - Trang 2368-2375 - 2012
Antigone S. Dimas, Alexandra C. Nica, Stephen B. Montgomery, Barbara E. Stranger, Towfique Raj, Alfonso Buil, Thomas Giger, Tuuli Lappalainen, María Gutiérrez‐Arcelus, Mark I. McCarthy, Emmanouil T. Dermitzakis
Human regulatory variation, reported as expression quantitative trait loci (eQTLs), contributes to differences between populations and tissues. The contribution of eQTLs to differences between sexes, however, has not been investigated to date. Here we explore regulatory variation in females and males and demonstrate that 12%–15% of autosomal eQTLs function in a sex-biased manner. We show that genes possessing sex-biased eQTLs are expressed at similar levels across the sexes and highlight cases of genes controlling sexually dimorphic and shared traits that are under the control of distinct regulatory elements in females and males. This study illustrates that sex provides important context that can modify the effects of functional genetic variants.
Fast model-based estimation of ancestry in unrelated individuals
Tập 19 Số 9 - Trang 1655-1664 - 2009
David H. Alexander, John Novembre, Kenneth Lange
Population stratification has long been recognized as a confounding factor in genetic association studies. Estimated ancestries, derived from multi-locus genotype data, can be used to perform a statistical correction for population stratification. One popular technique for estimation of ancestry is the model-based approach embodied by the widely applied program structure. Another approach, implemented in the program EIGENSTRAT, relies on Principal Component Analysis rather than model-based estimation and does not directly deliver admixture fractions. EIGENSTRAT has gained in popularity in part owing to its remarkable speed in comparison to structure. We present a new algorithm and a program, ADMIXTURE, for model-based estimation of ancestry in unrelated individuals. ADMIXTURE adopts the likelihood model embedded in structure. However, ADMIXTURE runs considerably faster, solving problems in minutes that take structure hours. In many of our experiments, we have found that ADMIXTURE is almost as fast as EIGENSTRAT. The runtime improvements of ADMIXTURE rely on a fast block relaxation scheme using sequential quadratic programming for block updates, coupled with a novel quasi-Newton acceleration of convergence. Our algorithm also runs faster and with greater accuracy than the implementation of an Expectation-Maximization (EM) algorithm incorporated in the program FRAPPE. Our simulations show that ADMIXTURE's maximum likelihood estimates of the underlying admixture coefficients and ancestral allele frequencies are as accurate as structure's Bayesian estimates. On real-world data sets, ADMIXTURE's estimates are directly comparable to those from structure and EIGENSTRAT. Taken together, our results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.
Creating the Gene Ontology Resource: Design and Implementation
Tập 11 Số 8 - Trang 1425-1433 - 2001
The Gene Ontology Consortium
The exponential growth in the volume of accessible biological information has generated a confusion of voices surrounding the annotation of molecular information about genes and their products. The Gene Ontology (GO) project seeks to provide a set of structured vocabularies for specific biological domains that can be used to describe gene products in any organism. This work includes building three extensive ontologies to describe molecular function, biological process, and cellular component, and providing a community database resource that supports the use of these ontologies. The GO Consortium was initiated by scientists associated with three model organism databases: SGD, the Saccharomyces Genome database; FlyBase, the Drosophila genome database; and MGD/GXD, the Mouse Genome Informatics databases. Additional model organism database groups are joining the project. Each of these model organism information systems is annotating genes and gene products using GO vocabulary terms and incorporating these annotations into their respective model organism databases. Each database contributes its annotation files to a shared GO data resource accessible to the public athttp://www.geneontology.org/. The GO site can be used by the community both to recover the GO vocabularies and to access the annotated gene product data sets from the model organism databases. The GO Consortium supports the development of the GO database resource and provides tools enabling curators and researchers to query and manipulate the vocabularies. We believe that the shared development of this molecular annotation resource will contribute to the unification of biological information.
An Efficient Method to Generate Chromosomal Rearrangements by Targeted DNA Double-Strand Breaks in<i>Drosophila melanogaster</i>
Tập 14 Số 7 - Trang 1382-1393 - 2004
Dieter Egli, Ernst Hafen, Walter Schaffner
Homologous recombination (HR) is an indispensable tool to modify the genome of yeast and mammals. More recently HR is also being used for gene targeting inDrosophila. Here we show that HR can be used efficiently to engineer chromosomal rearrangements such as pericentric and paracentric inversions and translocations inDrosophila. Two chromosomal double-strand breaks (DSBs), introduced by the rare-cutting I-SceI endonuclease on two different mobile elements sharing homologous sequences, are sufficient to promote rearrangements at a frequency of 1% to 4%. Such rearrangements, once generated by HR, can be reverted by Cre recombinase. However, Cre-mediated recombination efficiency drops with increasing distance between recombination sites, unlike HR. We therefore speculate that physical constraints on chromosomal movement are modulated during DSB repair, to facilitate the homology search throughout the genome.
Genetic and environmental exposures constrain epigenetic drift over the human life course
Tập 24 Số 11 - Trang 1725-1733 - 2014
Sonia Shah, Allan F. McRae, Riccardo E. Marioni, Sarah E. Harris, Jude Gibson, Anjali K. Henders, Paul Redmond, Simon R. Cox, Ville Karhunen, Janie Corley, Lee Murphy, Nicholas G. Martin, Grant W. Montgomery, Robert Plomin, Naomi R. Wray, Ian J. Deary, Peter M. Visscher
Epigenetic mechanisms such as DNA methylation (DNAm) are essential for regulation of gene expression. DNAm is dynamic, influenced by both environmental and genetic factors. Epigenetic drift is the divergence of the epigenome as a function of age due to stochastic changes in methylation. Here we show that epigenetic drift may be constrained at many CpGs across the human genome by DNA sequence variation and by lifetime environmental exposures. We estimate repeatability of DNAm at 234,811 autosomal CpGs in whole blood using longitudinal data (2–3 repeated measurements) on 478 older people from two Scottish birth cohorts—the Lothian Birth Cohorts of 1921 and 1936. Median age was 79 yr and 70 yr, and the follow-up period was ∼10 yr and ∼6 yr, respectively. We compare this to methylation heritability estimated in the Brisbane Systems Genomics Study, a cross-sectional study of 117 families (offspring median age 13 yr; parent median age 46 yr). CpG repeatability in older people was highly correlated (0.68) with heritability estimated in younger people. Highly heritable sites had strong underlying cis-genetic effects. Thirty-seven and 1687 autosomal CpGs were associated with smoking and sex, respectively. Both sets were strongly enriched for high repeatability. Sex-associated CpGs were also strongly enriched for high heritability. Our results show that a large number of CpGs across the genome, as a result of environmental and/or genetic constraints, have stable DNAm variation over the human lifetime. Moreover, at a number of CpGs, most variation in the population is due to genetic factors, despite some sites being highly modifiable by the environment.
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes
Tập 25 Số 7 - Trang 1043-1055 - 2015
Donovan H. Parks, Michael Imelfort, Connor T. Skennerton, Philip Hugenholtz, Gene W. Tyson
Large-scale recovery of genomes from isolates, single cells, and metagenomic data has been made possible by advances in computational methods and substantial reductions in sequencing costs. Although this increasing breadth of draft genomes is providing key information regarding the evolutionary and functional diversity of microbial life, it has become impractical to finish all available reference genomes. Making robust biological inferences from draft genomes requires accurate estimates of their completeness and contamination. Current methods for assessing genome quality are ad hoc and generally make use of a limited number of “marker” genes conserved across all bacterial or archaeal genomes. Here we introduce CheckM, an automated method for assessing the quality of a genome using a broader set of marker genes specific to the position of a genome within a reference genome tree and information about the collocation of these genes. We demonstrate the effectiveness of CheckM using synthetic data and a wide range of isolate-, single-cell-, and metagenome-derived genomes. CheckM is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches. Using CheckM, we identify a diverse range of errors currently impacting publicly available isolate genomes and demonstrate that genomes obtained from single cells and metagenomic data vary substantially in quality. In order to facilitate the use of draft genomes, we propose an objective measure of genome quality that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities.
On the origin of prokaryotic species
Tập 19 Số 5 - Trang 744-756 - 2009
W. Ford Doolittle, Olga Zhaxybayeva
The notion that all prokaryotes belong to genomically and phenomically cohesive clusters that we might legitimately call “species” is a contentious one. At issue are (1) whether such clusters actually exist; (2) what species definition might most reliably identify them, if they do; and (3) what species concept—by which is meant a genetic and ecological theory of speciation—might best explain species existence and rationalize a species definition, if we could agree on one. We review existing theories and some relevant data. We conclude that microbiologists now understand in some detail the various genetic, population, and ecological processes that effect the evolution of prokaryotes. There will be on occasion circumstances under which these, working together, will form groups of related organisms sufficiently like each other that we might all agree to call them “species,” but there is no reason that this must always be so. Thus, there is no principled way in which questions about prokaryotic species, such as how many there are, how large their populations are, or how globally they are distributed, can be answered. These questions can, however, be reformulated so that metagenomic methods and thinking will meaningfully address the biological patterns and processes whose understanding is our ultimate target.
Genome-wide analysis of Fis binding in <i>Escherichia coli</i> indicates a causative role for A-/AT-tracts
Tập 18 Số 6 - Trang 900-910 - 2008
Byung‐Kwan Cho, Eric M. Knight, Christian Barrett, Bernhard Ø. Palsson
We determined the genome-wide distribution of the nucleoid-associated protein Fis in Escherichia coli using chromatin immunoprecipitation coupled with high-resolution whole genome-tiling microarrays. We identified 894 Fis-associated regions across the E. coli genome. A significant number of these binding sites were found within open reading frames (33%) and between divergently transcribed transcripts (5%). Analysis indicates that A-tracts and AT-tracts are an important signal for preferred Fis-binding sites, and that A6-tracts in particular constitute a high-affinity signal that dictates Fis phasing in stretches of DNA containing multiple and variably spaced A-tracts and AT-tracts. Furthermore, we find evidence for an average of two Fis-binding regions per supercoiling domain in the chromosome of exponentially growing cells. Transcriptome analysis shows that ∼21% of genes are affected by the deletion of fis; however, the changes in magnitude are small. To address the differential Fis bindings under growth environment perturbation, ChIP-chip analysis was performed using cells grown under aerobic and anaerobic growth conditions. Interestingly, the Fis-binding regions are almost identical in aerobic and anaerobic growth conditions—indicating that the E. coli genome topology mediated by Fis is superficially identical in the two conditions. These novel results provide new insight into how Fis modulates DNA topology at a genome scale and thus advance our understanding of the architectural bases of the E. coli nucleoid.
A Large Imprinted microRNA Gene Cluster at the Mouse Dlk1-Gtl2 Domain
Tập 14 Số 9 - Trang 1741-1748 - 2004
Hervé Seitz, Hélène Royo, Marie-Line Bortolin-Cavaillé, Shau‐Ping Lin, Anne C. Ferguson‐Smith, Jérôme Cavaillé
microRNAs (or miRNAs) are small noncoding RNAs (21 to 25 nucleotides) that are processed from longer hairpin RNA precursors and are believed to be involved in a wide range of developmental and cellular processes, by either repressing translation or triggering mRNA degradation (RNA interference). By using a computer-assisted approach, we have identified 46 potential miRNA genes located in the human imprinted 14q32 domain, 40 of which are organized as a large cluster. Although some of these clustered miRNA genes appear to be encoded by a single-copy DNA sequence, most of them are arranged in tandem arrays of closely related sequences. In the mouse, this miRNA gene cluster is conserved at the homologous distal 12 region. In vivo all the miRNAs that we have detected are expressed in the developing embryo (both in the head and in the trunk) and in the placenta, whereas in the adult their expression is mainly restricted to the brain. We also show that the miRNA genes are only expressed from the maternally inherited chromosome and that their imprinted expression is regulated by an intergenic germline-derived differentially methylated region (IG-DMR) located ∼200 kb upstream from the miRNA cluster. The functions of these miRNAs, which seem only conserved in mammals, are discussed both in terms of epigenetic control and gene regulation during development.