Abstract:
As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, and EST sequences. The gene-building system rests on top of the core Ensembl (MySQL) database schema and Perl Application Programming Interface (API), and the data generated are accessible through the Ensembl genome browser (http://www.ensembl.org). To date, the Ensembl predicted gene sets are available for the A. gambiae, C. briggsae, zebrafish, mouse, rat, and human genomes and have been heavily relied upon in the publication of the human, mouse, rat, and A. gambiae genome sequence analysis. Here we describe in detail the gene-building system and the algorithms involved. All code and data are freely available from http://www.ensembl.org.
journal_name:Genome Res
journal_title:Genome research
authors:Curwen V, Eyras E, Andrews TD, Clarke L, Mongin E, Searle SM, Clamp M
doi:10.1101/gr.1858004
subject:Has Abstract
pub_date:2004-05-01 00:00:00
pages:942-50
issue:5
eissn:1088-9051
issn:1549-5469
pii:14/5/942
journal_volume:14
pub_type: Journal article
Related literature
GENOME RESEARCH literature collection
abstract::The regulation of gene expression is mediated at the transcriptional level by enhancer regions that are bound by sequence-specific transcription factors (TFs). Recent studies have shown that the in vivo binding sites of single TFs differ between developmental or cellular contexts. How this context-specific binding is ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.132811.111
update date:2012-10-01 00:00:00
abstract::Despite recent progress in genome topology knowledge, the role of repeats, which make up the majority of mammalian genomes, remains elusive. Satellite repeats are highly abundant sequences that cluster around centromeres, attract pericentromeric heterochromatin, and aggregate into nuclear chromocenters. These nuclear ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.186643.114
update date:2015-07-01 00:00:00
abstract::The recent publication of the FANTOM mouse transcriptome has provided a unique opportunity to study the diversity of transcripts arising from a single gene locus. We have focused on the Gnas complex, as imprinting loci themselves provide unique insights into transcriptional regulation. Thirteen full-length cDNAs from ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.955503
update date:2003-06-01 00:00:00
abstract::Comparative functional genomics studies the evolution of biological processes by analyzing functional data, such as gene expression profiles, across species. A major challenge is to compare profiles collected in a complex phylogeny. Here, we present Arboretum, a novel scalable computational algorithm that integrates e...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.146233.112
update date:2013-06-01 00:00:00
abstract::The in vitro cloning of DNA molecules traditionally uses PCR amplification or site-specific restriction endonucleases to generate linear DNA inserts with defined termini and requires DNA ligase to covalently join those inserts to vectors with the corresponding ends. We have used the properties of Vaccinia DNA topoisom...
journal_title:Genome research
pub_type: Journal article
doi:
update date:1999-04-01 00:00:00
abstract::Mycoplasma mycoides subsp. mycoides SC (MmymySC) is the etiological agent of contagious bovine pleuropneumonia (CBPP), a highly contagious respiratory disease in cattle. The genome of MmymySC type strain PG1(T) has been sequenced to map all the genes and to facilitate further studies regarding the cell function of the ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.1673304
update date:2004-02-01 00:00:00
abstract::Long interspersed nuclear element-1 (LINE-1 or L1) retrotransposons are normally suppressed in somatic tissues mainly due to DNA methylation and antiviral defense. However, the mechanism to suppress L1s may be disrupted in cancers, thus allowing L1s to act as insertional mutagens and cause genomic rearrangement and in...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.231837.117
update date:2018-08-01 00:00:00
abstract::There are numerous examples from the genomes of viruses, mitochondria, and chromosomes that adjacent genes can overlap, sharing at least one nucleotide. Overlaps have been hypothesized to be involved in genome size minimization and as a regulatory mechanism of gene expression. Here we show that overlapping genes are a...
journal_title:Genome research
pub_type: Letter
doi:10.1101/gr.2433104
update date:2004-11-01 00:00:00
abstract::In vivo analyses of the occurrence, subcellular localization, and dynamics of protein-protein interactions (PPIs) are important issues in functional proteomic studies. The bimolecular fluorescence complementation (BiFC) assay has many advantages in that it provides a reliable way to detect PPIs in living cells with mi...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.231860.117
update date:2019-01-01 00:00:00
abstract::The human genome is estimated to contain 23,000 to 33,000 retropseudogenes. To study the properties of genes giving rise to these retroelements, we compared the structure and expression of genes with or without known retropseudogenes. Four main features have emerged from the analysis of 181 genes associated to retrops...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.10.5.672
update date:2000-05-01 00:00:00
abstract::While metagenomics has emerged as a technology of choice for analyzing bacterial populations, the assembly of metagenomic data remains challenging, thus stifling biological discoveries. Moreover, recent studies revealed that complex bacterial populations may be composed from dozens of related strains, thus further amp...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.213959.116
update date:2017-05-01 00:00:00
abstract::Natural killer (NK) cells contribute to the essential functions of innate immunity and reproduction. Various genes encode NK cell receptors that recognize the major histocompatibility complex (MHC) Class I molecules expressed by other cells. For primate NK cells, the killer-cell immunoglobulin-like receptors (KIR) are...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.085738.108
update date:2009-05-01 00:00:00
abstract::We present a powerful application of ultra high-throughput sequencing, SAGE-Seq, for the accurate quantification of normal and neoplastic mammary epithelial cell transcriptomes. We develop data analysis pipelines that allow the mapping of sense and antisense strands of mitochondrial and RefSeq genes, the normalization...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.108217.110
update date:2010-12-01 00:00:00
abstract::Diversity in the antigen-binding receptors of the immune system has long been a primary interest of biologists. Recently it has been suggested that polymorphism in regulatory (noncoding) gene segments is of substantial importance as well. Here, we survey the level of variation in MHC class II gene promoters in man and...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.8.2.124
update date:1998-02-01 00:00:00
abstract::A total of 202 genes were cytogenetically mapped to goat chromosomes, multiplying by five the total number of regional gene localizations in domestic ruminants (255). This map encompasses 249 and 173 common anchor loci regularly spaced along human and murine chromosomes, respectively, which makes it possible to perfor...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.8.9.901
update date:1998-09-01 00:00:00
abstract::Eukaryotic translation initiation involves preinitiation ribosomal complex 5'-to-3' directional probing of mRNA for codons suitable for starting protein synthesis. The recognition of codons as starts depends on the codon identity and on its immediate nucleotide context known as Kozak context. When the context is weak ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.257352.119
update date:2020-07-01 00:00:00
abstract::The epistatically interacting modifier loci (Apmt1 and Apmt2) accelerate the polyoma Middle-T (PyVT)-induced mammary tumor. To identify potential candidate genes for these loci, a combined bioinformatics and genomics strategy was used. On the basis of the assumption that the loci were functioning in the same or intersecting pat...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.210502
update date:2002-06-01 00:00:00
abstract::It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.097543.109
update date:2010-01-01 00:00:00
abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.828403
update date:2003-01-01 00:00:00
abstract::Human tumors are comprised of heterogeneous cell populations that display diverse molecular and phenotypic features. To examine the extent to which epigenetic differences contribute to intratumoral cellular heterogeneity, we have developed a high-throughput method, termed MAPit-patch. The method uses multiplexed ampli...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.161737.113
update date:2014-02-01 00:00:00
abstract::Theory is developed for the process of sequencing randomly selected large-insert clones. Genome size, library depth, clone size, and clone distribution are considered relevant properties and perfect overlap detection for contig assembly is assumed. Genome-specific and nonrandom effects are neglected. Order of magnitud...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.gr-1339r
update date:2001-02-01 00:00:00
abstract::Identifying transcriptional regulatory elements represents a significant challenge in annotating the genomes of higher vertebrates. We have developed a computational tool, rVista, for high-throughput discovery of cis-regulatory elements that combines clustering of predicted transcription factor binding sites (TFBSs) a...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.225502
update date:2002-05-01 00:00:00
abstract::We have developed a computer program that aligns spliced sequences to genomic sequences, using local alignment algorithms and heuristics to put together a global spliced alignment. Spidey can produce reliable alignments quickly, even when confronted with noise from alternative splicing, polymorphisms, sequencing error...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.195301
update date:2001-11-01 00:00:00
abstract::Mammalian genomes are partitioned into domains that replicate in a defined temporal order. These domains can replicate at similar times in all cell types (constitutive) or at cell type-specific times (developmental). Genome-wide chromatin conformation capture (Hi-C) has revealed sub-megabase topologically associating ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.183699.114
update date:2015-08-01 00:00:00
abstract::Mosaic mutations present in the germline have important implications for reproductive risk and disease transmission. We previously demonstrated a phenomenon occurring in the male germline, whereby specific mutations arising spontaneously in stem cells (spermatogonia) lead to clonal expansion, resulting in elevated mut...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.239186.118
update date:2018-12-01 00:00:00
abstract::MicroRNAs (miRNAs) are major post-transcriptional regulators of gene expression, yet their origins and functional evolution in mammals remain little understood due to the lack of appropriate comparative data. Using RNA sequencing, we have generated extensive and comparable miRNA data for five organs in six species tha...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.140269.112
update date:2013-01-01 00:00:00
abstract::Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencin...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.176586.114
update date:2014-12-01 00:00:00
abstract::As part of an effort to identify the gene responsible for the predominant form of polycystic kidney disease (PKD1), we used a gridded human P1 library for contig assembly. The interval of interest, a 700-kb segment on chromosome 16p13.3, can be physically delineated by the genetic markers D16S125 and D16S84 and chromo...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6.6.515
update date:1996-06-01 00:00:00
abstract::Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous pr...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.4759706
update date:2006-06-01 00:00:00
abstract::A region-specific ENU mutagenesis screen was conducted to elucidate the functional content of proximal mouse Chr 5. We used the visibly marked, recessive, lethal inversion Rump White (Rw) as a balancer in a three-generation breeding scheme to identify recessive mutations within the approximately 50 megabases spanned b...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.3826505
update date:2005-08-01 00:00:00