Abstract:
:The functional classification of genes on a genome-wide scale is now in its infancy, and we make a first attempt to assess existing methods and identify sources of error. To this end, we compared two independent efforts for associating proteins with functions, one implemented by FlyBase and the other by PANTHER at Celera Genomics. Both methods make inferences based on sequence similarity and the available experimental evidence. However, they differ considerably in methodology and process. Overall, assuming that the systematic error across the two methods is relatively small, we find the protein-to-function association error rate of both the FlyBase and PANTHER methods to be <2%. The primary source of error for both methods appears to be simple human error. Although homology-based inference can certainly cause errors in annotation, our analysis indicates that the frequency of such errors is relatively small compared with the number of correct inferences. Moreover, these homology errors can be minimized by careful tree-based inference, such as that implemented in PANTHER. Often, functional associations are made by one method and not the other, indicating that one of the greatest challenges lies in improving the completeness of available ontology associations.
journal_name
Genome Resjournal_title
Genome researchauthors
Mi H,Vandergriff J,Campbell M,Narechania A,Majoros W,Lewis S,Thomas PD,Ashburner Mdoi
10.1101/gr.771603subject
Has Abstractpub_date
2003-09-01 00:00:00pages
2118-28issue
9eissn
1088-9051issn
1549-5469pii
13/9/2118journal_volume
13pub_type
杂志文章相关文献
GENOME RESEARCH文献大全abstract::Levels of diversity vary across the human genome. This variation is caused by two forces: differences in mutation rates and the differential impact of natural selection. Pertinent to the question of the relative importance of these two forces is the observation that both diversity within species and interspecies diver...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3461105
更新日期:2005-09-01 00:00:00
abstract::We have identified three new families of insulin homologs in Caenorhabditis elegans. In two of these families, concerted mutations suggest that an additional disulfide bond links B and A domains, and that the A-domain internal disulfide bond is substituted by a hydrophobic interaction. Homology modeling remarkably con...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.8.4.348
更新日期:1998-04-01 00:00:00
abstract::Genomic comparisons provide evidence for ancient genome-wide duplications in a diverse array of animals and plants. We developed a birth-death model to identify evidence for genome duplication in EST data, and applied a mixture model to estimate the age distribution of paralogous pairs identified in EST sets for speci...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.4825606
更新日期:2006-06-01 00:00:00
abstract::Biological products of importance in food (e.g., milk) and medical (e.g., donor blood-derived products) sciences often correspond to mixtures of samples contributed by multiple individuals. Identifying which individuals contributed to the mixture and in what proportions may be of interest in several circumstances. We ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.256172.119
更新日期:2020-08-01 00:00:00
abstract::The apicomplexan Cryptosporidium parvum is one of the most prevalent protozoan parasites of humans. We report the physical mapping of the genome of the Iowa isolate, sequencing and analysis of chromosome 6, and approximately 0.9 Mbp of sequence sampled from the remainder of the genome. To construct a robust physical m...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.1555203
更新日期:2003-08-01 00:00:00
abstract::Interferons and interleukin-10 are involved in key aspects of the host defence mechanisms. Human chromosome 21 harbors the interferon/interleukin-10 receptor gene cluster linked to the GART gene. This cluster includes both components of the interferon alpha/beta-receptor (IFNAR1 and IFNAR2) and the second components o...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:1999-03-01 00:00:00
abstract::The detailed genomic organization of a gene-dense region at human chromosome 12p13, spanning 223 kb of contiguous sequence, was determined. This region is composed of 20 genes and several other expressed sequences. Experimental tools including RT-PCR and cDNA sequencing, combined with gene prediction programs, were ut...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.7.3.268
更新日期:1997-03-01 00:00:00
abstract::Upon invasion of the erythrocyte cell, the malaria parasite remodels its environment; in particular, it establishes a complex membrane network, which connects the parasitophorous vacuole to the host plasma membrane and is involved in protein transport and trafficking. We have identified a novel subtelomeric gene famil...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.2126104
更新日期:2004-06-01 00:00:00
abstract::Mutations in the human AIRE gene (hAIRE) result in the development of an autoimmune disease named APECED (autoimmune polyendocrinopathy candidiasis ectodermal dystrophy; OMIM 240300). Previously, we have cloned hAIRE and shown that it codes for a putative transcription-associated factor. Here we report the cloning and...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:1999-02-01 00:00:00
abstract::Analysis procedures are needed to extract useful information from the large amount of gene expression data that is becoming available. This work describes a set of analytical tools and their application to yeast cell cycle data. The components of our approach are (1) a similarity measure that reduces the number of fal...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:10.1101/gr.9.11.1106
更新日期:1999-11-01 00:00:00
abstract::We have identified previously a putative tumor suppressor gene (TSG) locus at human chromosome (hchr) 7q31 showing that it is altered in a variety of human epithelial tumors. To determine whether this TSG is conserved in mice, we studied loss of heterozygosity (LOH) in chemically induced mouse liver adenomas. The LOH ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.11.1070
更新日期:1996-11-01 00:00:00
abstract::The in vitro cloning of DNA molecules traditionally uses PCR amplification or site-specific restriction endonucleases to generate linear DNA inserts with defined termini and requires DNA ligase to covalently join those inserts to vectors with the corresponding ends. We have used the properties of Vaccinia DNA topoisom...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:1999-04-01 00:00:00
abstract::Evolutionary constraints on gene regulatory elements are poorly understood: Little is known about how the strength of transcription factor binding correlates with DNA sequence conservation, and whether transcription factor binding sites can evolve rapidly while retaining their function. Here we use the model of the NF...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6490707
更新日期:2007-09-01 00:00:00
abstract::Little is known about novel genetic elements that drove the emergence of anthropoid primates. We exploited the sequencing of the marmoset genome to identify 23,849 anthropoid-specific constrained (ASC) regions and confirmed their robust functional signatures. Of the ASC base pairs, 99.7% were noncoding, suggesting tha...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.168963.113
更新日期:2014-09-01 00:00:00
abstract::High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2(20) concentration ra...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.121095.111
更新日期:2011-09-01 00:00:00
abstract::We have developed the CADLIVE (Computer-Aided Design of LIVing systEms) Simulator that provided a rule-based automatic way to convert biochemical network maps into dynamic models, which enables simulating their dynamics without going through all of the reactions down to the details of exact kinetic parameters. The sim...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3463705
更新日期:2005-04-01 00:00:00
abstract::Mosaic mutations present in the germline have important implications for reproductive risk and disease transmission. We previously demonstrated a phenomenon occurring in the male germline, whereby specific mutations arising spontaneously in stem cells (spermatogonia) lead to clonal expansion, resulting in elevated mut...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.239186.118
更新日期:2018-12-01 00:00:00
abstract::The vomeronasal system of mice is thought to be specialized in the detection of pheromones. Two multigene families have been identified that encode proteins with seven putative transmembrane domains and that are expressed selectively in subsets of neurons of the vomeronasal organ. The products of these vomeronasal rec...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.10.12.1958
更新日期:2000-12-01 00:00:00
abstract::Genomic imprinting is a developmentally important mechanism that involves both differential DNA methylation and allelic histone modifications. Through detailed comparative characterization, a large imprinted domain mapping to chromosome 7q21 in humans and proximal chromosome 6 in mice was redefined. This domain is org...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.077115.108
更新日期:2008-08-01 00:00:00
abstract::Aberrant DNA methylation (DNAm) was first linked to cancer over 25 yr ago. Since then, many studies have associated hypermethylation of tumor suppressor genes and hypomethylation of oncogenes to the tumorigenic process. However, most of these studies have been limited to the analysis of promoters and CpG islands (CGIs...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.109678.110
更新日期:2011-04-01 00:00:00
abstract::RNA-seq protocols that focus on transcript termini are well suited for applications in which template quantity is limiting. Here we show that, when applied to end-sequencing data, analytical methods designed for global RNA-seq produce computational artifacts. To remedy this, we created the End Sequence Analysis Toolki...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.207902.116
更新日期:2016-10-01 00:00:00
abstract::Identifying genes in the genomic context is central to a cell's ability to interpret the genome. Yet, in general, the signals used to define eukaryotic genes are poorly described. Here, we derived simple classifiers that identify where transcription will initiate and terminate using nucleic acid sequence features dete...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.164327.113
更新日期:2014-01-01 00:00:00
abstract::Metagenomic projects generate short, overlapping fragments of DNA sequence, each deriving from a different individual. We report a new method for inferring the scaled mutation rate, theta = 2Neu, and the scaled exponential growth rate, R = Ner, from the site-frequency spectrum of these data while accounting for sequen...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5431206
更新日期:2006-10-01 00:00:00
abstract::Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have charac...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.151472.112
更新日期:2013-06-01 00:00:00
abstract::Previous approaches to mutation detection in mRNA from the neurofibromatosis 1 (NF1) locus have required the PCR amplification of five or more overlapping cDNA segments to screen the entire 8.5-kb open reading frame (ORF). Systematically, these assays do not detect deletions that span the region of overlap (usually 1-...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.1.58
更新日期:1996-01-01 00:00:00
abstract::In this study we quantify the features of meiotic recombination on the long arm of human chromosome 21. We constructed a 67. 3-centimorgan (cM) high-resolution, comprehensive, and accurate genetic linkage map of chromosome 21q using 187 highly polymorphic markers covering almost the entire long arm; 46 loci, consistin...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.138100
更新日期:2000-09-01 00:00:00
abstract::We have performed detrended DNA walks on whole prokaryotic genomes, on noncoding sequences and, separately, on each position in codons of coding sequences. Our method enables us to distinguish between the mutational pressure associated with replication and the mutational pressure associated with transcription and othe...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:1999-05-01 00:00:00
abstract::A contiguous high-resolution map of 44 loci from a 35-Mb portion of the distal region of the long arm of human chromosome 5, q21-q35, was produced using radiation hybrid (RH) mapping in conjunction with a natural deletion mapping panel. The map includes 30 genes, four sequence-tagged site (STS) loci, and 10 DNA marker...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.7.628
更新日期:1996-07-01 00:00:00
abstract::Despite much research, our understanding of the architecture and cis-regulatory elements of human promoters is still lacking. Here, we devised a high-throughput assay to quantify the activity of approximately 15,000 fully designed sequences that we integrated and expressed from a fixed location within the human genome...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.236075.118
更新日期:2019-02-01 00:00:00
abstract::Organisms with large genomes contain vast amounts of repetitive DNA sequences, much of which is composed of retrotransposons. Amplification of retrotransposons has been postulated to be a major mechanism increasing genome size and leading to "genomic obesity." To gain insights into the relation between retrotransposon...
journal_title:Genome research
pub_type: 评论,杂志文章
doi:10.1101/gr.10.7.908
更新日期:2000-07-01 00:00:00