Abstract:
Comparative analysis of the protein sequences encoded in the four euryarchaeal species whose genomes have been sequenced completely (Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, and Pyrococcus horikoshii) revealed 1326 orthologous sets, of which 543 are represented in all four species. The proteins that belong to these conserved euryarchaeal families comprise 31%-35% of the gene complement and may be considered the evolutionarily stable core of the archaeal genomes. The core gene set includes the great majority of genes coding for proteins involved in genome replication and expression, but only a relatively small subset of metabolic functions. For many gene families that are conserved in all euryarchaea, previously undetected orthologs in bacteria and eukaryotes were identified. A number of euryarchaeal synapomorphies (unique shared characters) were identified; these are protein families that possess sequence signatures or domain architectures that are conserved in all euryarchaea but are not found in bacteria or eukaryotes. In addition, euryarchaea-specific expansions of several protein and domain families were detected. In terms of their apparent phylogenetic affinities, the archaeal protein families split into bacterial and eukaryotic families. The majority of the proteins that have only eukaryotic orthologs or show the greatest similarity to their eukaryotic counterparts belong to the core set. The families of euryarchaeal genes that are conserved in only two or three species constitute a relatively mobile component of the genomes whose evolution should have involved multiple events of lineage-specific gene loss and horizontal gene transfer. Frequently these proteins have detectable orthologs only in bacteria or show the greatest similarity to the bacterial homologs, which might suggest a significant role of horizontal gene transfer from bacteria in the evolution of the euryarchaeota.
journal_name: Genome Res
journal_title: Genome research
authors: Makarova KS, Aravind L, Galperin MY, Grishin NV, Tatusov RL, Wolf YI, Koonin EV
subject: Has Abstract
pub_date: 1999-07-01 00:00:00
pages: 608-28
issue: 7
eissn: 1088-9051
issn: 1549-5469
journal_volume: 9
pub_type: Journal article
Related literature
GENOME RESEARCH literature collection
abstract::As part of an effort to identify the gene responsible for the predominant form of polycystic kidney disease (PKD1), we used a gridded human P1 library for contig assembly. The interval of interest, a 700-kb segment on chromosome 16p13.3, can be physically delineated by the genetic markers D16S125 and D16S84 and chromo...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6.6.515
updated:1996-06-01 00:00:00
abstract::Nucleosomes containing the CenH3 (CENPA or CENP-A) histone variant replace H3 nucleosomes at centromeres to provide a foundation for kinetochore assembly. CENPA nucleosomes are part of the constitutive centromere associated network (CCAN) that forms the inner kinetochore on which outer kinetochore proteins assemble. T...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.204784.116
updated:2016-09-01 00:00:00
abstract::Current methods struggle to reconstruct and visualize the genomic relationships of large numbers of bacterial genomes. GrapeTree facilitates the analyses of large numbers of allelic profiles by a static "GrapeTree Layout" algorithm that supports interactive visualizations of large trees within a web browser window. Gr...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.232397.117
updated:2018-09-01 00:00:00
abstract::Orthologous genes that maintain a single-copy status in a broad range of species may indicate a selection against gene duplication. If this is the case, then duplicates of such genes that do survive may have escaped the dosage control by rapid and sizable changes in their function. To test this hypothesis and to devel...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.3266405
updated:2005-03-01 00:00:00
abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.6.9.829
updated:1996-09-01 00:00:00
abstract::Although more than thirty mammalian genomes have been sequenced to draft quality, very few of these include the Y chromosome. This has limited our understanding of the evolutionary dynamics of gene persistence and loss, our ability to identify conserved regulatory elements, as well our knowledge of the extent to which...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.154286.112
updated:2013-09-01 00:00:00
abstract::Intra-tumor heterogeneity poses substantial challenges for cancer treatment. A tumor's composition can be deduced by reconstructing its mutational history. Central to current approaches is the infinite sites assumption that every genomic position can only mutate once over the lifetime of a tumor. The validity of this ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.220707.117
updated:2017-11-01 00:00:00
abstract::It is estimated that between 35% and 74% of all human genes can undergo alternative splicing. Currently, the most efficient methods for large-scale detection of alternative splicing use expressed sequence tags (ESTs) or microarray analysis. As these methods merely sample the transcriptome, splice variants that do not ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.2572604
updated:2004-08-01 00:00:00
abstract::DNA is a universal language encrypted with biological instruction for life. In higher organisms, the genetic information is preserved predominantly in an organized exon/intron structure. When a gene is expressed, the exons are spliced together to form the transcript for protein synthesis. We have developed a complexit...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.313703
updated:2003-02-01 00:00:00
abstract::Genome evolution is driven by a complex interplay of factors, including selection, recombination, and introgression. The regions determining sexual identity are particularly dynamic parts of eukaryotic genomes that are prone to molecular degeneration associated with suppressed recombination. In the fungus Neurospora t...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.197244.115
updated:2016-04-01 00:00:00
abstract::As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, an...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.1858004
updated:2004-05-01 00:00:00
abstract::Detecting and estimating DNA sample contamination are important steps to ensure high-quality genotype calls and reliable downstream analysis. Existing methods rely on population allele frequency information for accurate estimation of contamination rates. Correctly specifying population allele frequencies for each indi...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.246934.118
updated:2020-02-01 00:00:00
abstract::Here, we report that CRISPR guide RNAs (gRNAs) with a 5'-triphosphate group (5'-ppp gRNAs) produced via in vitro transcription trigger RNA-sensing innate immune responses in human and murine cells, leading to cytotoxicity. 5'-ppp gRNAs in the cytosol are recognized by DDX58, which in turn activates type I interferon r...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.231936.117
updated:2018-02-22 00:00:00
abstract::Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to ...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.227066.117
updated:2018-08-01 00:00:00
abstract::Large-scale genetic interaction (GI) screens in yeast have been invaluable for our understanding of molecular systems biology and for characterizing novel gene function. Owing in part to the high costs and long experiment times required, a preponderance of GI data has been generated in a single environmental condition...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.246603.118
updated:2019-04-01 00:00:00
abstract::The accurate mapping of clones derived from genomic regions containing complex arrangements of repeated elements presents special problems for DNA sequencers. Recent advances in the automation of optical mapping have enabled us to map a set of 16 BAC clones derived from the DAZ locus of the human Y chromosome long arm...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.112100
updated:2000-09-01 00:00:00
abstract::Pigmentation of skin, eye, and hair reflects some of the most evident common phenotypes in humans. Several candidate genes for human pigmentation are identified. The SNP rs12913832 has strong statistical association with human pigmentation. It is located within an intron of the nonpigment gene HERC2, 21 kb upstream of...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.128652.111
updated:2012-03-01 00:00:00
abstract::Recent accumulation of microbial genome data has demonstrated that lateral gene transfers constitute an important and universal evolutionary process in prokaryotes, while those in multicellular eukaryotes are still regarded as unusual, except for endosymbiotic gene transfers from mitochondria and plastids. Here we tho...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.7144908
updated:2008-02-01 00:00:00
abstract::The apicomplexan Cryptosporidium parvum is one of the most prevalent protozoan parasites of humans. We report the physical mapping of the genome of the Iowa isolate, sequencing and analysis of chromosome 6, and approximately 0.9 Mbp of sequence sampled from the remainder of the genome. To construct a robust physical m...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.1555203
updated:2003-08-01 00:00:00
abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.108795.110
updated:2010-10-01 00:00:00
abstract::The representation and discovery of transcription factor (TF) sequence binding specificities is critical for understanding gene regulatory networks and interpreting the impact of disease-associated noncoding genetic variants. We present a novel TF binding motif representation, the k-mer set memory (KSM), which consist...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.226852.117
updated:2018-06-01 00:00:00
abstract::To understand disease mechanisms, a large-scale analysis of human-yeast genetic interactions was performed. Of 1305 human disease genes assayed, 20 genes exhibited strong toxicity in yeast. Human-yeast genetic interactions were identified by en masse transformation of the human disease genes into a pool of 4653 homozy...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.211649.116
updated:2017-09-01 00:00:00
abstract::Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger s...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.168450.113
updated:2014-04-01 00:00:00
abstract::By applying graph representations to biochemical pathways, a new computational pipeline is proposed to find potential operons in microbial genomes. The algorithm relies on the fact that enzyme genes in operons tend to catalyze successive reactions in metabolic pathways. We applied this algorithm to 42 microbial genome...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.200602
updated:2002-08-01 00:00:00
abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.1145203
updated:2003-06-01 00:00:00
abstract::CLONEPICKER is a software pipeline that integrates sequence data with BAC clone fingerprints to dynamically select a minimal overlapping clone set covering the whole genome. In the Rat Genome Sequencing Project (RGSP), a hybrid strategy of "clone by clone" and "whole genome shotgun" approaches was used to maximize the...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.2171704
updated:2004-04-01 00:00:00
abstract::Cross-talk between DNA methylation and histone modifications drives the establishment of composite epigenetic signatures and is traditionally studied using correlative rather than direct approaches. Here, we present sequential ChIP-bisulfite-sequencing (ChIP-BS-seq) as an approach to quantitatively assess DNA methylat...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.133728.111
updated:2012-06-01 00:00:00
abstract::To investigate whether and how CRISPR-Cas9 on-target and off-target activities are affected by chromatin in eukaryotic cells, we first identified a series of identical endogenous DNA sequences present in both open and closed chromatin regions and then measured mutation frequencies at these sites in human cells using C...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.236620.118
updated:2018-12-01 00:00:00
abstract::Duplication of the genome in mammalian cells occurs in a defined temporal order referred to as its replication-timing (RT) program. RT changes dynamically during development, regulated in units of 400-800 kb referred to as replication domains (RDs). Changes in RT are generally coordinated with transcriptional competen...
journal_title:Genome research
pub_type: Journal article
doi:10.1101/gr.187989.114
updated:2015-08-01 00:00:00
abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...
journal_title:Genome research
pub_type: Journal article, Review
doi:10.1101/gr.079509.108
updated:2010-03-01 00:00:00