Comparative genomics of the Archaea (Euryarchaeota): evolution of conserved protein families, the stable core, and the variable shell.

Abstract:

:Comparative analysis of the protein sequences encoded in the four euryarchaeal species whose genomes have been sequenced completely (Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, and Pyrococcus horikoshii) revealed 1326 orthologous sets, of which 543 are represented in all four species. The proteins that belong to these conserved euryarchaeal families comprise 31%-35% of the gene complement and may be considered the evolutionarily stable core of the archaeal genomes. The core gene set includes the great majority of genes coding for proteins involved in genome replication and expression, but only a relatively small subset of metabolic functions. For many gene families that are conserved in all euryarchaea, previously undetected orthologs in bacteria and eukaryotes were identified. A number of euryarchaeal synapomorphies (unique shared characters) were identified; these are protein families that possess sequence signatures or domain architectures that are conserved in all euryarchaea but are not found in bacteria or eukaryotes. In addition, euryarchaea-specific expansions of several protein and domain families were detected. In terms of their apparent phylogenetic affinities, the archaeal protein families split into bacterial and eukaryotic families. The majority of the proteins that have only eukaryotic orthologs or show the greatest similarity to their eukaryotic counterparts belong to the core set. The families of euryarchaeal genes that are conserved in only two or three species constitute a relatively mobile component of the genomes whose evolution should have involved multiple events of lineage-specific gene loss and horizontal gene transfer. Frequently these proteins have detectable orthologs only in bacteria or show the greatest similarity to the bacterial homologs, which might suggest a significant role of horizontal gene transfer from bacteria in the evolution of the euryarchaeota.

journal_name

Genome Res

journal_title

Genome research

authors

Makarova KS,Aravind L,Galperin MY,Grishin NV,Tatusov RL,Wolf YI,Koonin EV

subject

Has Abstract

pub_date

1999-07-01 00:00:00

pages

608-28

issue

7

eissn

1088-9051

issn

1549-5469

journal_volume

9

pub_type

杂志文章
  • The region surrounding the PKD1 gene: a 700-kb P1 contig from a YAC-deficient interval.

    abstract::As part of an effort to identify the gene responsible for the predominant form of polycystic kidney disease (PKD1), we used a gridded human P1 library for contig assembly. The interval of interest, a 700-kb segment on chromosome 16p13.3, can be physically delineated by the genetic markers D16S125 and D16S84 and chromo...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.6.515

    authors: Dackowski WR,Connors TD,Bowe AE,Stanton V Jr,Housman D,Doggett NA,Landes GM,Klinger KW

    更新日期:1996-06-01 00:00:00

  • CENPT bridges adjacent CENPA nucleosomes on young human α-satellite dimers.

    abstract::Nucleosomes containing the CenH3 (CENPA or CENP-A) histone variant replace H3 nucleosomes at centromeres to provide a foundation for kinetochore assembly. CENPA nucleosomes are part of the constitutive centromere associated network (CCAN) that forms the inner kinetochore on which outer kinetochore proteins assemble. T...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.204784.116

    authors: Thakur J,Henikoff S

    更新日期:2016-09-01 00:00:00

  • GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.

    abstract::Current methods struggle to reconstruct and visualize the genomic relationships of large numbers of bacterial genomes. GrapeTree facilitates the analyses of large numbers of allelic profiles by a static "GrapeTree Layout" algorithm that supports interactive visualizations of large trees within a web browser window. Gr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.232397.117

    authors: Zhou Z,Alikhan NF,Sergeant MJ,Luhmann N,Vaz C,Francisco AP,Carriço JA,Achtman M

    更新日期:2018-09-01 00:00:00

  • Complex genomic rearrangements lead to novel primate gene function.

    abstract::Orthologous genes that maintain a single-copy status in a broad range of species may indicate a selection against gene duplication. If this is the case, then duplicates of such genes that do survive may have escaped the dosage control by rapid and sizable changes in their function. To test this hypothesis and to devel...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3266405

    authors: Ciccarelli FD,von Mering C,Suyama M,Harrington ED,Izaurralde E,Bork P

    更新日期:2005-03-01 00:00:00

  • Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data.

    abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.9.829

    authors: Aaronson JS,Eckman B,Blevins RA,Borkowski JA,Myerson J,Imran S,Elliston KO

    更新日期:1996-09-01 00:00:00

  • Comparative analysis of mammalian Y chromosomes illuminates ancestral structure and lineage-specific evolution.

    abstract::Although more than thirty mammalian genomes have been sequenced to draft quality, very few of these include the Y chromosome. This has limited our understanding of the evolutionary dynamics of gene persistence and loss, our ability to identify conserved regulatory elements, as well our knowledge of the extent to which...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.154286.112

    authors: Li G,Davis BW,Raudsepp T,Pearks Wilkerson AJ,Mason VC,Ferguson-Smith M,O'Brien PC,Waters PD,Murphy WJ

    更新日期:2013-09-01 00:00:00

  • Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors.

    abstract::Intra-tumor heterogeneity poses substantial challenges for cancer treatment. A tumor's composition can be deduced by reconstructing its mutational history. Central to current approaches is the infinite sites assumption that every genomic position can only mutate once over the lifetime of a tumor. The validity of this ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.220707.117

    authors: Kuipers J,Jahn K,Raphael BJ,Beerenwinkel N

    更新日期:2017-11-01 00:00:00

  • A non-EST-based method for exon-skipping prediction.

    abstract::It is estimated that between 35% and 74% of all human genes can undergo alternative splicing. Currently, the most efficient methods for large-scale detection of alternative splicing use expressed sequence tags (ESTs) or microarray analysis. As these methods merely sample the transcriptome, splice variants that do not ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2572604

    authors: Sorek R,Shemesh R,Cohen Y,Basechess O,Ast G,Shamir R

    更新日期:2004-08-01 00:00:00

  • A complexity reduction algorithm for analysis and annotation of large genomic sequences.

    abstract::DNA is a universal language encrypted with biological instruction for life. In higher organisms, the genetic information is preserved predominantly in an organized exon/intron structure. When a gene is expressed, the exons are spliced together to form the transcript for protein synthesis. We have developed a complexit...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.313703

    authors: Chuang TJ,Lin WC,Lee HC,Wang CW,Hsiao KL,Wang ZH,Shieh D,Lin SC,Ch'ang LY

    更新日期:2003-02-01 00:00:00

  • Introgression maintains the genetic integrity of the mating-type determining chromosome of the fungus Neurospora tetrasperma.

    abstract::Genome evolution is driven by a complex interplay of factors, including selection, recombination, and introgression. The regions determining sexual identity are particularly dynamic parts of eukaryotic genomes that are prone to molecular degeneration associated with suppressed recombination. In the fungus Neurospora t...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.197244.115

    authors: Corcoran P,Anderson JL,Jacobson DJ,Sun Y,Ni P,Lascoux M,Johannesson H

    更新日期:2016-04-01 00:00:00

  • The Ensembl automatic gene annotation system.

    abstract::As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, an...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1858004

    authors: Curwen V,Eyras E,Andrews TD,Clarke L,Mongin E,Searle SM,Clamp M

    更新日期:2004-05-01 00:00:00

  • Ancestry-agnostic estimation of DNA sample contamination from sequence reads.

    abstract::Detecting and estimating DNA sample contamination are important steps to ensure high-quality genotype calls and reliable downstream analysis. Existing methods rely on population allele frequency information for accurate estimation of contamination rates. Correctly specifying population allele frequencies for each indi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.246934.118

    authors: Zhang F,Flickinger M,Taliun SAG,InPSYght Psychiatric Genetics Consortium.,Abecasis GR,Scott LJ,McCaroll SA,Pato CN,Boehnke M,Kang HM

    更新日期:2020-02-01 00:00:00

  • CRISPR RNAs trigger innate immune responses in human cells.

    abstract::Here, we report that CRISPR guide RNAs (gRNAs) with a 5'-triphosphate group (5'-ppp gRNAs) produced via in vitro transcription trigger RNA-sensing innate immune responses in human and murine cells, leading to cytotoxicity. 5'-ppp gRNAs in the cytosol are recognized by DDX58, which in turn activates type I interferon r...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.231936.117

    authors: Kim S,Koo T,Jee HG,Cho HY,Lee G,Lim DG,Shin HS,Kim JS

    更新日期:2018-02-22 00:00:00

  • Principled multi-omic analysis reveals gene regulatory mechanisms of phenotype variation.

    abstract::Recent studies have analyzed large-scale data sets of gene expression to identify genes associated with interindividual variation in phenotypes ranging from cancer subtypes to drug sensitivity, promising new avenues of research in personalized medicine. However, gene expression data alone is limited in its ability to ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.227066.117

    authors: Hanson C,Cairns J,Wang L,Sinha S

    更新日期:2018-08-01 00:00:00

  • Improved discovery of genetic interactions using CRISPRiSeq across multiple environments.

    abstract::Large-scale genetic interaction (GI) screens in yeast have been invaluable for our understanding of molecular systems biology and for characterizing novel gene function. Owing in part to the high costs and long experiment times required, a preponderance of GI data has been generated in a single environmental condition...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.246603.118

    authors: Jaffe M,Dziulko A,Smith JD,St Onge RP,Levy SF,Sherlock G

    更新日期:2019-04-01 00:00:00

  • Optical mapping of BAC clones from the human Y chromosome DAZ locus.

    abstract::The accurate mapping of clones derived from genomic regions containing complex arrangements of repeated elements presents special problems for DNA sequencers. Recent advances in the automation of optical mapping have enabled us to map a set of 16 BAC clones derived from the DAZ locus of the human Y chromosome long arm...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.112100

    authors: Giacalone J,Delobette S,Gibaja V,Ni L,Skiadas Y,Qi R,Edington J,Lai Z,Gebauer D,Zhao H,Anantharaman T,Mishra B,Brown LG,Saxena R,Page DC,Schwartz DC

    更新日期:2000-09-01 00:00:00

  • HERC2 rs12913832 modulates human pigmentation by attenuating chromatin-loop formation between a long-range enhancer and the OCA2 promoter.

    abstract::Pigmentation of skin, eye, and hair reflects some of the most evident common phenotypes in humans. Several candidate genes for human pigmentation are identified. The SNP rs12913832 has strong statistical association with human pigmentation. It is located within an intron of the nonpigment gene HERC2, 21 kb upstream of...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.128652.111

    authors: Visser M,Kayser M,Palstra RJ

    更新日期:2012-03-01 00:00:00

  • Wolbachia genome integrated in an insect chromosome: evolution and fate of laterally transferred endosymbiont genes.

    abstract::Recent accumulation of microbial genome data has demonstrated that lateral gene transfers constitute an important and universal evolutionary process in prokaryotes, while those in multicellular eukaryotes are still regarded as unusual, except for endosymbiotic gene transfers from mitochondria and plastids. Here we tho...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7144908

    authors: Nikoh N,Tanaka K,Shibata F,Kondo N,Hizume M,Shimada M,Fukatsu T

    更新日期:2008-02-01 00:00:00

  • Integrated mapping, chromosomal sequencing and sequence analysis of Cryptosporidium parvum.

    abstract::The apicomplexan Cryptosporidium parvum is one of the most prevalent protozoan parasites of humans. We report the physical mapping of the genome of the Iowa isolate, sequencing and analysis of chromosome 6, and approximately 0.9 Mbp of sequence sampled from the remainder of the genome. To construct a robust physical m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1555203

    authors: Bankier AT,Spriggs HF,Fartmann B,Konfortov BA,Madera M,Vogel C,Teichmann SA,Ivens A,Dear PH

    更新日期:2003-08-01 00:00:00

  • Massive turnover of functional sequence in human and other mammalian genomes.

    abstract::Despite the availability of dozens of animal genome sequences, two key questions remain unanswered: First, what fraction of any species' genome confers biological function, and second, are apparent differences in organismal complexity reflected in an objective measure of genomic complexity? Here, we address both quest...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.108795.110

    authors: Meader S,Ponting CP,Lunter G

    更新日期:2010-10-01 00:00:00

  • A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction.

    abstract::The representation and discovery of transcription factor (TF) sequence binding specificities is critical for understanding gene regulatory networks and interpreting the impact of disease-associated noncoding genetic variants. We present a novel TF binding motif representation, the k-mer set memory (KSM), which consist...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.226852.117

    authors: Guo Y,Tian K,Zeng H,Guo X,Gifford DK

    更新日期:2018-06-01 00:00:00

  • Yeast genetic interaction screen of human genes associated with amyotrophic lateral sclerosis: identification of MAP2K5 kinase as a potential drug target.

    abstract::To understand disease mechanisms, a large-scale analysis of human-yeast genetic interactions was performed. Of 1305 human disease genes assayed, 20 genes exhibited strong toxicity in yeast. Human-yeast genetic interactions were identified by en masse transformation of the human disease genes into a pool of 4653 homozy...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.211649.116

    authors: Jo M,Chung AY,Yachie N,Seo M,Jeon H,Nam Y,Seo Y,Kim E,Zhong Q,Vidal M,Park HC,Roth FP,Suk K

    更新日期:2017-09-01 00:00:00

  • Reconstructing complex regions of genomes using long-read sequencing technology.

    abstract::Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger s...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.168450.113

    authors: Huddleston J,Ranade S,Malig M,Antonacci F,Chaisson M,Hon L,Sudmant PH,Graves TA,Alkan C,Dennis MY,Wilson RK,Turner SW,Korlach J,Eichler EE

    更新日期:2014-04-01 00:00:00

  • Computational identification of operons in microbial genomes.

    abstract::By applying graph representations to biochemical pathways, a new computational pipeline is proposed to find potential operons in microbial genomes. The algorithm relies on the fact that enzyme genes in operons tend to catalyze successive reactions in metabolic pathways. We applied this algorithm to 42 microbial genome...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.200602

    authors: Zheng Y,Szustakowski JD,Fortnow L,Roberts RJ,Kasif S

    更新日期:2002-08-01 00:00:00

  • Multimeric threading-based prediction of protein-protein interactions on a genomic scale: application to the Saccharomyces cerevisiae proteome.

    abstract::MULTIPROSPECTOR, a multimeric threading algorithm for the prediction of protein-protein interactions, is applied to the genome of Saccharomyces cerevisiae. Each possible pairwise interaction among more than 6000 encoded proteins is evaluated against a dimer database of 768 complex structures by using a confidence esti...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1145203

    authors: Lu L,Arakaki AK,Lu H,Skolnick J

    更新日期:2003-06-01 00:00:00

  • Dynamic building of a BAC clone tiling path for the Rat Genome Sequencing Project.

    abstract::CLONEPICKER is a software pipeline that integrates sequence data with BAC clone fingerprints to dynamically select a minimal overlapping clone set covering the whole genome. In the Rat Genome Sequencing Project (RGSP), a hybrid strategy of "clone by clone" and "whole genome shotgun" approaches was used to maximize the...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2171704

    authors: Chen R,Sodergren E,Weinstock GM,Gibbs RA

    更新日期:2004-04-01 00:00:00

  • Sequential ChIP-bisulfite sequencing enables direct genome-scale investigation of chromatin and DNA methylation cross-talk.

    abstract::Cross-talk between DNA methylation and histone modifications drives the establishment of composite epigenetic signatures and is traditionally studied using correlative rather than direct approaches. Here, we present sequential ChIP-bisulfite-sequencing (ChIP-BS-seq) as an approach to quantitatively assess DNA methylat...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.133728.111

    authors: Brinkman AB,Gu H,Bartels SJ,Zhang Y,Matarese F,Simmer F,Marks H,Bock C,Gnirke A,Meissner A,Stunnenberg HG

    更新日期:2012-06-01 00:00:00

  • DIG-seq: a genome-wide CRISPR off-target profiling method using chromatin DNA.

    abstract::To investigate whether and how CRISPR-Cas9 on-target and off-target activities are affected by chromatin in eukaryotic cells, we first identified a series of identical endogenous DNA sequences present in both open and closed chromatin regions and then measured mutation frequencies at these sites in human cells using C...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.236620.118

    authors: Kim D,Kim JS

    更新日期:2018-12-01 00:00:00

  • Dynamic changes in replication timing and gene expression during lineage specification of human pluripotent stem cells.

    abstract::Duplication of the genome in mammalian cells occurs in a defined temporal order referred to as its replication-timing (RT) program. RT changes dynamically during development, regulated in units of 400-800 kb referred to as replication domains (RDs). Changes in RT are generally coordinated with transcriptional competen...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.187989.114

    authors: Rivera-Mulia JC,Buckley Q,Sasaki T,Zimmerman J,Didier RA,Nazor K,Loring JF,Lian Z,Weissman S,Robins AJ,Schulz TC,Menendez L,Kulik MJ,Dalton S,Gabr H,Kahveci T,Gilbert DM

    更新日期:2015-08-01 00:00:00

  • Population genetic inference from genomic sequence variation.

    abstract::Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are curren...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:10.1101/gr.079509.108

    authors: Pool JE,Hellmann I,Jensen JD,Nielsen R

    更新日期:2010-03-01 00:00:00