Comparative genomics of gene-family size in closely related bacteria.

Abstract:

BACKGROUND:The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear. RESULTS:The relative content of paralogous genes in bacterial genomes increases with genome size, largely due to the expansion of gene family size in large genomes. Bacteria undergoing genome reduction display a parallel process of redundancy elimination, by which gene families are reduced to one or a few members. Gene family size is also influenced by sequence divergence and physiological function. Large gene families show wider sequence divergence, suggesting they are probably older, and certain functions (such as metabolite transport mechanisms) are overrepresented in large families. The size of a given gene family is remarkably similar in strains of the same species and in closely related species, suggesting that homologous gene families are vertically transmitted and depend little on horizontal gene transfer (HGT). CONCLUSIONS:The remarkable preservation of copy numbers in widely different ecotypes indicates a functional role for the different copies rather than simply a back-up role. When different genera are compared, the increase in phylogenetic distance and/or ecological specialization disrupts this preservation, albeit in a gradual manner and maintaining an overall similarity, which also supports this view. HGT can have an important role, however, in nonhomologous gene families, as exemplified by a comparison between saprophytic and enterohemorrhagic strains of Escherichia coli.

journal_name

Genome Biol

journal_title

Genome biology

authors

Pushker R,Mira A,Rodríguez-Valera F

doi

10.1186/gb-2004-5-4-r27

keywords:

subject

Has Abstract

pub_date

2004-01-01 00:00:00

pages

R27

issue

4

eissn

1474-7596

issn

1474-760X

pii

gb-2004-5-4-r27

journal_volume

5

pub_type

杂志文章
  • Domain atrophy creates rare cases of functional partial protein domains.

    abstract:BACKGROUND:Protein domains display a range of structural diversity, with numerous additions and deletions of secondary structural elements between related domains. We have observed a small number of cases of surprising large-scale deletions of core elements of structural domains. We propose a new concept called domain ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-015-0655-8

    authors: Prakash A,Bateman A

    更新日期:2015-04-30 00:00:00

  • Selection in the evolution of gene duplications.

    abstract:BACKGROUND:Gene duplications have a major role in the evolution of new biological functions. Theoretical studies often assume that a duplication per se is selectively neutral and that, following a duplication, one of the gene copies is freed from purifying (stabilizing) selection, which creates the potential for evolut...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2002-3-2-research0008

    authors: Kondrashov FA,Rogozin IB,Wolf YI,Koonin EV

    更新日期:2002-01-01 00:00:00

  • Demystifying "drop-outs" in single-cell UMI data.

    abstract::Many existing pipelines for scRNA-seq data apply pre-processing steps such as normalization or imputation to account for excessive zeros or "drop-outs." Here, we extensively analyze diverse UMI data sets to show that clustering should be the foremost step of the workflow. We observe that most drop-outs disappear once ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02096-y

    authors: Kim TH,Zhou X,Chen M

    更新日期:2020-08-06 00:00:00

  • Reconstruction of avian ancestral karyotypes reveals differences in the evolutionary history of macro- and microchromosomes.

    abstract:BACKGROUND:Reconstruction of ancestral karyotypes is critical for our understanding of genome evolution, allowing for the identification of the gross changes that shaped extant genomes. The identification of such changes and their time of occurrence can shed light on the biology of each species, clade and their evoluti...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1544-8

    authors: Damas J,Kim J,Farré M,Griffin DK,Larkin DM

    更新日期:2018-10-05 00:00:00

  • MicroPro: using metagenomic unmapped reads to provide insights into human microbiota and disease associations.

    abstract::We develop a metagenomic data analysis pipeline, MicroPro, that takes into account all reads from known and unknown microbial organisms and associates viruses with complex diseases. We utilize MicroPro to analyze four metagenomic datasets relating to colorectal cancer, type 2 diabetes, and liver cirrhosis and show tha...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1773-5

    authors: Zhu Z,Ren J,Michail S,Sun F

    更新日期:2019-08-06 00:00:00

  • Broad network-based predictability of Saccharomyces cerevisiae gene loss-of-function phenotypes.

    abstract::We demonstrate that loss-of-function yeast phenotypes are predictable by guilt-by-association in functional gene networks. Testing 1,102 loss-of-function phenotypes from genome-wide assays of yeast reveals predictability of diverse phenotypes, spanning cellular morphology, growth, metabolism, and quantitative cell sha...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-12-r258

    authors: McGary KL,Lee I,Marcotte EM

    更新日期:2007-01-01 00:00:00

  • SuperTranscripts: a data driven reference for analysis and visualisation of transcriptomes.

    abstract::Numerous methods have been developed to analyse RNA sequencing (RNA-seq) data, but most rely on the availability of a reference genome, making them unsuitable for non-model organisms. Here we present superTranscripts, a substitute for a reference genome, where each gene with multiple transcripts is represented by a si...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1284-1

    authors: Davidson NM,Hawkins ADK,Oshlack A

    更新日期:2017-08-04 00:00:00

  • AlphaBeta: computational inference of epimutation rates and spectra from high-throughput DNA methylation data in plants.

    abstract::Stochastic changes in DNA methylation (i.e., spontaneous epimutations) contribute to methylome diversity in plants. Here, we describe AlphaBeta, a computational method for estimating the precise rate of such stochastic events using pedigree-based DNA methylation data as input. We demonstrate how AlphaBeta can be emplo...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-020-02161-6

    authors: Shahryary Y,Symeonidi A,Hazarika RR,Denkena J,Mubeen T,Hofmeister B,van Gurp T,Colomé-Tatché M,Verhoeven KJF,Tuskan G,Schmitz RJ,Johannes F

    更新日期:2020-10-06 00:00:00

  • Having a BLAST with bioinformatics (and avoiding BLASTphemy).

    abstract::Searching for similarities between biological sequences is the principal means by which bioinformatics contributes to our understanding of biology. Of the various informatics tools developed to accomplish this task, the most widely used is BLAST, the basic local alignment search tool. This article discusses the princi...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2001-2-10-reviews2002

    authors: Pertsemlidis A,Fondon JW 3rd

    更新日期:2001-01-01 00:00:00

  • Expanded identification and characterization of mammalian circular RNAs.

    abstract:BACKGROUND:The recent reports of two circular RNAs (circRNAs) with strong potential to act as microRNA (miRNA) sponges suggest that circRNAs might play important roles in regulating gene expression. However, the global properties of circRNAs are not well understood. RESULTS:We developed a computational pipeline to ide...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-014-0409-z

    authors: Guo JU,Agarwal V,Guo H,Bartel DP

    更新日期:2014-07-29 00:00:00

  • Muscular expressions: profiling genes in complex tissues.

    abstract::Gene-expression profiling has yielded important information about simple systems, but complex tissues have not yet been widely profiled. Four recent studies of mammalian skeletal muscles have added to the catalogs of their gene expression differences, but have yet to lead to better understanding of the molecular proce...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2001-2-12-reviews1033

    authors: Hampson R,Hughes SM

    更新日期:2001-01-01 00:00:00

  • The three-dimensional genome organization of Drosophila melanogaster through data integration.

    abstract:BACKGROUND:Genome structures are dynamic and non-randomly organized in the nucleus of higher eukaryotes. To maximize the accuracy and coverage of three-dimensional genome structural models, it is important to integrate all available sources of experimental information about a genome's organization. It remains a major c...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1264-5

    authors: Li Q,Tjong H,Li X,Gong K,Zhou XJ,Chiolo I,Alber F

    更新日期:2017-07-31 00:00:00

  • Studying the microbiology of the indoor environment.

    abstract::The majority of people in the developed world spend more than 90% of their lives indoors. Here, we examine our understanding of the bacteria that co-inhabit our artificial world and how they might influence human health. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2013-14-2-202

    authors: Kelley ST,Gilbert JA

    更新日期:2013-02-28 00:00:00

  • The Dictyostelium genome encodes numerous RasGEFs with multiple biological roles.

    abstract:BACKGROUND:Dictyostelium discoideum is a eukaryote with a simple lifestyle and a relatively small genome whose sequence has been fully determined. It is widely used for studies on cell signaling, movement and multicellular development. Ras guanine-nucleotide exchange factors (RasGEFs) are the proteins that activate Ras...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2005-6-8-r68

    authors: Wilkins A,Szafranski K,Fraser DJ,Bakthavatsalam D,Müller R,Fisher PR,Glöckner G,Eichinger L,Noegel AA,Insall RH

    更新日期:2005-01-01 00:00:00

  • Modeling double strand break susceptibility to interrogate structural variation in cancer.

    abstract:BACKGROUND:Structural variants (SVs) are known to play important roles in a variety of cancers, but their origins and functional consequences are still poorly understood. Many SVs are thought to emerge from errors in the repair processes following DNA double strand breaks (DSBs). RESULTS:We used experimentally quantif...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-019-1635-1

    authors: Ballinger TJ,Bouwman BAM,Mirzazadeh R,Garnerone S,Crosetto N,Semple CA

    更新日期:2019-02-08 00:00:00

  • Single-cell profiling of lncRNAs in the developing human brain.

    abstract::Single-cell RNA-seq in samples from the human neocortex demonstrate that long noncoding RNAs (lncRNAs) are abundantly expressed in specific individual brain cells, despite being hard to detect in bulk samples. This result suggests that the lncRNAs might have important functions in specific cell types in the brain. ...

    journal_title:Genome biology

    pub_type: 评论,杂志文章

    doi:10.1186/s13059-016-0933-0

    authors: Ma Q,Chang HY

    更新日期:2016-04-14 00:00:00

  • Chipping away at major depressive disorder.

    abstract::An intriguing recent study examines the role of miR-1202, a glutamate receptor regulating microRNA, in regulating major depressive disorder. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-014-0421-3

    authors: Rucker JJ,McGuffin P

    更新日期:2014-07-26 00:00:00

  • Opening sequence: computational genomics in the era of high-throughput sequencing.

    abstract::A report on the 11th Cold Spring Harbor Laboratory/Wellcome Trust conference on Genome Informatics, Cold Spring Harbor Laboratories, New York, USA, November 2-5, 2011. ...

    journal_title:Genome biology

    pub_type:

    doi:10.1186/gb-2011-12-12-310

    authors: Chambers EV,Kindt AS,Semple CA

    更新日期:2011-12-28 00:00:00

  • Common gene expression strategies revealed by genome-wide analysis in yeast.

    abstract:BACKGROUND:Gene expression is a two-step synthesis process that ends with the necessary amount of each protein required to perform its function. Since the protein is the final product, the main focus of gene regulation should be centered on it. However, because mRNA is an intermediate step and the amounts of both mRNA ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2007-8-10-r222

    authors: García-Martínez J,González-Candelas F,Pérez-Ortín JE

    更新日期:2007-01-01 00:00:00

  • Divergence in cis-regulatory networks: taking the 'species' out of cross-species analysis.

    abstract::Many essential transcription factors have conserved roles in regulating biological programs, yet their genomic occupancy can diverge significantly. A new study demonstrates that such variations are primarily due to cis-regulatory sequences, rather than differences between the regulators or nuclear environments. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2008-9-11-240

    authors: Zinzen RP,Furlong EE

    更新日期:2008-01-01 00:00:00

  • Proteomic view of mitochondrial function.

    abstract::Genomic and proteomic studies have identified hundreds of proteins from mitochondria. A recent study has added a functional twist to these systematic approaches and identified novel mitochondrial modifiers and regulators. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2008-9-2-209

    authors: Dimmer KS,Rapaport D

    更新日期:2008-01-01 00:00:00

  • Permutation-validated principal components analysis of microarray data.

    abstract:BACKGROUND:In microarray data analysis, the comparison of gene-expression profiles with respect to different conditions and the selection of biologically interesting genes are crucial tasks. Multivariate statistical methods have been applied to analyze these large datasets. Less work has been published concerning the a...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2002-3-4-research0019

    authors: Landgrebe J,Wurst W,Welzl G

    更新日期:2002-01-01 00:00:00

  • The relationship between proteome size, structural disorder and organism complexity.

    abstract:BACKGROUND:Sequencing the genomes of the first few eukaryotes created the impression that gene number shows no correlation with organism complexity, often referred to as the G-value paradox. Several attempts have previously been made to resolve this paradox, citing multifunctionality of proteins, alternative splicing, ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2011-12-12-r120

    authors: Schad E,Tompa P,Hegyi H

    更新日期:2011-12-19 00:00:00

  • Leaf senescence--not just a 'wear and tear' phenomenon.

    abstract::A recent, genome-wide study shows that the transcriptional program underlying leaf senescence is active and complex, reflecting the activation of more than 2,000 genes in Arabidopsis, with gene products involved in a broad spectrum of regulatory, biochemical and cellular events. ...

    journal_title:Genome biology

    pub_type: 杂志文章,评审

    doi:10.1186/gb-2004-5-3-212

    authors: Gepstein S

    更新日期:2004-01-01 00:00:00

  • Discovery and functional prioritization of Parkinson's disease candidate genes from large-scale whole exome sequencing.

    abstract:BACKGROUND:Whole-exome sequencing (WES) has been successful in identifying genes that cause familial Parkinson's disease (PD). However, until now this approach has not been deployed to study large cohorts of unrelated participants. To discover rare PD susceptibility variants, we performed WES in 1148 unrelated cases an...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1147-9

    authors: Jansen IE,Ye H,Heetveld S,Lechler MC,Michels H,Seinstra RI,Lubbe SJ,Drouet V,Lesage S,Majounie E,Gibbs JR,Nalls MA,Ryten M,Botia JA,Vandrovcova J,Simon-Sanchez J,Castillo-Lizardo M,Rizzu P,Blauwendraat C,Chouhan AK

    更新日期:2017-01-30 00:00:00

  • Going beyond genetics to discover cancer targets.

    abstract::Two recent studies demonstrate the power of integrating tumor genotype information with epigenetic and proteomic studies to discover potential therapeutic targets in breast cancer. ...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-017-1238-7

    authors: Sandoval GJ,Hahn WC

    更新日期:2017-05-22 00:00:00

  • Visualization of pseudogenes in intracellular bacteria reveals the different tracks to gene destruction.

    abstract:BACKGROUND:Pseudogenes reveal ancestral gene functions. Some obligate intracellular bacteria, such as Mycobacterium leprae and Rickettsia spp., carry substantial fractions of pseudogenes. Until recently, horizontal gene transfers were considered to be rare events in obligate host-associated bacteria. RESULTS:We presen...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2008-9-2-r42

    authors: Fuxelius HH,Darby AC,Cho NH,Andersson SG

    更新日期:2008-01-01 00:00:00

  • PU.1 target genes undergo Tet2-coupled demethylation and DNMT3b-mediated methylation in monocyte-to-osteoclast differentiation.

    abstract:BACKGROUND:DNA methylation is a key epigenetic mechanism for driving and stabilizing cell-fate decisions. Local deposition and removal of DNA methylation are tightly coupled with transcription factor binding, although the relationship varies with the specific differentiation process. Conversion of monocytes to osteocla...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2013-14-9-r99

    authors: de la Rica L,Rodríguez-Ubreva J,García M,Islam AB,Urquiza JM,Hernando H,Christensen J,Helin K,Gómez-Vaquero C,Ballestar E

    更新日期:2013-01-01 00:00:00

  • Hepatic steatosis risk is partly driven by increased de novo lipogenesis following carbohydrate consumption.

    abstract:BACKGROUND:Diet is a major contributor to metabolic disease risk, but there is controversy as to whether increased incidences of diseases such as non-alcoholic fatty liver disease arise from consumption of saturated fats or free sugars. Here, we investigate whether a sub-set of triacylglycerols (TAGs) were associated w...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/s13059-018-1439-8

    authors: Sanders FWB,Acharjee A,Walker C,Marney L,Roberts LD,Imamura F,Jenkins B,Case J,Ray S,Virtue S,Vidal-Puig A,Kuh D,Hardy R,Allison M,Forouhi N,Murray AJ,Wareham N,Vacca M,Koulman A,Griffin JL

    更新日期:2018-06-20 00:00:00

  • Asymmetric relationships between proteins shape genome evolution.

    abstract:BACKGROUND:The relationships between proteins are often asymmetric: one protein (A) depends for its function on another protein (B), but the second protein does not depend on the first. In metabolic networks there are multiple pathways that converge into one central pathway. The enzymes in the converging pathways depen...

    journal_title:Genome biology

    pub_type: 杂志文章

    doi:10.1186/gb-2009-10-2-r19

    authors: Notebaart RA,Kensche PR,Huynen MA,Dutilh BE

    更新日期:2009-02-12 00:00:00