Unsupervised genome-wide recognition of local relationship patterns.

Abstract:

BACKGROUND:Phenomena such as incomplete lineage sorting, horizontal gene transfer, gene duplication and subsequent sub- and neo-functionalisation can result in distinct local phylogenetic relationships that are discordant with species phylogeny. In order to assess the possible biological roles for these subdivisions, they must first be identified and characterised, preferably on a large scale and in an automated fashion. RESULTS:We developed Saguaro, a combination of a Hidden Markov Model (HMM) and a Self Organising Map (SOM), to characterise local phylogenetic relationships among aligned sequences using cacti, matrices of pair-wise distance measures. While the HMM determines the genomic boundaries from aligned sequences, the SOM hypothesises new cacti in an unsupervised and iterative fashion based on the regions that were modelled least well by existing cacti. After testing the software on simulated data, we demonstrate the utility of Saguaro by testing two different data sets: (i) 181 Dengue virus strains, and (ii) 5 primate genomes. Saguaro identifies regions under lineage-specific constraint for the first set, and genomic segments that we attribute to incomplete lineage sorting in the second dataset. Intriguingly for the primate data, Saguaro also classified an additional ~3% of the genome as most incompatible with the expected species phylogeny. A substantial fraction of these regions was found to overlap genes associated with both the innate and adaptive immune systems. CONCLUSIONS:Saguaro detects distinct cacti describing local phylogenetic relationships without requiring any a priori hypotheses. We have successfully demonstrated Saguaro's utility with two contrasting data sets, one containing many members with short sequences (Dengue viral strains: n = 181, genome size = 10,700 nt), and the other with few members but complex genomes (related primate species: n = 5, genome size = 3 Gb), suggesting that the software is applicable to a wide variety of experimental populations. Saguaro is written in C++, runs on the Linux operating system, and can be downloaded from http://saguarogw.sourceforge.net/.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Zamani N,Russell P,Lantz H,Hoeppner MP,Meadows JR,Vijay N,Mauceli E,di Palma F,Lindblad-Toh K,Jern P,Grabherr MG

doi

10.1186/1471-2164-14-347

subject

Has Abstract

pub_date

2013-05-24 00:00:00

pages

347

issn

1471-2164

pii

1471-2164-14-347

journal_volume

14

pub_type

杂志文章
  • Effects of dietary physical or nutritional factors on morphology of rumen papillae and transcriptome changes in lactating dairy cows based on three different forage-based diets.

    abstract:BACKGROUND:Rumen epithelial tissue plays an important role in nutrient absorption and rumen health. However, whether forage quality and particle size impact the rumen epithelial morphology is unclear. The current study was conducted to elucidate the effects of forage quality and forage particle size on rumen epithelial...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3726-2

    authors: Wang B,Wang D,Wu X,Cai J,Liu M,Huang X,Wu J,Liu J,Guan L

    更新日期:2017-05-06 00:00:00

  • Genomic arrangement of salinity tolerance QTLs in salmonids: a comparative analysis of Atlantic salmon (Salmo salar) with Arctic charr (Salvelinus alpinus) and rainbow trout (Oncorhynchus mykiss).

    abstract:BACKGROUND:Quantitative trait locus (QTL) studies show that variation in salinity tolerance in Arctic charr and rainbow trout has a genetic basis, even though both these species have low to moderate salinity tolerance capacities. QTL were observed to localize to homologous linkage group segments within putative chromos...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-420

    authors: Norman JD,Robinson M,Glebe B,Ferguson MM,Danzmann RG

    更新日期:2012-08-24 00:00:00

  • The genome of the emerging barley pathogen Ramularia collo-cygni.

    abstract:BACKGROUND:Ramularia collo-cygni is a newly important, foliar fungal pathogen of barley that causes the disease Ramularia leaf spot. The fungus exhibits a prolonged endophytic growth stage before switching life habit to become an aggressive, necrotrophic pathogen that causes significant losses to green leaf area and he...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2928-3

    authors: McGrann GR,Andongabo A,Sjökvist E,Trivedi U,Dussart F,Kaczmarek M,Mackenzie A,Fountaine JM,Taylor JM,Paterson LJ,Gorniak K,Burnett F,Kanyuka K,Hammond-Kosack KE,Rudd JJ,Blaxter M,Havis ND

    更新日期:2016-08-09 00:00:00

  • Genome-wide investigation of calcium-dependent protein kinase gene family in pineapple: evolution and expression profiles during development and stress.

    abstract:BACKGROUND:Calcium-dependent protein kinase (CPK) is one of the main Ca2+ combined protein kinase that play significant roles in plant growth, development and response to multiple stresses. Despite an important member of the stress responsive gene family, little is known about the evolutionary history and expression pa...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6501-8

    authors: Zhang M,Liu Y,He Q,Chai M,Huang Y,Chen F,Wang X,Liu Y,Cai H,Qin Y

    更新日期:2020-01-23 00:00:00

  • Comparative genomic analysis reveals occurrence of genetic recombination in virulent Cryptosporidium hominis subtypes and telomeric gene duplications in Cryptosporidium parvum.

    abstract:BACKGROUND:Cryptosporidium hominis is a dominant species for human cryptosporidiosis. Within the species, IbA10G2 is the most virulent subtype responsible for all C. hominis-associated outbreaks in Europe and Australia, and is a dominant outbreak subtype in the United States. In recent yearsIaA28R4 is becoming a major ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1517-1

    authors: Guo Y,Tang K,Rowe LA,Li N,Roellig DM,Knipe K,Frace M,Yang C,Feng Y,Xiao L

    更新日期:2015-04-18 00:00:00

  • Global proteomic analysis of the oocyst/sporozoite of Toxoplasma gondii reveals commitment to a host-independent lifestyle.

    abstract:BACKGROUND:Toxoplasmosis is caused by the apicomplexan parasite Toxoplasma gondii and can be acquired either congenitally or via the oral route. In the latter case, transmission is mediated by two distinct invasive stages, i.e., bradyzoites residing in tissue cysts or sporozoites contained in environmentally resistant ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-183

    authors: Possenti A,Fratini F,Fantozzi L,Pozio E,Dubey JP,Ponzi M,Pizzi E,Spano F

    更新日期:2013-03-15 00:00:00

  • Hyper-expansion of large DNA segments in the genome of kuruma shrimp, Marsupenaeus japonicus.

    abstract:BACKGROUND:Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Ma...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-141

    authors: Koyama T,Asakawa S,Katagiri T,Shimizu A,Fagutao FF,Mavichak R,Santos MD,Fuji K,Sakamoto T,Kitakado T,Kondo H,Shimizu N,Aoki T,Hirono I

    更新日期:2010-02-26 00:00:00

  • GWAS and fine-mapping of livability and six disease traits in Holstein cattle.

    abstract:BACKGROUND:Health traits are of significant economic importance to the dairy industry due to their effects on milk production and associated treatment costs. Genome-wide association studies (GWAS) provide a means to identify associated genomic variants and thus reveal insights into the genetic architecture of complex t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-6461-z

    authors: Freebern E,Santos DJA,Fang L,Jiang J,Parker Gaddis KL,Liu GE,VanRaden PM,Maltecca C,Cole JB,Ma L

    更新日期:2020-01-13 00:00:00

  • Combinatorial control of temporal gene expression in the Drosophila wing by enhancers and core promoters.

    abstract:BACKGROUND:The transformation of a developing epithelium into an adult structure is a complex process, which often involves coordinated changes in cell proliferation, metabolism, adhesion, and shape. To identify genetic mechanisms that control epithelial differentiation, we analyzed the temporal patterns of gene expres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-498

    authors: O'Keefe DD,Thomas SR,Bolin K,Griggs E,Edgar BA,Buttitta LA

    更新日期:2012-09-20 00:00:00

  • Integrative phenotyping framework (iPF): integrative clustering of multiple omics data identifies novel lung disease subphenotypes.

    abstract:BACKGROUND:The increased multi-omics information on carefully phenotyped patients in studies of complex diseases requires novel methods for data integration. Unlike continuous intensity measurements from most omics data sets, phenome data contain clinical variables that are binary, ordinal and categorical. RESULTS:In ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2170-4

    authors: Kim S,Herazo-Maya JD,Kang DD,Juan-Guardela BM,Tedrow J,Martinez FJ,Sciurba FC,Tseng GC,Kaminski N

    更新日期:2015-11-11 00:00:00

  • Multivariate genome wide association and network analysis of subcortical imaging phenotypes in Alzheimer's disease.

    abstract:BACKGROUND:Genome-wide association studies (GWAS) have identified many individual genes associated with brain imaging quantitative traits (QTs) in Alzheimer's disease (AD). However single marker level association discovery may not be able to address the underlying biological interactions with disease mechanism. RESULT...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07282-7

    authors: Meng X,Li J,Zhang Q,Chen F,Bian C,Yao X,Yan J,Xu Z,Risacher SL,Saykin AJ,Liang H,Shen L,Alzheimer’s Disease Neuroimaging Initiative.

    更新日期:2020-12-29 00:00:00

  • Characterization of transcriptome dynamics during watermelon fruit development: sequencing, assembly, annotation and gene expression profiles.

    abstract:BACKGROUND:Cultivated watermelon [Citrullus lanatus (Thunb.) Matsum. & Nakai var. lanatus] is an important agriculture crop world-wide. The fruit of watermelon undergoes distinct stages of development with dramatic changes in its size, color, sweetness, texture and aroma. In order to better understand the genetic and m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-454

    authors: Guo S,Liu J,Zheng Y,Huang M,Zhang H,Gong G,He H,Ren Y,Zhong S,Fei Z,Xu Y

    更新日期:2011-09-21 00:00:00

  • Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

    abstract:BACKGROUND:Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearra...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-561

    authors: Jo YD,Choi Y,Kim DH,Kim BD,Kang BC

    更新日期:2014-07-04 00:00:00

  • De novo sequencing of tree peony (Paeonia suffruticosa) transcriptome to identify critical genes involved in flowering and floral organ development.

    abstract:BACKGROUND:Tree peony (Paeonia suffruticosa Andrews) is a globally famous ornamental flower, with large and colorful flowers and abundant flower types. However, a relatively short and uniform flowering period hinders the applications and production of ornamental tree peony. Unfortunately, the molecular mechanism of reg...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5857-0

    authors: Wang S,Gao J,Xue J,Xue Y,Li D,Guan Y,Zhang X

    更新日期:2019-07-11 00:00:00

  • The complete and fully assembled genome sequence of Aeromonas salmonicida subsp. pectinolytica and its comparative analysis with other Aeromonas species: investigation of the mobilome in environmental and pathogenic strains.

    abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4301-6

    authors: Pfeiffer F,Zamora-Lagos MA,Blettinger M,Yeroslaviz A,Dahl A,Gruber S,Habermann BH

    更新日期:2018-01-05 00:00:00

  • Investigation of regions impacting inbreeding depression and their association with the additive genetic effect for United States and Australia Jersey dairy cattle.

    abstract:BACKGROUND:Variation in environment, management practices, nutrition or selection objectives has led to a variety of different choices being made in the use of genetic material between countries. Differences in genome-level homozygosity between countries may give rise to regions that result in inbreeding depression to ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2001-7

    authors: Howard JT,Haile-Mariam M,Pryce JE,Maltecca C

    更新日期:2015-10-19 00:00:00

  • Genomic predictions combining SNP markers and copy number variations in Nellore cattle.

    abstract:BACKGROUND:Due to the advancement in high throughput technology, single nucleotide polymorphism (SNP) is routinely being incorporated along with phenotypic information into genetic evaluation. However, this approach often cannot achieve high accuracy for some complex traits. It is possible that SNP markers are not suff...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4787-6

    authors: Hay EHA,Utsunomiya YT,Xu L,Zhou Y,Neves HHR,Carvalheiro R,Bickhart DM,Ma L,Garcia JF,Liu GE

    更新日期:2018-06-05 00:00:00

  • Transcriptional analysis of abdominal fat in chickens divergently selected on bodyweight at two ages reveals novel mechanisms controlling adiposity: validating visceral adipose tissue as a dynamic endocrine and metabolic organ.

    abstract:BACKGROUND:Decades of intensive genetic selection in the domestic chicken (Gallus gallus domesticus) have enabled the remarkable rapid growth of today's broiler (meat-type) chickens. However, this enhanced growth rate was accompanied by several unfavorable traits (i.e., increased visceral fatness, leg weakness, and dis...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-4035-5

    authors: Resnyk CW,Carré W,Wang X,Porter TE,Simon J,Le Bihan-Duval E,Duclos MJ,Aggrey SE,Cogburn LA

    更新日期:2017-08-16 00:00:00

  • Colorectal cancer cell-derived microvesicles are enriched in cell cycle-related mRNAs that promote proliferation of endothelial cells.

    abstract:BACKGROUND:Various cancer cells, including those of colorectal cancer (CRC), release microvesicles (exosomes) into surrounding tissues and peripheral circulation. These microvesicles can mediate communication between cells and affect various tumor-related processes in their target cells. RESULTS:We present potential r...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-556

    authors: Hong BS,Cho JH,Kim H,Choi EJ,Rho S,Kim J,Kim JH,Choi DS,Kim YK,Hwang D,Gho YS

    更新日期:2009-11-25 00:00:00

  • A graph-theoretic approach for classification and structure prediction of transmembrane β-barrel proteins.

    abstract:BACKGROUND:Transmembrane β-barrel proteins are a special class of transmembrane proteins which play several key roles in human body and diseases. Due to experimental difficulties, the number of transmembrane β-barrel proteins with known structures is very small. Over the years, a number of learning-based methods have b...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-S2-S5

    authors: Tran Vdu T,Chassignet P,Sheikh S,Steyaert JM

    更新日期:2012-04-12 00:00:00

  • Highly sensitive amplicon-based transcript quantification by semiconductor sequencing.

    abstract:BACKGROUND:In clinical and basic research custom panels for transcript profiling are gaining importance because only project specific informative genes are interrogated. This approach reduces costs and complexity of data analysis and allows multiplexing of samples. Polymerase-chain-reaction (PCR) based TaqMan assays ha...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-565

    authors: Zhang JD,Schindler T,Küng E,Ebeling M,Certa U

    更新日期:2014-07-05 00:00:00

  • Estimating the total genome length of a metagenomic sample using k-mers.

    abstract:BACKGROUND:Metagenomic sequencing is a powerful technology for studying the mixture of microbes or the microbiomes on human and in the environment. One basic task of analyzing metagenomic data is to identify the component genomes in the community. This task is challenging due to the complexity of microbiome composition...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5467-x

    authors: Hua K,Zhang X

    更新日期:2019-04-04 00:00:00

  • Characterization of familial breast cancer in Saudi Arabia.

    abstract:BACKGROUND:The contribution of genetic factors to the development of breast cancer in the admixed and consanguineous population of the western region of Saudi Arabia is thought to be significant as the disease is early onset. The current protocols of continuous clinical follow-up of relatives of such patients are costl...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-16-S1-S3

    authors: Merdad A,Gari MA,Hussein S,Al-Khayat S,Tashkandi H,Al-Maghrabi J,Al-Thubaiti F,Hussein IR,Koumosani T,Shaer N,Chaudhary AG,Abuzenadah AM,Al-Qahtani MH,Dallol A

    更新日期:2015-01-01 00:00:00

  • Identification of small RNAs in Francisella tularensis.

    abstract:BACKGROUND:Regulation of bacterial gene expression by small RNAs (sRNAs) have proved to be important for many biological processes. Francisella tularensis is a highly pathogenic Gram-negative bacterium that causes the disease tularaemia in humans and animals. Relatively little is known about the regulatory networks exi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-625

    authors: Postic G,Frapy E,Dupuis M,Dubail I,Livny J,Charbit A,Meibom KL

    更新日期:2010-11-10 00:00:00

  • Nonlinear transcriptomic response to dietary fat intake in the small intestine of C57BL/6J mice.

    abstract:BACKGROUND:A high caloric diet, in conjunction with low levels of physical activity, promotes obesity. Many studies are available regarding the relation between dietary saturated fats and the etiology of obesity, but most focus on liver, muscle and white adipose tissue. Furthermore, the majority of transcriptomic studi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2424-9

    authors: Nyima T,Müller M,Hooiveld GJ,Morine MJ,Scotti M

    更新日期:2016-02-09 00:00:00

  • Polysome profiling reveals broad translatome remodeling during endoplasmic reticulum (ER) stress in the pathogenic fungus Aspergillus fumigatus.

    abstract:BACKGROUND:The unfolded protein response (UPR) is a network of intracellular signaling pathways that supports the ability of the secretory pathway to maintain a balance between the load of proteins entering the endoplasmic reticulum (ER) and the protein folding capacity of the ER lumen. Current evidence indicates that ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-159

    authors: Krishnan K,Ren Z,Losada L,Nierman WC,Lu LJ,Askew DS

    更新日期:2014-02-25 00:00:00

  • Multi-tissue transcriptomics of the black widow spider reveals expansions, co-options, and functional processes of the silk gland gene toolkit.

    abstract:BACKGROUND:Spiders (Order Araneae) are essential predators in every terrestrial ecosystem largely because they have evolved potent arsenals of silk and venom. Spider silks are high performance materials made almost entirely of proteins, and thus represent an ideal system for investigating genome level evolution of nove...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-365

    authors: Clarke TH,Garb JE,Hayashi CY,Haney RA,Lancaster AK,Corbett S,Ayoub NA

    更新日期:2014-05-23 00:00:00

  • Cross-species global and subset gene expression profiling identifies genes involved in prostate cancer response to selenium.

    abstract:BACKGROUND:Gene expression technologies have the ability to generate vast amounts of data, yet there often resides only limited resources for subsequent validation studies. This necessitates the ability to perform sorting and prioritization of the output data. Previously described methodologies have used functional pat...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-5-58

    authors: Schlicht M,Matysiak B,Brodzeller T,Wen X,Liu H,Zhou G,Dhir R,Hessner MJ,Tonellato P,Suckow M,Pollard M,Datta MW

    更新日期:2004-08-20 00:00:00

  • Identification of novel aspartic proteases from Strongyloides ratti and characterisation of their evolutionary relationships, stage-specific expression and molecular structure.

    abstract:BACKGROUND:Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the pres...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-611

    authors: Mello LV,O'Meara H,Rigden DJ,Paterson S

    更新日期:2009-12-16 00:00:00

  • Transcriptional profiling of liver during the critical embryo-to-hatchling transition period in the chicken (Gallus gallus).

    abstract:BACKGROUND:Although hatching is perhaps the most abrupt and profound metabolic challenge that a chicken must undergo; there have been no attempts to functionally map the metabolic pathways induced in liver during the embryo-to-hatchling transition. Furthermore, we know very little about the metabolic and regulatory fac...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5080-4

    authors: Cogburn LA,Trakooljul N,Chen C,Huang H,Wu CH,Carré W,Wang X,White HB 3rd

    更新日期:2018-09-21 00:00:00