Abstract:
BACKGROUND: Phenomena such as incomplete lineage sorting, horizontal gene transfer, gene duplication and subsequent sub- and neo-functionalisation can result in distinct local phylogenetic relationships that are discordant with species phylogeny. In order to assess the possible biological roles for these subdivisions, they must first be identified and characterised, preferably on a large scale and in an automated fashion.

RESULTS: We developed Saguaro, a combination of a Hidden Markov Model (HMM) and a Self Organising Map (SOM), to characterise local phylogenetic relationships among aligned sequences using cacti, matrices of pair-wise distance measures. While the HMM determines the genomic boundaries from aligned sequences, the SOM hypothesises new cacti in an unsupervised and iterative fashion based on the regions that were modelled least well by existing cacti. After testing the software on simulated data, we demonstrate the utility of Saguaro by testing two different data sets: (i) 181 Dengue virus strains, and (ii) 5 primate genomes. Saguaro identifies regions under lineage-specific constraint for the first set, and genomic segments that we attribute to incomplete lineage sorting in the second dataset. Intriguingly for the primate data, Saguaro also classified an additional ~3% of the genome as most incompatible with the expected species phylogeny. A substantial fraction of these regions was found to overlap genes associated with both the innate and adaptive immune systems.

CONCLUSIONS: Saguaro detects distinct cacti describing local phylogenetic relationships without requiring any a priori hypotheses. We have successfully demonstrated Saguaro's utility with two contrasting data sets, one containing many members with short sequences (Dengue viral strains: n = 181, genome size = 10,700 nt), and the other with few members but complex genomes (related primate species: n = 5, genome size = 3 Gb), suggesting that the software is applicable to a wide variety of experimental populations. Saguaro is written in C++, runs on the Linux operating system, and can be downloaded from http://saguarogw.sourceforge.net/.
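
The idea of a cactus as a pairwise distance matrix, and of assigning genomic regions to the cactus that models them best, can be illustrated with a small Python sketch. This is not Saguaro's implementation (which is written in C++): the function names, the per-site mismatch fraction used as the distance, the squared-difference fit score, and the fixed `switch_penalty` are all assumptions chosen for illustration. A Viterbi-style dynamic programme stands in for the HMM, and the SOM step that proposes new cacti is not covered.

```python
from itertools import combinations


def pairwise_distance_matrix(seqs):
    """Build a 'cactus'-like matrix: per-site mismatch fraction for each pair.

    Assumption: a simple p-distance is used here; the abstract does not say
    which distance measure Saguaro uses.
    """
    n = len(seqs)
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    matrix = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        # Compare only columns where neither sequence has a gap.
        pairs = [(a, b) for a, b in zip(seqs[i], seqs[j]) if a != "-" and b != "-"]
        d = sum(a != b for a, b in pairs) / len(pairs) if pairs else 0.0
        matrix[i][j] = matrix[j][i] = d
    return matrix


def matrix_mismatch(m1, m2):
    """Sum of squared differences over the upper triangle (hypothetical fit score)."""
    n = len(m1)
    return sum((m1[i][j] - m2[i][j]) ** 2 for i in range(n) for j in range(i + 1, n))


def segment(window_matrices, cacti, switch_penalty=0.1):
    """Assign each window to a cactus index with a Viterbi-style recursion.

    Minimises total mismatch plus a fixed penalty for every change of cactus
    between adjacent windows (an illustrative stand-in for HMM transitions).
    """
    k = len(cacti)
    cost = [matrix_mismatch(window_matrices[0], c) for c in cacti]
    back = []
    for w in window_matrices[1:]:
        new_cost, pointers = [], []
        for s in range(k):
            def step(p, s=s):
                return cost[p] + (0.0 if p == s else switch_penalty)
            best_prev = min(range(k), key=step)
            new_cost.append(step(best_prev) + matrix_mismatch(w, cacti[s]))
            pointers.append(best_prev)
        cost = new_cost
        back.append(pointers)
    # Trace back from the cheapest final state.
    state = min(range(k), key=lambda s: cost[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy alignment windows for three taxa with two different local topologies.
window_a = ["AAAA", "AAAA", "TTTT"]  # taxa 0 and 1 are closest
window_b = ["AAAA", "TTTT", "TTTT"]  # taxa 1 and 2 are closest
cacti = [pairwise_distance_matrix(window_a), pairwise_distance_matrix(window_b)]
windows = [pairwise_distance_matrix(w) for w in (window_a, window_a, window_b, window_b)]
labels = segment(windows, cacti)
```

In this toy case the boundary between the two local relationships falls between the second and third windows. A larger `switch_penalty` would favour fewer, longer segments.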
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Zamani N, Russell P, Lantz H, Hoeppner MP, Meadows JR, Vijay N, Mauceli E, di Palma F, Lindblad-Toh K, Jern P, Grabherr MG
doi: 10.1186/1471-2164-14-347
subject: Has Abstract
pub_date: 2013-05-24 00:00:00
pages: 347
issn: 1471-2164
pii: 1471-2164-14-347
journal_volume: 14
pub_type: Journal article