Abstract:
BACKGROUND: Identifying cell type-specific genes (markers) is an essential step in deconvolving cellular fractions, primarily from the gene expression data of a bulk sample. However, genes identified as significantly changed by pairwise comparisons do not necessarily reflect the specificity of gene expression across multiple conditions. In addition, knowledge about identifying gene expression markers across multiple conditions remains scarce.
RESULTS: Herein, we developed a hybrid tool, LinDeconSeq, which consists of 1) identifying marker genes using specificity scoring and mutual linearity strategies across any number of cell types, and 2) predicting cellular fractions of bulk samples using weighted robust linear regression with the marker genes identified in the first stage. On multiple publicly available datasets, the marker genes identified by LinDeconSeq demonstrated better accuracy and reproducibility than those identified by MGFM and RNentropy. Among deconvolution methods, LinDeconSeq showed low average deviations (≤0.0958) and high average Pearson correlations (≥0.8792) between the predicted and actual fractions on the benchmark datasets. Importantly, the cellular fractions predicted by LinDeconSeq appear to be relevant to the diagnosis of acute myeloid leukemia (AML). Distinct cellular fractions of granulocyte-monocyte progenitors (GMP), lymphoid-primed multipotent progenitors (LMPP) and monocytes (MONO) were found to be closely associated with AML relative to healthy samples. Moreover, the heterogeneity of cellular fractions divided AML patients into two subgroups that differed in both prognosis and mutation patterns. The difference in GMP fraction was the most pronounced between these two subgroups, particularly in SubgroupA, which was strongly associated with better AML prognosis and a younger population. Overall, marker gene identification by LinDeconSeq provides improved features for deconvolution. The data processing strategy applied to the cellular fractions in this study also showed potential for the diagnosis and prognosis of diseases.
CONCLUSIONS: Taken together, we developed a freely available, open-source tool, LinDeconSeq (https://github.com/lihuamei/LinDeconSeq), which includes marker identification and deconvolution procedures. LinDeconSeq is comparable in accuracy to other current methods when applied to benchmark datasets and has broad application in studying clinical outcomes and disease-specific molecular mechanisms.
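The second stage described above, estimating cellular fractions by robust linear regression on marker genes, can be illustrated with a minimal sketch. This is not LinDeconSeq's implementation: it assumes a Huber-weighted iteratively reweighted least squares fit, clips negative coefficients to zero, and rescales them to sum to one. The function names `deconvolve` and `evaluate`, the tuning constant `delta`, and the reading of "average deviation" as mean absolute deviation are all assumptions made for illustration.

```python
import numpy as np


def deconvolve(signature, bulk, n_iter=50, delta=1.345):
    """Estimate cell-type fractions for one bulk sample (illustrative only).

    signature: (genes x cell_types) marker-gene expression per cell type.
    bulk:      (genes,) expression of the same marker genes in the bulk sample.
    """
    weights = np.ones(signature.shape[0])
    coef = np.zeros(signature.shape[1])
    for _ in range(n_iter):
        # Weighted least squares: scale each gene (row) by sqrt(weight).
        sw = np.sqrt(weights)
        coef, *_ = np.linalg.lstsq(signature * sw[:, None], bulk * sw, rcond=None)
        resid = bulk - signature @ coef
        # Robust residual scale from the median absolute deviation.
        scale = np.median(np.abs(resid - np.median(resid))) / 0.6745
        if scale < 1e-12:
            break  # the fit is numerically exact; nothing to down-weight
        # Huber weights: 1 for small residuals, delta/u for outlying genes.
        u = np.abs(resid) / scale
        weights = np.minimum(1.0, delta / np.maximum(u, 1e-12))
    coef = np.clip(coef, 0.0, None)  # fractions cannot be negative
    total = coef.sum()
    return coef / total if total > 0 else coef


def evaluate(predicted, actual):
    """Return (mean absolute deviation, Pearson correlation) of two fraction vectors."""
    deviation = float(np.mean(np.abs(predicted - actual)))
    pearson = float(np.corrcoef(predicted, actual)[0, 1])
    return deviation, pearson


# Usage on synthetic data: 20 hypothetical marker genes, 3 cell types.
rng = np.random.default_rng(0)
signature = rng.uniform(1.0, 10.0, size=(20, 3))
true_fractions = np.array([0.5, 0.3, 0.2])
bulk = signature @ true_fractions
estimated = deconvolve(signature, bulk)
print(estimated, evaluate(estimated, true_fractions))
```

The two values returned by `evaluate` correspond to the kind of benchmark summary quoted in the abstract (deviation and Pearson correlation between predicted and actual fractions), under the assumptions stated above.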
journal_name: BMC Genomics
journal_title: BMC genomics
authors: Li H, Sharma A, Ming W, Sun X, Liu H
doi: 10.1186/s12864-020-06888-1
subject: Has Abstract
pub_date: 2020-09-23 00:00:00
pages: 652
issue: 1
issn: 1471-2164
pii: 10.1186/s12864-020-06888-1
journal_volume: 21
pub_type: Journal article