A deconvolution method and its application in analyzing the cellular fractions in acute myeloid leukemia samples.

Abstract:

BACKGROUND:Identifying cell type-specific genes (markers) is an essential step in deconvolving cellular fractions from the gene expression data of a bulk sample. However, genes identified as significantly changed by pair-wise comparisons do not necessarily reflect the specificity of gene expression across multiple conditions. In addition, knowledge about how to identify gene expression markers across multiple conditions remains scarce. RESULTS:Herein, we developed a hybrid tool, LinDeconSeq, which consists of 1) identifying marker genes using specificity scoring and mutual linearity strategies across any number of cell types, and 2) predicting cellular fractions of bulk samples using weighted robust linear regression with the marker genes identified in the first stage. On multiple publicly available datasets, the marker genes identified by LinDeconSeq demonstrated better accuracy and reproducibility than those identified by MGFM and RNentropy. Among deconvolution methods, LinDeconSeq showed low average deviations (≤0.0958) and high average Pearson correlations (≥0.8792) between the predicted and actual fractions on the benchmark datasets. Importantly, the cellular fractions predicted by LinDeconSeq appear to be relevant to the diagnosis of acute myeloid leukemia (AML). Distinct cellular fractions of granulocyte-monocyte progenitors (GMP), lymphoid-primed multipotent progenitors (LMPP) and monocytes (MONO) were found to be closely associated with AML when compared with healthy samples. Moreover, the heterogeneity of cellular fractions divided AML patients into two subgroups that differed in both prognosis and mutation patterns. The GMP fraction showed the most pronounced difference between these two subgroups, particularly in SubgroupA, which was strongly associated with better AML prognosis and younger patients. Overall, the marker genes identified by LinDeconSeq provide improved features for deconvolution. The data processing strategy applied to the cellular fractions in this study also showed potential for the diagnosis and prognosis of diseases. CONCLUSIONS:Taken together, we developed a freely available, open-source tool, LinDeconSeq (https://github.com/lihuamei/LinDeconSeq), which includes marker identification and deconvolution procedures. LinDeconSeq is comparable to other current methods in accuracy on benchmark datasets and is broadly applicable to studies of clinical outcome and disease-specific molecular mechanisms.
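
The second stage described in the abstract (estimating cellular fractions by regressing a bulk profile on marker-gene signatures) can be illustrated with a minimal sketch. This is not the LinDeconSeq implementation. It assumes a generic iteratively reweighted least squares fit with Huber weights, followed by clipping negative coefficients and normalizing to sum to one. The function names `huber_weights`, `deconvolve` and `evaluate` are hypothetical, and "average deviation" is interpreted here as the mean absolute deviation, which is also an assumption.

```python
import numpy as np


def huber_weights(residuals, k=1.345):
    """Huber weights: 1 for small residuals, k/|u| for large scaled residuals."""
    # Robust scale estimate from the median absolute deviation (MAD).
    scale = np.median(np.abs(residuals - np.median(residuals))) / 0.6745
    if scale == 0:
        return np.ones_like(residuals)
    u = np.abs(residuals) / scale
    weights = np.ones_like(u)
    large = u > k
    weights[large] = k / u[large]
    return weights


def deconvolve(signature, bulk, n_iter=50, tol=1e-8):
    """Estimate fractions f with bulk ~ signature @ f (illustrative only).

    signature: (n_marker_genes, n_cell_types) expression of marker genes
    bulk:      (n_marker_genes,) expression of the same genes in a bulk sample
    """
    weights = np.ones(signature.shape[0])
    coef = np.zeros(signature.shape[1])
    for _ in range(n_iter):
        # Weighted least squares: scale rows by the square root of the weights.
        sw = np.sqrt(weights)
        new_coef, *_ = np.linalg.lstsq(signature * sw[:, None], bulk * sw, rcond=None)
        converged = np.max(np.abs(new_coef - coef)) < tol
        coef = new_coef
        if converged:
            break
        weights = huber_weights(bulk - signature @ coef)
    # Assumption: fractions are non-negative and sum to one.
    coef = np.clip(coef, 0.0, None)
    total = coef.sum()
    return coef / total if total > 0 else coef


def evaluate(predicted, actual):
    """Return (Pearson correlation, mean absolute deviation) of two fraction vectors."""
    r = np.corrcoef(predicted, actual)[0, 1]
    deviation = np.mean(np.abs(predicted - actual))
    return r, deviation


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    signature = rng.uniform(1.0, 10.0, size=(60, 3))  # synthetic marker signatures
    true_fractions = np.array([0.5, 0.3, 0.2])
    bulk = signature @ true_fractions
    bulk[:3] += 50.0  # a few outlier genes that the robust weights should discount
    estimated = deconvolve(signature, bulk)
    print(estimated, evaluate(estimated, true_fractions))
```

The robust weights matter when a few marker genes behave differently in the bulk sample than in the reference profiles. An ordinary least squares fit lets such genes pull the estimated fractions, whereas the reweighting step reduces their influence.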

journal_name

BMC Genomics

authors

Li H,Sharma A,Ming W,Sun X,Liu H

doi

10.1186/s12864-020-06888-1

pub_date

2020-09-23 00:00:00

pages

652

issue

1

issn

1471-2164

pii

10.1186/s12864-020-06888-1

journal_volume

21

pub_type

Journal Article
  • Evolution of cis- and trans-regulatory divergence in the chicken genome between two contrasting breeds analyzed using three tissue types at one-day-old.

    abstract:BACKGROUND:Gene expression variation is a key underlying factor influencing phenotypic variation, and can occur via cis- or trans-regulation. To understand the role of cis- and trans-regulatory variation on population divergence in chicken, we developed reciprocal crosses of two chicken breeds, White Leghorn and Cornis...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6342-5

    authors: Wang Q,Jia Y,Wang Y,Jiang Z,Zhou X,Zhang Z,Nie C,Li J,Yang N,Qu L

    update_date:2019-12-05 00:00:00

  • Transcriptomics of the late gestation ovine fetal brain: modeling the co-expression of immune marker genes.

    abstract:BACKGROUND:Major changes in gene expression occur in the fetal brain to modulate the function of this organ postnatally. Thus, factors can alter the genomics of the fetal brain, predisposing to neurological disorders later in life. We hypothesized that the physiological dynamics of the immune system transcriptome of th...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1001

    authors: Rabaglino MB,Keller-Wood M,Wood CE

    update_date:2014-11-19 00:00:00

  • A general pipeline for the development of anchor markers for comparative genomics in plants.

    abstract:BACKGROUND:Complete or near-complete genomic sequence information is presently only available for a few plant species representing a large phylogenetic diversity among plants. In order to effectively transfer this information to species lacking sequence information, comparative genomic tools need to be developed. Molec...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-207

    authors: Fredslund J,Madsen LH,Hougaard BK,Nielsen AM,Bertioli D,Sandal N,Stougaard J,Schauser L

    update_date:2006-08-14 00:00:00

  • Rapid evolutionary divergence of diploid and allotetraploid Gossypium mitochondrial genomes.

    abstract:BACKGROUND:Cotton (Gossypium spp.) is commonly grouped into eight diploid genomic groups and an allotetraploid genomic group, AD. The mitochondrial genomes supply new information to understand both the evolution process and the mechanism of cytoplasmic male sterility. Based on previously released mitochondrial genomes ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4282-5

    authors: Chen Z,Nie H,Wang Y,Pei H,Li S,Zhang L,Hua J

    update_date:2017-11-13 00:00:00

  • Quantitative proteomic analysis of host-pathogen interactions: a study of Acinetobacter baumannii responses to host airways.

    abstract:BACKGROUND:Acinetobacter baumannii is a major health problem. The most common infection caused by A. baumannii is hospital acquired pneumonia, and the associated mortality rate is approximately 50%. Neither in vivo nor ex vivo expression profiling has been performed at the proteomic or transcriptomic level for pneumoni...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1608-z

    authors: Méndez JA,Mateos J,Beceiro A,Lopez M,Tomás M,Poza M,Bou G

    update_date:2015-05-30 00:00:00

  • Selenium toxicity but not deficient or super-nutritional selenium status vastly alters the transcriptome in rodents.

    abstract:BACKGROUND:Protein and mRNA levels for several selenoproteins, such as glutathione peroxidase-1 (Gpx1), are down-regulated dramatically by selenium (Se) deficiency. These levels in rats increase sigmoidally with increasing dietary Se and reach defined plateaus at the Se requirement, making them sensitive biomarkers for...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-26

    authors: Raines AM,Sunde RA

    update_date:2011-01-12 00:00:00

  • Multi-tissue transcriptome analysis using hybrid-sequencing reveals potential genes and biological pathways associated with azadirachtin A biosynthesis in neem (Azadirachta indica).

    abstract:BACKGROUND:Azadirachtin A is a triterpenoid from neem tree exhibiting excellent activities against over 600 insect species in agriculture. The production of azadirachtin A depends on extraction from neem tissues, which is not an eco-friendly and sustainable process. The low yield and discontinuous supply of azadirachti...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07124-6

    authors: Wang H,Wang N,Huo Y

    update_date:2020-10-28 00:00:00

  • MicroRNA expression profiling of the fifth-instar posterior silk gland of Bombyx mori.

    abstract:BACKGROUND:The growth and development of the posterior silk gland and the biosynthesis of the silk core protein at the fifth larval instar stage of Bombyx mori are of paramount importance for silk production. RESULTS:Here, aided by next-generation sequencing and microarray assay, we profile 1,229 microRNAs (miRNAs), in...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-410

    authors: Li J,Cai Y,Ye L,Wang S,Che J,You Z,Yu J,Zhong B

    update_date:2014-05-29 00:00:00

  • Assisted clustering of gene expression data using ANCut.

    abstract:BACKGROUND:In biomedical research, gene expression profiling studies have been extensively conducted. The analysis of gene expression data has led to a deeper understanding of human genetics as well as practically useful models. Clustering analysis has been a critical component of gene expression data analysis and can ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3990-1

    authors: Teran Hidalgo SJ,Wu M,Ma S

    update_date:2017-08-16 00:00:00

  • High-throughput comparison of gene fitness among related bacteria.

    abstract:BACKGROUND:The contribution of a gene to the fitness of a bacterium can be assayed by whether and to what degree the bacterium tolerates transposon insertions in that gene. We use this fact to compare the fitness of syntenic homologous genes among related Salmonella strains and thereby reveal differences not apparent a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-212

    authors: Canals R,Xia XQ,Fronick C,Clifton SW,Ahmer BM,Andrews-Polymenis HL,Porwollik S,McClelland M

    update_date:2012-05-30 00:00:00

  • Genome-wide host responses against infectious laryngotracheitis virus vaccine infection in chicken embryo lung cells.

    abstract:BACKGROUND:Infectious laryngotracheitis virus (ILTV; gallid herpesvirus 1) infection causes high mortality and huge economic losses in the poultry industry. To protect chickens against ILTV infection, chicken-embryo origin (CEO) and tissue-culture origin (TCO) vaccines have been used. However, the transmission of vacci...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-143

    authors: Lee J,Bottje WG,Kong BW

    update_date:2012-04-24 00:00:00

  • Multiple genetic loci define Ca++ utilization by bloodstream malaria parasites.

    abstract:BACKGROUND:Bloodstream malaria parasites require Ca++ for their development, but the sites and mechanisms of Ca++ utilization are not well understood. We hypothesized that there may be differences in Ca++ uptake or utilization by genetically distinct lines of P. falciparum. These differences, if identified, may provide...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5418-y

    authors: Apolis L,Olivas J,Srinivasan P,Kushwaha AK,Desai SA

    update_date:2019-01-16 00:00:00

  • Conditional entropy in variation-adjusted windows detects selection signatures associated with expression quantitative trait loci (eQTLs).

    abstract:BACKGROUND:Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S8-S8

    authors: Handelman SK,Seweryn M,Smith RM,Hartmann K,Wang D,Pietrzak M,Johnson AD,Kloczkowski A,Sadee W

    update_date:2015-01-01 00:00:00

  • Transcriptomic analysis of differentially expressed genes in the Ras1(CA)-overexpressed and wildtype posterior silk glands.

    abstract:BACKGROUND:Using the piggyBac-mediated GAL4/UAS transgenic system established in the silkworm, Bombyx mori, we have previously reported that overexpression of the Ras1(CA) oncogene specifically in the posterior silk gland (PSG) improved cell growth, fibroin synthesis, and thus silk yield. However, the detailed molecula...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-182

    authors: Ma L,Ma Q,Li X,Cheng L,Li K,Li S

    update_date:2014-03-09 00:00:00

  • Coevolution of paired receptors in Xenopus carcinoembryonic antigen-related cell adhesion molecule families suggests appropriation as pathogen receptors.

    abstract:BACKGROUND:In mammals, CEACAM1 and closely related members represent paired receptors with similar extracellular ligand-binding regions and cytoplasmic domains with opposing functions. Human CEACAM1 and CEACAM3 which have inhibitory ITIM/ITSM and activating ITAM-like motifs, respectively, in their cytoplasmic regions a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3279-9

    authors: Zimmermann W,Kammerer R

    update_date:2016-11-16 00:00:00

  • Identification and functional analysis of early gene expression induced by circadian light-resetting in Drosophila.

    abstract:BACKGROUND:The environmental light-dark cycle is the dominant cue that maintains 24-h biological rhythms in multicellular organisms. In Drosophila, light entrainment is mediated by the photosensitive protein CRYPTOCHROME, but the role and extent of transcription regulation in light resetting of the dipteran clock is ye...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1787-7

    authors: Adewoye AB,Kyriacou CP,Tauber E

    update_date:2015-08-01 00:00:00

  • Annotation and classification of the bovine T cell receptor delta genes.

    abstract:BACKGROUND:gammadelta T cells differ from alphabeta T cells with regard to the types of antigen with which their T cell receptors interact; gammadelta T cell antigens are not necessarily peptides nor are they presented on MHC. Cattle are considered a "gammadelta T cell high" species indicating they have an increased pr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-100

    authors: Herzig CT,Lefranc MP,Baldwin CL

    update_date:2010-02-09 00:00:00

  • Genome analyses of the wheat yellow (stripe) rust pathogen Puccinia striiformis f. sp. tritici reveal polymorphic and haustorial expressed secreted proteins as candidate effectors.

    abstract:BACKGROUND:Wheat yellow (stripe) rust caused by Puccinia striiformis f. sp. tritici (PST) is one of the most devastating diseases of wheat worldwide. To design effective breeding strategies that maximize the potential for durable disease resistance it is important to understand the molecular basis of PST pathogenicity....

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-270

    authors: Cantu D,Segovia V,MacLean D,Bayles R,Chen X,Kamoun S,Dubcovsky J,Saunders DG,Uauy C

    update_date:2013-04-22 00:00:00

  • Alpha tubulin genes from Leishmania braziliensis: genomic organization, gene structure and insights on their expression.

    abstract:BACKGROUND:Alpha tubulin is a fundamental component of the cytoskeleton which is responsible for cell shape and is involved in cell division, ciliary and flagellar motility and intracellular transport. Alpha tubulin gene expression varies according to the morphological changes suffered by Leishmania in its life cycle. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-454

    authors: Ramírez CA,Requena JM,Puerta CJ

    update_date:2013-07-06 00:00:00

  • High frequency of microsatellites in S. cerevisiae meiotic recombination hotspots.

    abstract:BACKGROUND:Microsatellites are highly abundant in eukaryotic genomes but their function and evolution are not yet well understood. Their elevated mutation rate makes them ideal markers of genetic difference, but high levels of unexplained heterogeneity in mutation rates among microsatellites at different genomic locati...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-49

    authors: Bagshaw AT,Pitt JP,Gemmell NJ

    update_date:2008-01-28 00:00:00

  • mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development.

    abstract:BACKGROUND:Rich in genetic information and cost-effective to genotype, the Insertion-Deletion (InDel) molecular marker system is an important tool for studies in genetics, genomics and for marker-assisted breeding. Advent of next-generation sequencing (NGS) revolutionized the speed and throughput of sequence data gener...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2614-5

    authors: Lv Y,Liu Y,Zhao H

    update_date:2016-04-14 00:00:00

  • Daily rhythmicity of clock gene transcript levels in fast and slow muscle fibers from Chinese perch (Siniperca chuatsi).

    abstract:BACKGROUND:Clock genes are considered to be the molecular core of biological clock in vertebrates and they are directly involved in the regulation of daily rhythms in vertebrate tissues such as skeletal muscles. Fish myotomes are composed of anatomically segregated fast and slow muscle fibers that possess different met...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3373-z

    authors: Wu P,Li YL,Cheng J,Chen L,Zhu X,Feng ZG,Zhang JS,Chu WY

    update_date:2016-12-08 00:00:00

  • Comparative analysis of the acute response of the trout, O. mykiss, head kidney to in vivo challenge with virulent and attenuated infectious hematopoietic necrosis virus and LPS-induced inflammation.

    abstract:BACKGROUND:The response of the trout, O. mykiss, head kidney to bacterial lipopolysaccharide (LPS) or active and attenuated infectious hematopoietic necrosis virus (IHNV and attINHV respectively) intraperitoneal challenge, 24 and 72 hours post-injection, was investigated using a salmonid-specific cDNA microarray. RESU...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-141

    authors: MacKenzie S,Balasch JC,Novoa B,Ribas L,Roher N,Krasnov A,Figueras A

    update_date:2008-03-26 00:00:00

  • EPAS1 gene variants are associated with sprint/power athletic performance in two cohorts of European athletes.

    abstract:BACKGROUND:The endothelial PAS domain protein 1 (EPAS1) activates genes that are involved in erythropoiesis and angiogenesis, thus favoring a better delivery of oxygen to the tissues and is a plausible candidate to influence athletic performance. Using innovative statistical methods we compared genotype distributions a...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-382

    authors: Voisin S,Cieszczyk P,Pushkarev VP,Dyatlov DA,Vashlyayev BF,Shumaylov VA,Maciejewska-Karlowska A,Sawczuk M,Skuza L,Jastrzebski Z,Bishop DJ,Eynon N

    update_date:2014-05-18 00:00:00

  • Insights into the Musa genome: syntenic relationships to rice and between Musa species.

    abstract:BACKGROUND:Musa species (Zingiberaceae, Zingiberales) including bananas and plantains are collectively the fourth most important crop in developing countries. Knowledge concerning Musa genome structure and the origin of distinct cultivars has greatly increased over the last few years. Until now, however, no large-scale...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-58

    authors: Lescot M,Piffanelli P,Ciampi AY,Ruiz M,Blanc G,Leebens-Mack J,da Silva FR,Santos CM,D'Hont A,Garsmeur O,Vilarinhos AD,Kanamori H,Matsumoto T,Ronning CM,Cheung F,Haas BJ,Althoff R,Arbogast T,Hine E,Pappas GJ Jr,Sas

    update_date:2008-01-30 00:00:00

  • An efficient approach to finding Siraitia grosvenorii triterpene biosynthetic genes by RNA-seq and digital gene expression analysis.

    abstract:BACKGROUND:Siraitia grosvenorii (Luohanguo) is an herbaceous perennial plant native to southern China and most prevalent in Guilin city. Its fruit contains a sweet, fleshy, edible pulp that is widely used in traditional Chinese medicine. The major bioactive constituents in the fruit extract are the cucurbitane-type tri...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-343

    authors: Tang Q,Ma X,Mo C,Wilson IW,Song C,Zhao H,Yang Y,Fu W,Qiu D

    update_date:2011-07-05 00:00:00

  • A unified approach for allele frequency estimation, SNP detection and association studies based on pooled sequencing data using EM algorithms.

    abstract:BACKGROUND:Genome-wide association studies (GWAS) have identified many common polymorphisms associated with complex traits. However, these associated common variants explain only a small fraction of the phenotypic variances, leaving a substantial portion of genetic heritability unexplained. As a result, searches for "m...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-S1-S1

    authors: Chen Q,Sun F

    update_date:2013-01-01 00:00:00

  • MetaTopics: an integration tool to analyze microbial community profile by topic model.

    abstract:BACKGROUND:Deciphering taxonomical structures based on high dimensional sequencing data is still challenging in metagenomics study. Moreover, the common workflow processed in this field fails to identify microbial communities and their effect on a specific disease status. Even the relationships and interactions between...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3257-2

    authors: Yan J,Chuai G,Qi T,Shao F,Zhou C,Zhu C,Yang J,Yu Y,Shi C,Kang N,He Y,Liu Q

    update_date:2017-01-25 00:00:00

  • Transcriptome analysis of the oil-rich seed of the bioenergy crop Jatropha curcas L.

    abstract:BACKGROUND:To date, oil-rich plants are the main source of biodiesel products. Because concerns have been voiced about the impact of oil-crop cultivation on the price of food commodities, the interest in oil plants not used for food production and amenable to cultivation on non-agricultural land has soared. As a non-fo...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-462

    authors: Costa GG,Cardoso KC,Del Bem LE,Lima AC,Cunha MA,de Campos-Leite L,Vicentini R,Papes F,Moreira RC,Yunes JA,Campos FA,Da Silva MJ

    update_date:2010-08-06 00:00:00

  • Genetic architecture and genomic selection of female reproduction traits in rainbow trout.

    abstract:BACKGROUND:Rainbow trout is a significant fish farming species under temperate climates. Female reproduction traits play an important role in the economy of breeding companies with the sale of fertilized eggs. The objectives of this study are threefold: to estimate the genetic parameters of female reproduction traits, ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-06955-7

    authors: D'Ambrosio J,Morvezen R,Brard-Fudulea S,Bestin A,Acin Perez A,Guéméné D,Poncet C,Haffray P,Dupont-Nivet M,Phocas F

    update_date:2020-08-14 00:00:00