Identification of lung cancer gene markers through kernel maximum mean discrepancy and information entropy.

Abstract:

BACKGROUND: The early diagnosis of lung cancer has long been a critical problem in clinical practice, and identifying differentially expressed genes as disease markers is a promising solution. However, most existing gene differential expression analysis (DEA) methods have two main drawbacks: first, they are based on fixed statistical hypotheses and are not always effective; second, they cannot identify a definite expression level boundary when there is no obvious expression level gap between the control and experiment groups. METHODS: This paper proposes a novel approach to identify marker genes and gene expression level boundaries for lung cancer. By calculating a kernel maximum mean discrepancy, our method evaluates the expression differences between normal, normal adjacent to tumor (NAT) and tumor samples. For the potential marker genes, the expression level boundaries among the different groups are defined with the information entropy method. RESULTS: Compared with two conventional methods, the t-test and fold change, the genes with the top average ranks selected by our method achieve better performance under all metrics in 10-fold cross-validation. GO and KEGG enrichment analyses are then conducted to explore the biological functions of the top 100 ranked genes. Finally, we choose the 10 genes with the top average ranks as lung cancer markers, and their expression boundaries are calculated and reported. CONCLUSION: The proposed approach is effective at identifying gene markers for lung cancer diagnosis. It is not only more accurate than conventional DEA methods but also provides a reliable way to identify gene expression level boundaries.
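The two steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the kernel, the bandwidth, or the exact entropy criterion, so the Gaussian kernel, the `sigma` parameter, the biased MMD estimator, and the weighted-entropy cut rule below are all assumptions, and the function names `mmd2` and `entropy_boundary` are hypothetical.

```python
import numpy as np

def gaussian_kernel(a, b, sigma):
    # Pairwise Gaussian (RBF) kernel between two 1-D arrays of expression values.
    d = a[:, None] - b[None, :]
    return np.exp(-d ** 2 / (2.0 * sigma ** 2))

def mmd2(x, y, sigma=1.0):
    # Biased estimate of the squared kernel maximum mean discrepancy
    # between two sample groups for a single gene (assumed estimator).
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return (gaussian_kernel(x, x, sigma).mean()
            + gaussian_kernel(y, y, sigma).mean()
            - 2.0 * gaussian_kernel(x, y, sigma).mean())

def entropy(labels):
    # Shannon entropy (in bits) of an array of non-negative integer labels.
    if labels.size == 0:
        return 0.0
    p = np.bincount(labels) / labels.size
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def entropy_boundary(x, y):
    # Assumed criterion: pick the cut point between two groups that
    # minimises the size-weighted entropy of the group labels on each side.
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    values = np.concatenate([x, y])
    labels = np.concatenate([np.zeros(len(x), int), np.ones(len(y), int)])
    order = np.argsort(values)
    values, labels = values[order], labels[order]
    best_cut, best_h = None, np.inf
    for i in range(1, len(values)):
        if values[i] == values[i - 1]:
            continue  # no valid cut between identical values
        h = (i * entropy(labels[:i])
             + (len(values) - i) * entropy(labels[i:])) / len(values)
        if h < best_h:
            best_h, best_cut = h, (values[i - 1] + values[i]) / 2.0
    return best_cut, best_h

# Toy expression values for one gene (made-up numbers, not study data).
normal = [1.0, 2.0, 3.0]
tumor = [7.0, 8.0, 9.0]
score = mmd2(normal, tumor)
cut, h = entropy_boundary(normal, tumor)
```

In such a sketch, genes would be ranked by `score` for each pair of groups, and `cut` would serve as the candidate expression level boundary for a selected gene. A larger `score` indicates a greater difference between the two expression distributions under the chosen kernel.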

journal_name

BMC Med Genomics

journal_title

BMC medical genomics

authors

Zhao Z,Peng H,Zhang X,Zheng Y,Chen F,Fang L,Li J

doi

10.1186/s12920-019-0630-4

subject

Has Abstract

pub_date

2019-12-20 00:00:00

pages

183

issue

Suppl 8

issn

1755-8794

pii

10.1186/s12920-019-0630-4

journal_volume

12

pub_type

Journal Article
  • The caudate nucleus undergoes dramatic and unique transcriptional changes in human prodromal Huntington's disease brain.

    abstract:BACKGROUND:The mechanisms underlying neurodegeneration in the striatum of Huntington's Disease (HD) brain are currently unknown. While the striatum is massively degenerated in symptomatic individuals, which makes cellular characterization difficult, it is largely intact in asymptomatic HD gene positive (HD+) individuals...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-019-0581-9

    authors: Agus F,Crespo D,Myers RH,Labadorf A

    update_date:2019-10-16 00:00:00

  • Identification of exon skipping events associated with Alzheimer's disease in the human hippocampus.

    abstract:BACKGROUND:At least 90% of human genes are alternatively spliced. Alternative splicing has an important function in regulating gene expression, and mis-splicing can contribute to risk for human diseases, including Alzheimer's disease (AD). METHODS:We developed a splicing decision model as a molecular mechanism to identif...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-018-0453-8

    authors: Han S,Miller JE,Byun S,Kim D,Risacher SL,Saykin AJ,Lee Y,Nho K,for Alzheimer’s Disease Neuroimaging Initiative.

    update_date:2019-01-31 00:00:00

  • IL-17A polymorphism (rs2275913) and levels are associated with preeclampsia pathogenesis in Chinese patients.

    abstract:BACKGROUND:Preeclampsia (PE) is a pregnancy-related condition that affects both the infant and the mother. Although the role of various inflammatory molecules in PE has been demonstrated, the importance of pro-inflammatory molecules such as IL-17A, IL-23 is not well understood. In the present investigation, a potential...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-00840-8

    authors: Lang X,Liu W,Hou Y,Zhao W,Yang X,Chen L,Yan Q,Cheng W

    update_date:2021-01-06 00:00:00

  • Identification and validation of suitable endogenous reference genes for gene expression studies in human peripheral blood.

    abstract:BACKGROUND:Gene expression studies require appropriate normalization methods. One such method uses stably expressed reference genes. Since suitable reference genes appear to be unique for each tissue, we have identified an optimal set of the most stably expressed genes in human blood that can be used for normalization....

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-2-49

    authors: Stamova BS,Apperson M,Walker WL,Tian Y,Xu H,Adamczy P,Zhan X,Liu DZ,Ander BP,Liao IH,Gregg JP,Turner RJ,Jickling G,Lit L,Sharp FR

    update_date:2009-08-05 00:00:00

  • Categorizing biomedicine images using novel image features and sparse coding representation.

    abstract:BACKGROUND:Images embedded in biomedical publications carry rich information that often concisely summarize key hypotheses adopted, methods employed, or results obtained in a published study. Therefore, they offer valuable clues for understanding main content in a biomedical publication. Prior studies have pointed out ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-6-S3-S8

    authors: Sheng J,Xu S,Luo X

    update_date:2013-01-01 00:00:00

  • Clinical analysis of germline copy number variation in DMD using a non-conjugate hierarchical Bayesian model.

    abstract:BACKGROUND:Detection of copy number variants (CNVs) is an important aspect of clinical testing for several disorders, including Duchenne muscular dystrophy, and is often performed using multiplex ligation-dependent probe amplification (MLPA). However, since many genetic carrier screens depend instead on next-generation...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-018-0404-4

    authors: Kozareva V,Stroff C,Silver M,Freidin JF,Delaney NF

    update_date:2018-10-20 00:00:00

  • WISARD: workbench for integrated superfast association studies for related datasets.

    abstract:BACKGROUND:A Mendelian transmission produces phenotypic and genetic relatedness between family members, giving family-based analytical methods an important role in genetic epidemiological studies-from heritability estimations to genetic association analyses. With the advance in genotyping technologies, whole-genome seq...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-018-0345-y

    authors: Lee S,Choi S,Qiao D,Cho M,Silverman EK,Park T,Won S

    update_date:2018-04-20 00:00:00

  • Comparative analysis of the human hepatic and adipose tissue transcriptomes during LPS-induced inflammation leads to the identification of differential biological pathways and candidate biomarkers.

    abstract:BACKGROUND:Insulin resistance (IR) is accompanied by chronic low grade systemic inflammation, obesity, and deregulation of total body energy homeostasis. We induced inflammation in adipose and liver tissues in vitro in order to mimic inflammation in vivo with the aim to identify tissue-specific processes implicated in ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-4-71

    authors: Szalowska E,Dijkstra M,Elferink MG,Weening D,de Vries M,Bruinenberg M,Hoek A,Roelofsen H,Groothuis GM,Vonk RJ

    update_date:2011-10-06 00:00:00

  • Genome scale analysis of pathogenic variants targetable for single base editing.

    abstract:BACKGROUND:Single nucleotide variants account for approximately 90% of all known pathogenic variants responsible for human diseases. Recently discovered CRISPR/Cas9 base editors can correct individual nucleotides without cutting DNA and inducing double-stranded breaks. We aimed to find all possible pathogenic variants ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-00735-8

    authors: Lavrov AV,Varenikov GG,Skoblov MY

    update_date:2020-09-18 00:00:00

  • Fraternal twins with Phelan-McDermid syndrome not involving the SHANK3 gene: case report and literature review.

    abstract:BACKGROUND:Phelan-McDermid syndrome (PMS, OMIM#606232), or 22q13 deletion syndrome, is a rare genetic disorder caused by deletion of the distal long arm of chromosome 22 with a variety of clinical features that display considerably heterogeneous degrees of severity. The SHANK3 gene is understood to be the critical gene...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-00802-0

    authors: Li S,Xi KW,Liu T,Zhang Y,Zhang M,Zeng LD,Li J

    update_date:2020-10-06 00:00:00

  • Segmental neurofibromatosis type 2: discriminating two hit from four hit in a patient presenting multiple schwannomas confined to one limb.

    abstract:BACKGROUND:A clinical overlap exists between mosaic Neurofibromatosis Type 2 and sporadic Schwannomatosis conditions. In these cases a molecular analysis of tumors is recommended for a proper genetic diagnostics. This analysis is challenged by the fact that schwannomas in both conditions bear a somatic double inactivat...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-015-0076-2

    authors: Castellanos E,Bielsa I,Carrato C,Rosas I,Solanes A,Hostalot C,Amilibia E,Prades J,Roca-Ribas F,Lázaro C,Blanco I,Serra E,NF2 Multidisciplinary Clinics HUGTiP-ICO-IMPPC.

    update_date:2015-01-24 00:00:00

  • DNA methylation differences in monozygotic twin pairs discordant for schizophrenia identifies psychosis related genes and networks.

    abstract:BACKGROUND:Despite their singular origin, monozygotic twin pairs often display discordance for complex disorders including schizophrenia. It is a common (1%) and often familial disease with a discordance rate of ~50% in monozygotic twins. This high discordance is often explained by the role of yet unknown environmental...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-015-0093-1

    authors: Castellani CA,Laufer BI,Melka MG,Diehl EJ,O'Reilly RL,Singh SM

    update_date:2015-05-06 00:00:00

  • Identification of sequence variants associated with severe microtia-astresia by targeted sequencing.

    abstract:BACKGROUND:Microtia-atresia is characterized by abnormalities of the auricle (microtia) and aplasia or hypoplasia of the external auditory canal, often associated with middle ear abnormalities. To date, no causal genetic mutations or genes have been identified in microtia-atresia patients. METHODS:We designed a panel ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-019-0475-x

    authors: Wang P,Wang Y,Fan X,Liu Y,Fan Y,Liu T,Chen C,Zhang S,Chen X

    update_date:2019-01-28 00:00:00

  • Genomic selection of reference genes for real-time PCR in human myocardium.

    abstract:BACKGROUND:Reliability of real-time PCR (RT-qPCR) data is dependent on the use of appropriate reference gene(s) for normalization. To date, no validated reference genes have been reported for normalizing gene expression in human myocardium. This study aimed to identify validated reference genes for use in gene expressi...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-1-64

    authors: Pilbrow AP,Ellmers LJ,Black MA,Moravec CS,Sweet WE,Troughton RW,Richards AM,Frampton CM,Cameron VA

    update_date:2008-12-29 00:00:00

  • Within-pair differences of DNA methylation levels between monozygotic twins are different between male and female pairs.

    abstract:BACKGROUND:DNA methylation levels will be important for detection of epigenetic effects. However, there are few reports showing sex-related differences in the sensitivity to DNA methylation. To evaluate their sex-related individual differences in the sensitivity to methylation rigorously, we performed a systematic anal...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-016-0217-2

    authors: Watanabe M,Honda C,Osaka Twin Research Group.,Iwatani Y,Yorifuji S,Iso H,Kamide K,Hatazawa J,Kihara S,Sakai N,Watanabe H,Makimoto K,Watanabe M,Honda C,Iwatani Y

    update_date:2016-08-26 00:00:00

  • Integrative analysis reveals disease-associated genes and biomarkers for prostate cancer progression.

    abstract:BACKGROUND:Prostate cancer is one of the most common complex diseases and a leading cause of death in men. Identifying prostate cancer-associated genes and biomarkers is thus essential, as they can provide insights into the mechanisms underlying disease progression and advancing for early diagnosis and developi...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-7-S1-S3

    authors: Li Y,Vongsangnak W,Chen L,Shen B

    update_date:2014-01-01 00:00:00

  • Modified entropy-based procedure detects gene-gene-interactions in unconventional genetic models.

    abstract:BACKGROUND:Since it is assumed that genetic interactions play an important role in understanding the mechanisms of complex diseases, different statistical approaches have been suggested in recent years for this task. One interesting approach is the entropy-based IGENT method by Kwon et al. that promises an efficient de...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-0703-4

    authors: Malten J,König IR

    update_date:2020-04-23 00:00:00

  • Development of somatic mutation signatures for risk stratification and prognosis in lung and colorectal adenocarcinomas.

    abstract:BACKGROUND:Prognostic signatures are vital to precision medicine. However, development of somatic mutation prognostic signatures for cancers remains a challenge. In this study we developed a novel method for discovering somatic mutation based prognostic signatures. RESULTS:Somatic mutation and clinical data for lung a...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-018-0454-7

    authors: Menor M,Zhu Y,Wang Y,Zhang J,Jiang B,Deng Y

    update_date:2019-01-31 00:00:00

  • Detecting early-warning signals of type 1 diabetes and its leading biomolecular networks by dynamical network biomarkers.

    abstract:BACKGROUND:Type 1 diabetes (T1D) is a complex disease and harmful to human health, and most of the existing biomarkers are mainly to measure the disease phenotype after the disease onset (or drastic deterioration). Until now, there is no effective biomarker which can predict the upcoming disease (or pre-disease state) ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-6-S2-S8

    authors: Liu X,Liu R,Zhao XM,Chen L

    update_date:2013-01-01 00:00:00

  • Molecular conservation of estrogen-response associated with cell cycle regulation, hormonal carcinogenesis and cancer in zebrafish and human cancer cell lines.

    abstract:BACKGROUND:The zebrafish is recognized as a versatile cancer and drug screening model. However, it is not known whether the estrogen-responsive genes and signaling pathways that are involved in estrogen-dependent carcinogenesis and human cancer are operating in zebrafish. In order to determine the potential of zebrafis...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-4-41

    authors: Lam SH,Lee SG,Lin CY,Thomsen JS,Fu PY,Murthy KR,Li H,Govindarajan KR,Nick LC,Bourque G,Gong Z,Lufkin T,Liu ET,Mathavan S

    update_date:2011-05-16 00:00:00

  • Key genes for modulating information flow play a temporal role as breast tumor coexpression networks are dynamically rewired by letrozole.

    abstract:BACKGROUND:Genes do not act in isolation but instead as part of complex regulatory networks. To understand how breast tumors adapt to the presence of the drug letrozole, at the molecular level, it is necessary to consider how the expression levels of genes in these networks change relative to one another. METHODS:Usin...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-6-S2-S2

    authors: Penrod NM,Moore JH

    update_date:2013-01-01 00:00:00

  • Integration analysis of long non-coding RNA (lncRNA) role in tumorigenesis of colon adenocarcinoma.

    abstract:BACKGROUND:Colon adenocarcinoma (COAD) is one of the most common gastrointestinal cancers globally. Molecular aberrations of tumor suppressors and/or oncogenes are the main contributors to tumorigenesis. However, the exact underlying mechanisms of COAD pathogenesis are not yet clearly known. In this regard, there is an...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-00757-2

    authors: Poursheikhani A,Abbaszadegan MR,Nokhandani N,Kerachian MA

    update_date:2020-07-29 00:00:00

  • Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs.

    abstract:BACKGROUND:Host protein-protein interaction networks are altered by invading virus proteins, which create new interactions, and modify or destroy others. The resulting network topology favors excessive amounts of virus production in a stressed host cell network. Short linear peptide motifs common to both virus and host...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-2-27

    authors: Evans P,Dampier W,Ungar L,Tozeren A

    update_date:2009-05-18 00:00:00

  • Association of adipocyte genes with ASP expression: a microarray analysis of subcutaneous and omental adipose tissue in morbidly obese subjects.

    abstract:BACKGROUND:Prevalence of obesity is increasing to pandemic proportions. However, obese subjects differ in insulin resistance, adipokine production and co-morbidities. Based on fasting plasma analysis, obese subjects were grouped as Low Acylation Stimulating protein (ASP) and Triglyceride (TG) (LAT) vs High ASP and TG (...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-3-3

    authors: MacLaren RE,Cui W,Lu H,Simard S,Cianflone K

    update_date:2010-01-27 00:00:00

  • Testicular sex cord-stromal tumor in a boy with 2q37 deletion syndrome.

    abstract:BACKGROUND:2q37 deletion syndrome is a rare congenital disorder that is characterized by facial dysmorphism, obesity, vascular and skeletal malformations, and a variable degree of intellectual disability. To date, common but variable phenotypes, such as skeletal or digit malformations and obesity, have been associated ...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-7-19

    authors: Sakai Y,Souzaki R,Yamamoto H,Matsushita Y,Nagata H,Ishizaki Y,Torisu H,Oda Y,Taguchi T,Shaw CA,Hara T

    update_date:2014-04-22 00:00:00

  • Mosaic chromosome 18 anomaly delineated in a child with dysmorphism using a three-pronged cytogenetic techniques approach: a case report.

    abstract:BACKGROUND:A plethora of cases are reported in the literature with iso- and ring-chromosome 18. However, co-occurrence of these two abnormalities in an individual along with a third cell line and absence of numerical anomaly is extremely rare. CASE PRESENTATION:A 7-year-old female was referred for diagnosis due to gro...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-020-00796-9

    authors: Sheth H,Trivedi S,Liehr T,Patel K,Jain D,Sheth J,Sheth F

    update_date:2020-09-24 00:00:00

  • Saliva sampling in global clinical studies: the impact of low sampling volume on performance of DNA in downstream genotyping experiments.

    abstract:BACKGROUND:The collection of viable DNA samples is an essential element of any genetics research programme. Biological samples for DNA purification are now routinely collected in many studies with a variety of sampling methods available. Initial observation in this study suggested a reduced genotyping success rate of s...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-6-20

    authors: Pulford DJ,Mosteller M,Briley JD,Johansson KW,Nelsen AJ

    update_date:2013-06-10 00:00:00

  • Gene expression signatures in childhood acute leukemias are largely unique and distinct from those of normal tissues and other malignancies.

    abstract:BACKGROUND:Childhood leukemia is characterized by the presence of balanced chromosomal translocations or by other structural or numerical chromosomal changes. It is well known that leukemias with specific molecular abnormalities display profoundly different global gene expression profiles. However, it is largely unknown...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-3-6

    authors: Andersson A,Edén P,Olofsson T,Fioretos T

    update_date:2010-03-08 00:00:00

  • Transcriptome sequencing of lncRNA, miRNA, mRNA and interaction network constructing in coronary heart disease.

    abstract:BACKGROUND:Non-coding RNA has been shown to participate in numerous biological and pathological processes and has attracted increasing attention in recent years. Recent studies have demonstrated that long non-coding RNA and micro RNA can interact through various mechanisms to regulate mRNA. Yet the gene-gene interactio...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/s12920-019-0570-z

    authors: Liao J,Wang J,Liu Y,Li J,Duan L

    update_date:2019-08-23 00:00:00

  • Overlap of expression quantitative trait loci (eQTL) in human brain and blood.

    abstract:BACKGROUND:Expression quantitative trait loci (eQTL) are genomic regions regulating RNA transcript expression levels. Genome-wide Association Studies (GWAS) have identified many variants, often in non-coding regions, with unknown functions and eQTL provide a possible mechanism by which these variants may influence obse...

    journal_title:BMC medical genomics

    pub_type: Journal Article

    doi:10.1186/1755-8794-7-31

    authors: McKenzie M,Henders AK,Caracella A,Wray NR,Powell JE

    update_date:2014-06-03 00:00:00