A fast and high performance multiple data integration algorithm for identifying human disease genes.

Abstract:

BACKGROUND:Integrating multiple data sources is indispensable in improving disease gene identification. It is not only due to the fact that disease genes associated with similar genetic diseases tend to lie close with each other in various biological networks, but also due to the fact that gene-disease associations are complex. Although various algorithms have been proposed to identify disease genes, their prediction performances and the computational time still should be further improved. RESULTS:In this study, we propose a fast and high performance multiple data integration algorithm for identifying human disease genes. A posterior probability of each candidate gene associated with individual diseases is calculated by using a Bayesian analysis method and a binary logistic regression model. Two prior probability estimation strategies and two feature vector construction methods are developed to test the performance of the proposed algorithm. CONCLUSIONS:The proposed algorithm is not only generated predictions with high AUC scores, but also runs very fast. When only a single PPI network is employed, the AUC score is 0.769 by using F2 as feature vectors. The average running time for each leave-one-out experiment is only around 1.5 seconds. When three biological networks are integrated, the AUC score using F3 as feature vectors increases to 0.830, and the average running time for each leave-one-out experiment takes only about 12.54 seconds. It is better than many existing algorithms.

journal_name

BMC Med Genomics

journal_title

BMC medical genomics

authors

Chen B,Li M,Wang J,Shang X,Wu FX

doi

10.1186/1755-8794-8-S3-S2

subject

Has Abstract

pub_date

2015-01-01 00:00:00

pages

S2

issn

1755-8794

pii

1755-8794-8-S3-S2

journal_volume

8 Suppl 3

pub_type

杂志文章
  • Study design and data analysis considerations for the discovery of prognostic molecular biomarkers: a case study of progression free survival in advanced serous ovarian cancer.

    abstract:BACKGROUND:Accurate discovery of molecular biomarkers that are prognostic of a clinical outcome is an important yet challenging task, partly due to the combination of the typically weak genomic signal for a clinical outcome and the frequently strong noise due to microarray handling effects. Effective strategies to reso...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-016-0187-4

    authors: Qin LX,Levine DA

    更新日期:2016-06-10 00:00:00

  • Detecting early-warning signals of type 1 diabetes and its leading biomolecular networks by dynamical network biomarkers.

    abstract:BACKGROUND:Type 1 diabetes (T1D) is a complex disease and harmful to human health, and most of the existing biomarkers are mainly to measure the disease phenotype after the disease onset (or drastic deterioration). Until now, there is no effective biomarker which can predict the upcoming disease (or pre-disease state) ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-6-S2-S8

    authors: Liu X,Liu R,Zhao XM,Chen L

    更新日期:2013-01-01 00:00:00

  • African ancestry is associated with cluster-based childhood asthma subphenotypes.

    abstract:BACKGROUND:Childhood asthma is a syndrome composed of heterogeneous phenotypes; furthermore, intrinsic biologic variation among racial/ethnic populations suggests possible genetic ancestry variation in childhood asthma. The objective of the study is to identify clinically homogeneous asthma subphenotypes in a diverse s...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0367-5

    authors: Ding L,Li D,Wathen M,Altaye M,Mersha TB

    更新日期:2018-05-31 00:00:00

  • Searching for molecular markers in head and neck squamous cell carcinomas (HNSCC) by statistical and bioinformatic analysis of larynx-derived SAGE libraries.

    abstract:BACKGROUND:Head and neck squamous cell carcinoma (HNSCC) is one of the most common malignancies in humans. The average 5-year survival rate is one of the lowest among aggressive cancers, showing no significant improvement in recent years. When detected early, HNSCC has a good prognosis, but most patients present metast...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-56

    authors: Silveira NJ,Varuzza L,Machado-Lima A,Lauretto MS,Pinheiro DG,Rodrigues RV,Severino P,Nobrega FG,Head and Neck Genome Project GENCAPO.,Silva WA Jr,de B Pereira CA,Tajara EH

    更新日期:2008-11-11 00:00:00

  • Transcriptomic analysis of fetal membranes reveals pathways involved in preterm birth.

    abstract:BACKGROUND:Preterm birth (PTB), defined as infant delivery before 37 weeks of completed gestation, results from the interaction of both genetic and environmental components and constitutes a complex multifactorial syndrome. Transcriptome analysis of PTB has proven challenging because of the multiple causes of PTB and t...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0498-3

    authors: Pereyra S,Sosa C,Bertoni B,Sapiro R

    更新日期:2019-04-01 00:00:00

  • The functional cancer map: a systems-level synopsis of genetic deregulation in cancer.

    abstract:BACKGROUND:Cancer cells are characterized by massive dysegulation of physiological cell functions with considerable disruption of transcriptional regulation. Genome-wide transcriptome profiling can be utilized for early detection and molecular classification of cancers. Accurate discrimination of functionally different...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-53

    authors: Krupp M,Maass T,Marquardt JU,Staib F,Bauer T,König R,Biesterfeld S,Galle PR,Tresch A,Teufel A

    更新日期:2011-06-30 00:00:00

  • wtest: an integrated R package for genetic epistasis testing.

    abstract:BACKGROUND:With the increasing amount of high-throughput genomic sequencing data, there is a growing demand for a robust and flexible tool to perform interaction analysis. The identification of SNP-SNP, SNP-CpG, and higher order interactions helps explain the genetic etiology of human diseases, yet genome-wide analysis...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0638-9

    authors: Sun R,Xia X,Chong KC,Zee BC,Wu WKK,Wang MH

    更新日期:2019-12-24 00:00:00

  • Overlap of expression quantitative trait loci (eQTL) in human brain and blood.

    abstract:BACKGROUND:Expression quantitative trait loci (eQTL) are genomic regions regulating RNA transcript expression levels. Genome-wide Association Studies (GWAS) have identified many variants, often in non-coding regions, with unknown functions and eQTL provide a possible mechanism by which these variants may influence obse...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-7-31

    authors: McKenzie M,Henders AK,Caracella A,Wray NR,Powell JE

    更新日期:2014-06-03 00:00:00

  • Transcriptome analysis reveals the link between lncRNA-mRNA co-expression network and tumor immune microenvironment and overall survival in head and neck squamous cell carcinoma.

    abstract:BACKGROUND:As the sixth most common cancer worldwide, head and neck squamous cell carcinoma (HNSCC) develops visceral metastases during the advanced stage of the disease and exhibits a low five-year survival rate. The importance of tumor microenvironment (TME) in tumor initiation and metastasis is widely recognized. In...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-0707-0

    authors: Zhong Z,Hong M,Chen X,Xi Y,Xu Y,Kong D,Deng J,Li Y,Hu R,Sun C,Liang J

    更新日期:2020-03-30 00:00:00

  • Developing a healthcare dataset information resource (DIR) based on Semantic Web.

    abstract:BACKGROUND:The right dataset is essential to obtain the right insights in data science; therefore, it is important for data scientists to have a good understanding of the availability of relevant datasets as well as the content, structure, and existing analyses of these datasets. While a number of efforts are underway ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0411-5

    authors: Shi J,Zheng M,Yao L,Ge Y

    更新日期:2018-11-20 00:00:00

  • Primary microcephaly case from the Karachay-Cherkess Republic poses an additional support for microcephaly and Seckel syndrome spectrum disorders.

    abstract:BACKGROUND:Primary microcephaly represents an example of clinically and genetically heterogeneous condition. Here we describe a case of primary microcephaly from the Karachay-Cherkess Republic, which was initially diagnosed with Seckel syndrome. CASE PRESENTATION:Clinical exome sequencing of the proband revealed a nov...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0326-1

    authors: Marakhonov AV,Konovalov FA,Makaov AK,Vasilyeva TA,Kadyshev VV,Galkina VA,Dadali EL,Kutsev SI,Zinchenko RA

    更新日期:2018-02-13 00:00:00

  • Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data.

    abstract:BACKGROUND:Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. Th...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-6-S1-S10

    authors: Teng S,Yang JY,Wang L

    更新日期:2013-01-01 00:00:00

  • Advancing research in NeuroAIDS using collaboration and public data sharing.

    abstract::In this issue of BMC Medical Genomics Griffin et al. present a user-friendly and freely accessible HIV-associated neurocognitive disorder (HAND) genomic database that compiles viral (HIV-1) genetic sequences and other relevant clinical and treatment data. We discuss the benefits and caveats of public data sharing in N...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0150-9

    authors: Cysique LA

    更新日期:2015-11-11 00:00:00

  • Chronic insulin treatment of diabetes does not fully normalize alterations in the retinal transcriptome.

    abstract:BACKGROUND:Diabetic retinopathy (DR) is a leading cause of blindness in working age adults. Approximately 95% of patients with Type 1 diabetes develop some degree of retinopathy within 25 years of diagnosis despite normalization of blood glucose by insulin therapy. The goal of this study was to identify molecular chang...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-40

    authors: Bixler GV,Vanguilder HD,Brucklacher RM,Kimball SR,Bronson SK,Freeman WM

    更新日期:2011-05-15 00:00:00

  • DNA methylation differences at growth related genes correlate with birth weight: a molecular signature linked to developmental origins of adult disease?

    abstract:BACKGROUND:Infant birth weight is a complex quantitative trait associated with both neonatal and long-term health outcomes. Numerous studies have been published in which candidate genes (IGF1, IGF2, IGF2R, IGF binding proteins, PHLDA2 and PLAGL1) have been associated with birth weight, but these studies are difficult t...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-10

    authors: Turan N,Ghalwash MF,Katari S,Coutifaris C,Obradovic Z,Sapienza C

    更新日期:2012-04-12 00:00:00

  • Reverse-engineering of gene networks for regulating early blood development from single-cell measurements.

    abstract:BACKGROUND:Recent advances in omics technologies have raised great opportunities to study large-scale regulatory networks inside the cell. In addition, single-cell experiments have measured the gene and protein activities in a large number of cells under the same experimental conditions. However, a significant challeng...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0312-z

    authors: Wei J,Hu X,Zou X,Tian T

    更新日期:2017-12-28 00:00:00

  • Clinical utility of the low-density Infinium QC genotyping Array in a genomics-based diagnostics laboratory.

    abstract:BACKGROUND:With 15,949 markers, the low-density Infinium QC Array-24 BeadChip enables linkage analysis, HLA haplotyping, fingerprinting, ethnicity determination, mitochondrial genome variations, blood groups and pharmacogenomics. It represents an attractive independent QC option for NGS-based diagnostic laboratories, a...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0297-7

    authors: Ponomarenko P,Ryutov A,Maglinte DT,Baranova A,Tatarinova TV,Gai X

    更新日期:2017-10-06 00:00:00

  • DNA methylation differences in monozygotic twin pairs discordant for schizophrenia identifies psychosis related genes and networks.

    abstract:BACKGROUND:Despite their singular origin, monozygotic twin pairs often display discordance for complex disorders including schizophrenia. It is a common (1%) and often familial disease with a discordance rate of ~50% in monozygotic twins. This high discordance is often explained by the role of yet unknown environmental...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0093-1

    authors: Castellani CA,Laufer BI,Melka MG,Diehl EJ,O'Reilly RL,Singh SM

    更新日期:2015-05-06 00:00:00

  • 12q14 microduplication: a new clinical entity reciprocal to the microdeletion syndrome?

    abstract:BACKGROUND:12q14 microdeletion syndrome is characterized by low birth weight and failure to thrive, proportionate short stature and developmental delay. The opposite syndrome (microduplication) has not yet been characterized. Our main objective is the recognition of a new clinical entity - 12q14 microduplication syndro...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0653-x

    authors: Dória S,Alves D,Pinho MJ,Pinto J,Leão M

    更新日期:2020-01-03 00:00:00

  • Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs.

    abstract:BACKGROUND:Host protein-protein interaction networks are altered by invading virus proteins, which create new interactions, and modify or destroy others. The resulting network topology favors excessive amounts of virus production in a stressed host cell network. Short linear peptide motifs common to both virus and host...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-2-27

    authors: Evans P,Dampier W,Ungar L,Tozeren A

    更新日期:2009-05-18 00:00:00

  • Identification of exon skipping events associated with Alzheimer's disease in the human hippocampus.

    abstract:BACKGROUND:At least 90% of human genes are alternatively spliced. Alternative splicing has an important function regulating gene expression and miss-splicing can contribute to risk for human diseases, including Alzheimer's disease (AD). METHODS:We developed a splicing decision model as a molecular mechanism to identif...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0453-8

    authors: Han S,Miller JE,Byun S,Kim D,Risacher SL,Saykin AJ,Lee Y,Nho K,for Alzheimer’s Disease Neuroimaging Initiative.

    更新日期:2019-01-31 00:00:00

  • The International Conference on Intelligent Biology and Medicine (ICIBM) 2018: genomics meets medicine.

    abstract::During June 10-12, 2018, the International Conference on Intelligent Biology and Medicine (ICIBM 2018) was held in Los Angeles, California, USA. The conference included 11 scientific sessions, four tutorials, one poster session, four keynote talks and four eminent scholar talks that covered a wide range of topics rang...

    journal_title:BMC medical genomics

    pub_type: 社论

    doi:10.1186/s12920-018-0448-5

    authors: Zhi D,Zhao Z,Li F,Wu Z,Liu X,Wang K

    更新日期:2019-01-31 00:00:00

  • Biological processes, properties and molecular wiring diagrams of candidate low-penetrance breast cancer susceptibility genes.

    abstract:BACKGROUND:Recent advances in whole-genome association studies (WGASs) for human cancer risk are beginning to provide the part lists of low-penetrance susceptibility genes. However, statistical analysis in these studies is complicated by the vast number of genetic variants examined and the weak effects observed, as a r...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-62

    authors: Bonifaci N,Berenguer A,Díez J,Reina O,Medina I,Dopazo J,Moreno V,Pujana MA

    更新日期:2008-12-18 00:00:00

  • Transcriptomic signatures in whole blood of patients who acquire a chronic inflammatory response syndrome (CIRS) following an exposure to the marine toxin ciguatoxin.

    abstract:BACKGROUND:Ciguatoxins (CTXs) are polyether marine neurotoxins found in multiple reef-fish species and are potent activators of voltage-gated sodium channels. It is estimated that up to 500,000 people annually experience acute ciguatera poisoning from consuming toxic fish and a small percentage of these victims will de...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0089-x

    authors: Ryan JC,Wu Q,Shoemaker RC

    更新日期:2015-04-02 00:00:00

  • Defining "mutation" and "polymorphism" in the era of personal genomics.

    abstract:BACKGROUND:The growing advances in DNA sequencing tools have made analyzing the human genome cheaper and faster. While such analyses are intended to identify complex variants, related to disease susceptibility and efficacy of drug responses, they have blurred the definitions of mutation and polymorphism. DISCUSSION:In...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0115-z

    authors: Karki R,Pandya D,Elston RC,Ferlini C

    更新日期:2015-07-15 00:00:00

  • Identification of Chiari Type I Malformation subtypes using whole genome expression profiles and cranial base morphometrics.

    abstract:BACKGROUND:Chiari Type I Malformation (CMI) is characterized by herniation of the cerebellar tonsils through the foramen magnum at the base of the skull, resulting in significant neurologic morbidity. As CMI patients display a high degree of clinical variability and multiple mechanisms have been proposed for tonsillar ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-7-39

    authors: Markunas CA,Lock E,Soldano K,Cope H,Ding CK,Enterline DS,Grant G,Fuchs H,Ashley-Koch AE,Gregory SG

    更新日期:2014-06-25 00:00:00

  • Loss of heterozygosity: what is it good for?

    abstract:BACKGROUND:Loss of heterozygosity (LOH) is a common genetic event in cancer development, and is known to be involved in the somatic loss of wild-type alleles in many inherited cancer syndromes. The wider involvement of LOH in cancer is assumed to relate to unmasking a somatically mutated tumour suppressor gene through ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0123-z

    authors: Ryland GL,Doyle MA,Goode D,Boyle SE,Choong DY,Rowley SM,Li J,Australian Ovarian Cancer Study Group.,Bowtell DD,Tothill RW,Campbell IG,Gorringe KL

    更新日期:2015-08-01 00:00:00

  • Comparative gene expression profiling analysis of urothelial carcinoma of the renal pelvis and bladder.

    abstract:BACKGROUND:Urothelial carcinoma (UC) can arise at any location along the urothelial tract, including the urethra, bladder, ureter, or renal pelvis. Although tumors arising in these various locations have similar morphology, it is unclear whether the gene expression profiles are similar between the upper-tract (ureter a...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-3-58

    authors: Zhang Z,Furge KA,Yang XJ,Teh BT,Hansel DE

    更新日期:2010-12-15 00:00:00

  • Phenotype-driven gene prioritization for rare diseases using graph convolution on heterogeneous networks.

    abstract:BACKGROUND:One of the major goals of genomic medicine is the identification of causal genomic variants in a patient and their relation to the observed clinical phenotypes. Prioritizing the genomic variants by considering only the genotype information usually identifies a few hundred potential variants. Narrowing it dow...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0372-8

    authors: Rao A,Vg S,Joseph T,Kotte S,Sivadasan N,Srinivasan R

    更新日期:2018-07-06 00:00:00

  • Screening significantly hypermethylated genes in fetal tissues compared with maternal blood using a methylated-CpG island recovery assay-based microarray.

    abstract:BACKGROUND:The noninvasive prenatal diagnosis procedures that are currently used to detect genetic diseases do not achieve desirable levels of sensitivity and specificity. Recently, fetal methylated DNA biomarkers in maternal peripheral blood have been explored for the noninvasive prenatal detection of genetic disorder...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-26

    authors: Yin A,Zhang X,Wu J,Du L,He T,Zhang X

    更新日期:2012-06-18 00:00:00