A survey on single and multi omics data mining methods in cancer data classification.

Abstract:

:Data analytics is routinely used to support biomedical research in all areas, with particular focus on the most relevant clinical conditions, such as cancer. Bioinformatics approaches, in particular, have been used to characterize the molecular aspects of diseases. In recent years, numerous studies have been performed on cancer based upon single and multi-omics data. For example, Single-omics-based studies have employed a diverse set of data, such as gene expression, DNA methylation, or miRNA, to name only a few instances. Despite that, a significant part of literature reports studies on gene expression with microarray datasets. Single-omics data have high numbers of attributes and very low sample counts. This characteristic makes them paradigmatic of an under-sampled, small-n large-p machine learning problem. An important goal of single-omics data analysis is to find the most relevant genes, in terms of their potential use in clinics and research, in the batch of available data. This problem has been addressed in gene selection as one of the pre-processing steps in data mining. An analysis that use only one type of data (single-omics) often miss the complexity of the landscape of molecular phenomena underlying the disease. As a result, they provide limited and sometimes poorly reliable information about the disease mechanisms. Therefore, in recent years, researchers have been eager to build models that are more complex, obtaining more reliable results using multi-omics data. However, to achieve this, the most important challenge is data integration. In this paper, we provide a comprehensive overview of the challenges in single and multi-omics data analysis of cancer data, focusing on gene selection and data integration methods.

journal_name

J Biomed Inform

authors

Momeni Z,Hassanzadeh E,Saniee Abadeh M,Bellazzi R

doi

10.1016/j.jbi.2020.103466

subject

Has Abstract

pub_date

2020-07-01 00:00:00

pages

103466

eissn

1532-0464

issn

1532-0480

pii

S1532-0464(20)30093-9

journal_volume

107

pub_type

杂志文章,评审
  • A comprehensive review of feature based methods for drug target interaction prediction.

    abstract::Drug target interaction is a prominent research area in the field of drug discovery. It refers to the recognition of interactions between chemical compounds and the protein targets in the human body. Wet lab experiments to identify these interactions are expensive as well as time consuming. The computational methods o...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2019.103159

    authors: Sachdev K,Gupta MK

    更新日期:2019-05-01 00:00:00

  • Health information technology adoption: Understanding research protocols and outcome measurements for IT interventions in health care.

    abstract:OBJECTIVE:To classify and characterize the variables commonly used to measure the impact of Information Technology (IT) adoption in health care, as well as settings and IT interventions tested, and to guide future research. MATERIALS AND METHODS:We conducted a descriptive study screening a sample of 236 studies from a...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2016.07.018

    authors: Colicchio TK,Facelli JC,Del Fiol G,Scammon DL,Bowes WA 3rd,Narus SP

    更新日期:2016-10-01 00:00:00

  • A Bayesian system to detect and characterize overlapping outbreaks.

    abstract::Outbreaks of infectious diseases such as influenza are a significant threat to human health. Because there are different strains of influenza which can cause independent outbreaks, and influenza can affect demographic groups at different rates and times, there is a need to recognize and characterize multiple outbreaks...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.08.003

    authors: Aronis JM,Millett NE,Wagner MM,Tsui F,Ye Y,Ferraro JP,Haug PJ,Gesteland PH,Cooper GF

    更新日期:2017-09-01 00:00:00

  • Temporal phenotyping of medically complex children via PARAFAC2 tensor factorization.

    abstract:OBJECTIVE:Our aim is to extract clinically-meaningful phenotypes from longitudinal electronic health records (EHRs) of medically-complex children. This is a fragile set of patients consuming a disproportionate amount of pediatric care resources but who often end up with sub-optimal clinical outcome. The rise in availab...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103125

    authors: Perros I,Papalexakis EE,Vuduc R,Searles E,Sun J

    更新日期:2019-05-01 00:00:00

  • Evaluating warfarin dosing models on multiple datasets with a novel software framework and evolutionary optimisation.

    abstract::Warfarin is an effective preventative treatment for arterial and venous thromboembolism, but requires individualised dosing due to its narrow therapeutic range and high individual variation. Many machine learning techniques have been demonstrated in this domain. This study evaluated the accuracy of the most promising ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103634

    authors: Truda G,Marais P

    更新日期:2021-01-01 00:00:00

  • Extending the Fellegi-Sunter probabilistic record linkage method for approximate field comparators.

    abstract::Probabilistic record linkage is a method commonly used to determine whether demographic records refer to the same person. The Fellegi-Sunter method is a probabilistic approach that uses field weights based on log likelihood ratios to determine record similarity. This paper introduces an extension of the Fellegi-Sunter...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2009.08.004

    authors: DuVall SL,Kerber RA,Thomas A

    更新日期:2010-02-01 00:00:00

  • A Hidden Semi-Markov Model based approach for rehabilitation exercise assessment.

    abstract::In this paper, a Hidden Semi-Markov Model (HSMM) based approach is proposed to evaluate and monitor body motion during a rehabilitation training program. The approach extracts clinically relevant motion features from skeleton joint trajectories, acquired by the RGB-D camera, and provides a score for the subject's perf...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2017.12.012

    authors: Capecci M,Ceravolo MG,Ferracuti F,Iarlori S,Kyrki V,Monteriù A,Romeo L,Verdini F

    更新日期:2018-02-01 00:00:00

  • Creating hospital-specific customized clinical pathways by applying semantic reasoning to clinical data.

    abstract:OBJECTIVE:Clinical pathways (CPs) are widely studied methods to standardize clinical intervention and improve medical quality. However, standard care plans defined in current CPs are too general to execute in a practical healthcare environment. The purpose of this study was to create hospital-specific personalized CPs ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2014.07.017

    authors: Wang HQ,Zhou TS,Tian LL,Qian YM,Li JS

    更新日期:2014-12-01 00:00:00

  • Interacting agents through a web-based health serviceflow management system.

    abstract::The management of chronic and out-patients is a complex process which requires the cooperation of different agents belonging to several organizational units. Patients have to move to different locations to access the necessary services and to communicate their health status data. From their point of view there should ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2006.12.002

    authors: Leonardi G,Panzarasa S,Quaglini S,Stefanelli M,van der Aalst WM

    更新日期:2007-10-01 00:00:00

  • Classification models for the prediction of clinicians' information needs.

    abstract:OBJECTIVE:Clinicians face numerous information needs during patient care activities and most of these needs are not met. Infobuttons are information retrieval tools that help clinicians to fulfill their information needs by providing links to on-line health information resources from within an electronic medical record...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.07.001

    authors: Del Fiol G,Haug PJ

    更新日期:2009-02-01 00:00:00

  • Developing, implementing, and evaluating decision support systems for shared decision making in patient care: a conceptual model and case illustration.

    abstract::The importance of including patient preferences in decisions regarding their care has received increased emphasis over recent years. Medical informatics can play an important role in improving patient-centered care by developing decision support systems to support the inclusion of patient preferences in clinical decis...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(03)00037-6

    authors: Ruland CM,Bakken S

    更新日期:2002-10-01 00:00:00

  • Finding disease similarity based on implicit semantic similarity.

    abstract::Genomics has contributed to a growing collection of gene-function and gene-disease annotations that can be exploited by informatics to study similarity between diseases. This can yield insight into disease etiology, reveal common pathophysiology and/or suggest treatment that can be appropriated from one disease to ano...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2011.11.017

    authors: Mathur S,Dinakarpandian D

    更新日期:2012-04-01 00:00:00

  • Patient-specific learning in real time for adaptive monitoring in critical care.

    abstract::Intensive care monitoring systems are typically developed from population data, but do not take into account the variability among individual patients' characteristics. This study develops patient-specific alarm algorithms in real time. Classification tree and neural network learning were carried out in batch mode on ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.03.011

    authors: Zhang Y,Szolovits P

    更新日期:2008-06-01 00:00:00

  • A new framework for the selection of tag SNPs by multimarker haplotypes.

    abstract::This paper proposes a new framework for the selection of tag SNPs based on haplotypes instead of on a single SNP. The tag SNPs found by this framework form a set of haplotypes completely predictive of the alleles of all untyped SNPs. We refer to this problem as MTMH, which is defined as follows: given a set of SNPs, f...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.04.003

    authors: Huang YT,Chao KM

    更新日期:2008-12-01 00:00:00

  • MorphoCol: An ontology-based knowledgebase for the characterisation of clinically significant bacterial colony morphologies.

    abstract:BACKGROUND:One of the major concerns of the biomedical community is the increasing prevalence of antimicrobial resistant microorganisms. Recent findings show that the diversification of colony morphology may be indicative of the expression of virulence factors and increased resistance to antibiotic therapeutics. To tra...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.03.007

    authors: Sousa AM,Pereira MO,Lourenço A

    更新日期:2015-06-01 00:00:00

  • Visual aggregate analysis of eligibility features of clinical trials.

    abstract:OBJECTIVE:To develop a method for profiling the collective populations targeted for recruitment by multiple clinical studies addressing the same medical condition using one eligibility feature each time. METHODS:Using a previously published database COMPACT as the backend, we designed a scalable method for visual aggr...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.01.005

    authors: He Z,Carini S,Sim I,Weng C

    更新日期:2015-04-01 00:00:00

  • Algorithms for rapid outbreak detection: a research synthesis.

    abstract::The threat of bioterrorism has stimulated interest in enhancing public health surveillance to detect disease outbreaks more rapidly than is currently possible. To advance research on improving the timeliness of outbreak detection, the Defense Advanced Research Project Agency sponsored the Bio-event Advanced Leading In...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2004.11.007

    authors: Buckeridge DL,Burkom H,Campbell M,Hogan WR,Moore AW

    更新日期:2005-04-01 00:00:00

  • Hierarchical data security in a Query-By-Example interface for a shared database.

    abstract::Whenever a shared database resource, containing critical patient data, is created, protecting the contents of the database is a high priority goal. This goal can be achieved by developing a Query-By-Example (QBE) interface, designed to access a shared database, and embedding within the QBE a hierarchical security modu...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/s1532-0464(02)00524-5

    authors: Taylor M

    更新日期:2002-06-01 00:00:00

  • Selecting significant genes by randomization test for cancer classification using gene expression data.

    abstract::Gene selection is an important task in bioinformatics studies, because the accuracy of cancer classification generally depends upon the genes that have biological relevance to the classifying problems. In this work, randomization test (RT) is used as a gene selection method for dealing with gene expression data. In th...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2013.03.009

    authors: Mao Z,Cai W,Shao X

    更新日期:2013-08-01 00:00:00

  • A novel method to estimate the indirect community benefit of HIV interventions using a microsimulation model of HIV disease.

    abstract:BACKGROUND:Microsimulation models of human immunodeficiency virus (HIV) disease that simulate individual patients one at a time and assess clinical and economic outcomes of HIV interventions often provide key details regarding direct individual clinical benefits ("individual benefit"), but they may lack detail on trans...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103475

    authors: Kazemian P,Costantini S,Neilan AM,Resch SC,Walensky RP,Weinstein MC,Freedberg KA

    更新日期:2020-07-01 00:00:00

  • Comparing image search behaviour in the ARRS GoldMiner search engine and a clinical PACS/RIS.

    abstract::Information search has changed the way we manage knowledge and the ubiquity of information access has made search a frequent activity, whether via Internet search engines or increasingly via mobile devices. Medical information search is in this respect no different and much research has been devoted to analyzing the w...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.04.013

    authors: De-Arteaga M,Eggel I,Do B,Rubin D,Kahn CE Jr,Müller H

    更新日期:2015-08-01 00:00:00

  • A medical treatment based scoring model to detect abusive institutions.

    abstract::Medical abuse refers to a type of abnormal medical practice which is not in compliance with qualitative or ethical standards, such as excessive prescription or overbilling of medical services. Detection of such medical abuses is crucial, especially for the patients and insurance providers, because they become subject ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2020.103423

    authors: Lee J,Shin H,Cho S

    更新日期:2020-07-01 00:00:00

  • Genome-wide analysis of multi-view data of miRNA-seq to identify miRNA biomarkers for stomach cancer.

    abstract::Stomach cancer is one of the leading causes of cancer-related deaths worldwide. More than 80% diagnosis of this cancer occur at later stages leading to low 5-year survival rate. This emphasizes the need to have better prognostic techniques for stomach cancer. In this regard, the Next-Generation Sequencing of whole gen...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103254

    authors: Pant N,Rakshit S,Paul S,Saha I

    更新日期:2019-09-01 00:00:00

  • Research-IQ: development and evaluation of an ontology-anchored integrative query tool.

    abstract::Investigators in the translational research and systems medicine domains require highly usable, efficient and integrative tools and methods that allow for the navigation of and reasoning over emerging large-scale data sets. Such resources must cover a spectrum of granularity from bio-molecules to population phenotypes...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2011.07.006

    authors: Borlawsky TB,Lele O,Payne PR

    更新日期:2011-12-01 00:00:00

  • DiseaSE: A biomedical text analytics system for disease symptom extraction and characterization.

    abstract::Due to increasing volume and unstructured nature of the scientific literatures in biomedical domain, most of the information embedded within them remain untapped. This paper presents a biomedical text analytics system, DiseaSE (Disease Symptom Extraction), to identify and extract disease symptoms and their association...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2019.103324

    authors: Abulaish M,Parwez MA,Jahiruddin

    更新日期:2019-12-01 00:00:00

  • An image score inference system for RNAi genome-wide screening based on fuzzy mixture regression modeling.

    abstract::With recent advances in fluorescence microscopy imaging techniques and methods of gene knock down by RNA interference (RNAi), genome-scale high-content screening (HCS) has emerged as a powerful approach to systematically identify all parts of complex biological processes. However, a critical barrier preventing fulfill...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2008.04.007

    authors: Wang J,Zhou X,Li F,Bradley PL,Chang SF,Perrimon N,Wong ST

    更新日期:2009-02-01 00:00:00

  • An evaluation of clinical order patterns machine-learned from clinician cohorts stratified by patient mortality outcomes.

    abstract:OBJECTIVE:Evaluate the quality of clinical order practice patterns machine-learned from clinician cohorts stratified by patient mortality outcomes. MATERIALS AND METHODS:Inpatient electronic health records from 2010 to 2013 were extracted from a tertiary academic hospital. Clinicians (n = 1822) were stratified into lo...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2018.09.005

    authors: Wang JK,Hom J,Balasubramanian S,Schuler A,Shah NH,Goldstein MK,Baiocchi MTM,Chen JH

    更新日期:2018-10-01 00:00:00

  • BioLattice: a framework for the biological interpretation of microarray gene expression data using concept lattice analysis.

    abstract:MOTIVATION:A challenge in microarray data analysis is to interpret observed changes in terms of biological properties and relationships. One powerful approach is to make associations of gene expression clusters with biomedical ontologies and/or biological pathways. However, this approach evaluates only one cluster at a...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2007.10.003

    authors: Kim J,Chung HJ,Jung Y,Kim KK,Kim JH

    更新日期:2008-04-01 00:00:00

  • Homology assessment and molecular sequence alignment.

    abstract::Hypotheses of homology are the basis of phylogenetic analysis. All character data are considered to be equivalent regardless of the source of those characters. Putative homology statements are designated based on observations of similarity. Pairwise sequence alignment using the Needleman-Wunsch algorithm is the basis ...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章,评审

    doi:10.1016/j.jbi.2005.11.005

    authors: Phillips AJ

    更新日期:2006-02-01 00:00:00

  • Unstructured medical image query using big data - An epilepsy case study.

    abstract::Big data technologies are critical to the medical field which requires new frameworks to leverage them. Such frameworks would benefit medical experts to test hypotheses by querying huge volumes of unstructured medical data to provide better patient care. The objective of this work is to implement and examine the feasi...

    journal_title:Journal of biomedical informatics

    pub_type: 杂志文章

    doi:10.1016/j.jbi.2015.12.005

    authors: Istephan S,Siadat MR

    更新日期:2016-02-01 00:00:00