A class imbalance-aware Relief algorithm for the classification of tumors using microarray gene expression data.

Abstract:

:DNA microarray data has been widely used in cancer research due to the significant advantage helped to successfully distinguish between tumor classes. However, typical gene expression data usually presents a high-dimensional imbalanced characteristic, which poses severe challenge for traditional machine learning methods to construct a robust classifier performing well on both the minority and majority classes. As one of the most successful feature weighting techniques, Relief is considered to particularly suit to handle high-dimensional problems. Unfortunately, almost all relief-based methods have not taken the class imbalance distribution into account. This study identifies that existing Relief-based algorithms may underestimate the features with the discernibility ability of minority classes, and ignore the distribution characteristic of minority class samples. As a result, an additional bias towards being classified into the majority classes can be introduced. To this end, a new method, named imRelief, is proposed for efficiently handling high-dimensional imbalanced gene expression data. imRelief can correct the bias towards to the majority classes, and consider the scattered distributional characteristic of minority class samples in the process of estimating feature weights. This way, imRelief has the ability to reward the features which perform well at separating the minority classes from other classes. Experiments on four microarray gene expression data sets demonstrate the effectiveness of imRelief in both feature weighting and feature subset selection applications.

journal_name

Comput Biol Chem

authors

He Y,Zhou J,Lin Y,Zhu T

doi

10.1016/j.compbiolchem.2019.03.017

subject

Has Abstract

pub_date

2019-06-01 00:00:00

pages

121-127

eissn

1476-9271

issn

1476-928X

pii

S1476-9271(19)30193-8

journal_volume

80

pub_type

杂志文章
  • Tabu search algorithm for DNA sequencing by hybridization with isothermic libraries.

    abstract::In this paper, a problem of isothermic DNA sequencing by hybridization (SBH) is considered. In isothermic SBH a new type of oligonucleotide libraries is used. The library consists of oligonucleotides of different lengths depending on an oligonucleotide content. It is assumed that every oligonucleotide in such a librar...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2003.12.002

    authors: Błazewicz J,Formanowicz P,Kasprzak M,Markiewicz WT,Swiercz A

    更新日期:2004-02-01 00:00:00

  • In silico analyses of a new group of fungal and plant RecQ4-homologous proteins.

    abstract::Bacterial and eukaryotic RecQ helicases comprise a family of homologous proteins necessary for maintaining genomic integrity during the cell cycle and DNA repair. There is one known bacterial RecQ helicase, and five eukaryotic RecQ helicases that have been described: RecQ1p, RecQ4p, RecQ5p, Bloom, and Werner. While th...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2008.07.005

    authors: Barea F,Tessaro S,Bonatto D

    更新日期:2008-10-01 00:00:00

  • Zooming-in on cancer metabolic rewiring with tissue specific constraint-based models.

    abstract::The metabolic rearrangements occurring in cancer cells can be effectively investigated with a Systems Biology approach supported by metabolic network modeling. We here present tissue-specific constraint-based core models for three different types of tumors (liver, breast and lung) that serve this purpose. The core mod...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.03.002

    authors: Di Filippo M,Colombo R,Damiani C,Pescini D,Gaglio D,Vanoni M,Alberghina L,Mauri G

    更新日期:2016-06-01 00:00:00

  • Interactions of 2-phenyl-benzotriazole xenobiotic compounds with human Cytochrome P450-CYP1A1 by means of docking, molecular dynamics simulations and MM-GBSA calculations.

    abstract::2-phenyl-benzotriazole xenobiotic compounds (PBTA-4, PBTA-6, PBTA-7 and PBTA-8) that were previously isolated and identified in waters of the Yodo river, in Japan (Nukaya et al., 2001; Ohe et al., 2004; Watanabe et al., 2001) were characterized as powerful pro-mutagens. In order to predict the activation mechanism of ...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2018.04.004

    authors: Mena-Ulecia K,MacLeod-Carey D

    更新日期:2018-06-01 00:00:00

  • Circular code motifs in the ribosome decoding center.

    abstract::A translation (framing) code based on the circular code was proposed in Michel (2012) with the identification of X circular code motifs (X motifs shortly) in the bacterial rRNA of Thermus thermophilus, in particular in the ribosome decoding center. Three classes of X motifs are now identified in the rRNAs of bacteria ...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2014.08.001

    authors: El Soufi K,Michel CJ

    更新日期:2014-10-01 00:00:00

  • Genome-wide predicting disease-related protein complexes by walking on the heterogeneous network based on data integration and laplacian normalization.

    abstract:BACKGROUND:Associating protein complexes to human inherited diseases is critical for better understanding of biological processes and functional mechanisms of the disease. Many protein complexes have been identified and functionally annotated by computational and purification methods so far, however, the particular rol...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.04.007

    authors: Liu Z,Luo J

    更新日期:2017-08-01 00:00:00

  • Interaction of zervamicin IIB with lipid bilayers. Molecular dynamics study.

    abstract::In this work we have studied the interaction of zervamicin IIB (ZrvIIB) with the model membranes of eukaryotes and prokaryotes using all-atom molecular dynamics. In all our simulations zervamicin molecule interacted only with lipid headgroups but did not penetrate the hydrophobic core of the bilayers. During the inter...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2010.12.005

    authors: Levtsova OV,Antonov MY,Naumenkova TV,Sokolova OS

    更新日期:2011-02-01 00:00:00

  • Hybrid docking-QSAR studies of DPP-IV inhibition activities of a series of aminomethyl-piperidones.

    abstract::In this study, the dipeptidyl peptidase-IV (DPP-IV) inhibition activities of a series of novel aminomethyl-piperidones were investigated by molecular docking studies and modeled by quantitative structure-activity relationship (QSAR) methodology. Molecular docking studies were used to find the best conformations of the...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.08.003

    authors: Amini Z,Fatemi MH,Gharaghani S

    更新日期:2016-10-01 00:00:00

  • Theoretical studies and NMR assay of coumarins and neoflavanones derivatives as potential inhibitors of acetylcholinesterase.

    abstract::Currently Alzheimer's disease (AD) is a devastating neurological disorder that mainly affects the elderly. The treatment of AD has as main objective to increase the levels of ACh in the synaptic cleft by inhibiting the cholinesterase enzymes, which are responsible for the degradation of ACh. Twenty one synthesized cou...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107293

    authors: de Souza LG,Moraes PF,Leão RAC,Costa PRR,Soares RO,Pascutti PG,Figueroa-Villar JD,Rennó MN

    更新日期:2020-05-29 00:00:00

  • Reprint of "Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction".

    abstract::With advances in genomics, transcriptomics, metabolomics and proteomics, and more expansive electronic clinical record monitoring, as well as advances in computation, we have entered the Big Data era in biomedical research. Data gathering is growing rapidly while only a small fraction of this data is converted to usef...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章,评审

    doi:10.1016/j.compbiolchem.2015.08.005

    authors: Rouillard AD,Wang Z,Ma'ayan A

    更新日期:2015-12-01 00:00:00

  • ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering.

    abstract::Accurate clustering of cells from single-cell RNA sequencing (scRNA-seq) data is an essential step for biological analysis such as putative cell type identification. However, scRNA-seq data has high dimension and high sparsity, which makes traditional clustering methods less effective to reflect the similarity between...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107415

    authors: Li J,Jiang W,Han H,Liu J,Liu B,Wang Y

    更新日期:2020-11-18 00:00:00

  • Why does beta-secretase zymogen possess catalytic activity? Molecular modeling and molecular dynamics simulation studies.

    abstract::Beta-secretase is a potential target for inhibitory drugs against Alzheimer's disease as it cleaves amyloid precursor protein (APP) to form insoluble amyloid plaques and vascular deposits in the brain. Beta-secretase is matured from its precursor protein, called beta-secretase zymogen, which, different from most of ot...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2007.03.007

    authors: Zuo Z,Gang C,Zou H,Mok PC,Zhu W,Chen K,Jiang H

    更新日期:2007-06-01 00:00:00

  • Chemical reaction optimization for solving shortest common supersequence problem.

    abstract::Shortest common supersequence (SCS) is a classical NP-hard problem, where a string to be constructed that is the supersequence of a given string set. The SCS problem has an enormous application of data compression, query optimization in the database and different bioinformatics activities. Due to NP-hardness, the exac...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.05.004

    authors: Khaled Saifullah CM,Rafiqul Islam M

    更新日期:2016-10-01 00:00:00

  • In silico identification of novel IL-1β inhibitors to target protein-protein interfaces.

    abstract::Interleukin-1β is a drug target in rheumatoid arthritis and several auto-immune disorders. In this study, a set of 48 compounds with the determined IC50 values were used for QSAR analysis by MOE. The QSAR model was developed by using training set of 41 compounds, based on 12 unique descriptors. Model was validated by ...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2015.06.004

    authors: Halim SA,Jawad M,Ilyas M,Mir Z,Mirza AA,Husnain T

    更新日期:2015-10-01 00:00:00

  • In silico structural and functional analysis of Mesorhizobium ACC deaminase.

    abstract::Nodulation is one of the very important processes of legume plants as it is the initiating event of fixing nitrogen. Although ethylene has essential role in normal plant metabolism but it has also negative impact on plants particularly in nodule formation in legume plants. It is also produced due to a variety of bioti...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.02.005

    authors: Pramanik K,Soren T,Mitra S,Maiti TK

    更新日期:2017-06-01 00:00:00

  • Analysis of compensatory substitution and gene evolution on the MAGEA/CSAG-palindrome of the primate X chromosomes.

    abstract::The human X chromosome contains a large number of inverted repeat DNA palindromes. Although arbitrary substitutions destroyed the inverted repeat structure of MAGEA/CSAG-palindrome during the evolutionary process of the primates, most of the substitutions are compensatory. Using maximum parsimony, it is demonstrated t...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2012.11.002

    authors: Qi Y,Lu H,Ai D

    更新日期:2013-02-01 00:00:00

  • Dynamics of p53 and Wnt cross talk.

    abstract::We present the mechanism of interaction of Wnt network module, which is responsible for periodic somitogenesis, with p53 regulatory network, which is one of the main regulators of various cellular functions, and switching of various oscillating states by investigating p53-Wnt model. The variation in Nutlin concentrati...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2015.07.014

    authors: Zubbair Malik M,Ali S,Alam MJ,Ishrat R,Brojen Singh RK

    更新日期:2015-12-01 00:00:00

  • Phosphorylation mapping of laminin α1-chain: Kinases in association with active sites.

    abstract::Laminin-111 is a trimeric glycoprotein of the extracellular matrix (ECM) that holds a significant role in cell adhesion, migration and differentiation. Laminin-111 is the most studied laminin isoform, composed of three chains; α1, β1 and γ1. Phosphorylation is the most common eukaryotic post - translational modificati...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.04.012

    authors: Galliou PA,Verrou KM,Koliakos G

    更新日期:2019-06-01 00:00:00

  • Exploiting the performance of dictionary-based bio-entity name recognition in biomedical literature.

    abstract::Bio-entity name recognition is the key step for information extraction from biomedical literature. This paper presents a dictionary-based bio-entity name recognition approach. The approach expands the bio-entity name dictionary via the Abbreviation Definitions identifying algorithm, improves the recall rate through th...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2008.03.008

    authors: Yang Z,Lin H,Li Y

    更新日期:2008-08-01 00:00:00

  • A benchmark of optimally folded protein structures using integer programming and the 3D-HP-SC model.

    abstract::The Protein Structure Prediction (PSP) problem comprises, among other issues, forecasting the three-dimensional native structure of proteins using only their primary structure information. Most computational studies in this area use synthetic data instead of real biological data. However, the closer to the real-world,...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.107192

    authors: Hattori LT,Gutoski M,Vargas Benítez CM,Nunes LF,Lopes HS

    更新日期:2020-02-01 00:00:00

  • Prediction and verification of microRNAs related to proline accumulation under drought stress in potato.

    abstract::Proline is an important osmotic adjusting material greatly accumulated under drought stress and can help plant to adapt to osmotic stress. MicroRNAs (miRNAs) are small, endogenous RNAs that play important regulatory roles in plant development and stress response by negatively affecting gene expression at post-transcri...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2013.04.006

    authors: Yang J,Zhang N,Ma C,Qu Y,Si H,Wang D

    更新日期:2013-10-01 00:00:00

  • DFT and QTAIM based investigation on the structure and antioxidant behavior of lichen substances Atranorin, Evernic acid and Diffractaic acid.

    abstract::In this study, the structural and antioxidant behavior of the three lichen-derived natural compounds such as atranorin (AT), evernic acid (EV) and diffractaic acid (DF) has been investigated in the gas and water phase using both B3LYP and M06-2X functional level of density functional theory (DFT) with two different ba...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.03.009

    authors: Shameera Ahamed TK,Rajan VK,Sabira K,Muraleedharan K

    更新日期:2019-06-01 00:00:00

  • Some steps toward a central theory of ecosystem dynamics.

    abstract::Ecology is said by many to suffer for want of a central theory, such as Newton's laws of motion provide for classical mechanics or Schroedinger's wave equation provides for quantum physics. From among a plurality of contending laws to govern ecosystem behavior, the principle of increasing ascendency shows some early p...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章,评审

    doi:10.1016/s1476-9271(03)00050-1

    authors: Ulanowicz RE

    更新日期:2003-12-01 00:00:00

  • Dynamic characterization of HLA-B*44 Alleles: A comparative molecular dynamics simulation study.

    abstract::Human Leukocyte Antigens (HLA) are highly polymorphic proteins that play a key role in the immune system. HLA molecule is present on the cell membrane of antigen-presenting cells of the immune system and presents short peptides, originating from the proteins of invading pathogens or self-proteins, to the T-cell Recept...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.02.019

    authors: Ozbek P

    更新日期:2016-06-01 00:00:00

  • Computational analysis for the determination of deleterious nsSNPs in human MTHFR gene.

    abstract::Methylenetetrahydrofolate reductase (MTHFR) is a key enzyme involved in folate metabolism and plays a central role in DNA methylation and biosynthesis. MTHFR mutations may alter the cellular folate supply which in turn affects nucleic acid synthesis, DNA methylation and chromosomal damage. The identification of number...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2018.02.022

    authors: Desai M,Chauhan JB

    更新日期:2018-06-01 00:00:00

  • Semantically predicting protein functions based on protein functional connectivity.

    abstract:BACKGROUND:The current availability of public protein-protein interaction (PPI) databases which are usually modelled as PPI networks has led to the rapid development of protein function prediction approaches. The existing network-based prediction approaches mainly focus on the topological similarities between immediate...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2013.01.002

    authors: Zhu W,Hou J,Chen YP

    更新日期:2013-06-01 00:00:00

  • Repurposing approved drugs as potential inhibitors of 3CL-protease of SARS-CoV-2: Virtual screening and structure based drug design.

    abstract::3CL proteases (3CLpro) are only found in RNA viruses and have a central role in polyprotein processing during replication. Therefore, 3CLpro has emerged as promising drug target for therapeutic treatment of infections caused by Coronaviruses. In the light of the recent major outbreak of the SARS-CoV-2 virus and the co...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107351

    authors: Meyer-Almes FJ

    更新日期:2020-10-01 00:00:00

  • Identification and characterization of differentially expressed genes in Type 2 Diabetes using in silico approach.

    abstract::Diabetes mellitus is clinically characterized by hyperglycemia. Though many studies have been done to understand the mechanism of Type 2 Diabetes (T2D), however, the complete network of diabetes and its associated disorders through polygenic involvement is still under debate. The present study designed to re-analyze p...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.01.010

    authors: Gupta MK,Vadde R

    更新日期:2019-04-01 00:00:00

  • Predicting human intestinal absorption of diverse chemicals using ensemble learning based QSAR modeling approaches.

    abstract::Human intestinal absorption (HIA) of the drugs administered through the oral route constitutes an important criterion for the candidate molecules. The computational approach for predicting the HIA of molecules may potentiate the screening of new drugs. In this study, ensemble learning (EL) based qualitative and quanti...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.01.005

    authors: Basant N,Gupta S,Singh KP

    更新日期:2016-04-01 00:00:00

  • Spontaneous formation of annular structures observed in molecular dynamics simulations of polyglutamine peptides.

    abstract::Annular structures have been observed experimentally in aggregates of polyglutamine-containing proteins and other proteins associated with diseases of the brain. Here we report the observation of annular structures in molecular-level simulations of large systems of model polyglutamine peptides. A system of 24 polyglut...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2006.01.003

    authors: Marchut AJ,Hall CK

    更新日期:2006-06-01 00:00:00