Abstract:
:DNA microarray data has been widely used in cancer research due to the significant advantage helped to successfully distinguish between tumor classes. However, typical gene expression data usually presents a high-dimensional imbalanced characteristic, which poses severe challenge for traditional machine learning methods to construct a robust classifier performing well on both the minority and majority classes. As one of the most successful feature weighting techniques, Relief is considered to particularly suit to handle high-dimensional problems. Unfortunately, almost all relief-based methods have not taken the class imbalance distribution into account. This study identifies that existing Relief-based algorithms may underestimate the features with the discernibility ability of minority classes, and ignore the distribution characteristic of minority class samples. As a result, an additional bias towards being classified into the majority classes can be introduced. To this end, a new method, named imRelief, is proposed for efficiently handling high-dimensional imbalanced gene expression data. imRelief can correct the bias towards to the majority classes, and consider the scattered distributional characteristic of minority class samples in the process of estimating feature weights. This way, imRelief has the ability to reward the features which perform well at separating the minority classes from other classes. Experiments on four microarray gene expression data sets demonstrate the effectiveness of imRelief in both feature weighting and feature subset selection applications.
journal_name
Comput Biol Chemjournal_title
Computational biology and chemistryauthors
He Y,Zhou J,Lin Y,Zhu Tdoi
10.1016/j.compbiolchem.2019.03.017subject
Has Abstractpub_date
2019-06-01 00:00:00pages
121-127eissn
1476-9271issn
1476-928Xpii
S1476-9271(19)30193-8journal_volume
80pub_type
杂志文章abstract::In this paper, a problem of isothermic DNA sequencing by hybridization (SBH) is considered. In isothermic SBH a new type of oligonucleotide libraries is used. The library consists of oligonucleotides of different lengths depending on an oligonucleotide content. It is assumed that every oligonucleotide in such a librar...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2003.12.002
更新日期:2004-02-01 00:00:00
abstract::Bacterial and eukaryotic RecQ helicases comprise a family of homologous proteins necessary for maintaining genomic integrity during the cell cycle and DNA repair. There is one known bacterial RecQ helicase, and five eukaryotic RecQ helicases that have been described: RecQ1p, RecQ4p, RecQ5p, Bloom, and Werner. While th...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2008.07.005
更新日期:2008-10-01 00:00:00
abstract::The metabolic rearrangements occurring in cancer cells can be effectively investigated with a Systems Biology approach supported by metabolic network modeling. We here present tissue-specific constraint-based core models for three different types of tumors (liver, breast and lung) that serve this purpose. The core mod...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.03.002
更新日期:2016-06-01 00:00:00
abstract::2-phenyl-benzotriazole xenobiotic compounds (PBTA-4, PBTA-6, PBTA-7 and PBTA-8) that were previously isolated and identified in waters of the Yodo river, in Japan (Nukaya et al., 2001; Ohe et al., 2004; Watanabe et al., 2001) were characterized as powerful pro-mutagens. In order to predict the activation mechanism of ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.04.004
更新日期:2018-06-01 00:00:00
abstract::A translation (framing) code based on the circular code was proposed in Michel (2012) with the identification of X circular code motifs (X motifs shortly) in the bacterial rRNA of Thermus thermophilus, in particular in the ribosome decoding center. Three classes of X motifs are now identified in the rRNAs of bacteria ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2014.08.001
更新日期:2014-10-01 00:00:00
abstract:BACKGROUND:Associating protein complexes to human inherited diseases is critical for better understanding of biological processes and functional mechanisms of the disease. Many protein complexes have been identified and functionally annotated by computational and purification methods so far, however, the particular rol...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2017.04.007
更新日期:2017-08-01 00:00:00
abstract::In this work we have studied the interaction of zervamicin IIB (ZrvIIB) with the model membranes of eukaryotes and prokaryotes using all-atom molecular dynamics. In all our simulations zervamicin molecule interacted only with lipid headgroups but did not penetrate the hydrophobic core of the bilayers. During the inter...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2010.12.005
更新日期:2011-02-01 00:00:00
abstract::In this study, the dipeptidyl peptidase-IV (DPP-IV) inhibition activities of a series of novel aminomethyl-piperidones were investigated by molecular docking studies and modeled by quantitative structure-activity relationship (QSAR) methodology. Molecular docking studies were used to find the best conformations of the...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.08.003
更新日期:2016-10-01 00:00:00
abstract::Currently Alzheimer's disease (AD) is a devastating neurological disorder that mainly affects the elderly. The treatment of AD has as main objective to increase the levels of ACh in the synaptic cleft by inhibiting the cholinesterase enzymes, which are responsible for the degradation of ACh. Twenty one synthesized cou...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107293
更新日期:2020-05-29 00:00:00
abstract::With advances in genomics, transcriptomics, metabolomics and proteomics, and more expansive electronic clinical record monitoring, as well as advances in computation, we have entered the Big Data era in biomedical research. Data gathering is growing rapidly while only a small fraction of this data is converted to usef...
journal_title:Computational biology and chemistry
pub_type: 杂志文章,评审
doi:10.1016/j.compbiolchem.2015.08.005
更新日期:2015-12-01 00:00:00
abstract::Accurate clustering of cells from single-cell RNA sequencing (scRNA-seq) data is an essential step for biological analysis such as putative cell type identification. However, scRNA-seq data has high dimension and high sparsity, which makes traditional clustering methods less effective to reflect the similarity between...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107415
更新日期:2020-11-18 00:00:00
abstract::Beta-secretase is a potential target for inhibitory drugs against Alzheimer's disease as it cleaves amyloid precursor protein (APP) to form insoluble amyloid plaques and vascular deposits in the brain. Beta-secretase is matured from its precursor protein, called beta-secretase zymogen, which, different from most of ot...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2007.03.007
更新日期:2007-06-01 00:00:00
abstract::Shortest common supersequence (SCS) is a classical NP-hard problem, where a string to be constructed that is the supersequence of a given string set. The SCS problem has an enormous application of data compression, query optimization in the database and different bioinformatics activities. Due to NP-hardness, the exac...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.05.004
更新日期:2016-10-01 00:00:00
abstract::Interleukin-1β is a drug target in rheumatoid arthritis and several auto-immune disorders. In this study, a set of 48 compounds with the determined IC50 values were used for QSAR analysis by MOE. The QSAR model was developed by using training set of 41 compounds, based on 12 unique descriptors. Model was validated by ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.06.004
更新日期:2015-10-01 00:00:00
abstract::Nodulation is one of the very important processes of legume plants as it is the initiating event of fixing nitrogen. Although ethylene has essential role in normal plant metabolism but it has also negative impact on plants particularly in nodule formation in legume plants. It is also produced due to a variety of bioti...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2017.02.005
更新日期:2017-06-01 00:00:00
abstract::The human X chromosome contains a large number of inverted repeat DNA palindromes. Although arbitrary substitutions destroyed the inverted repeat structure of MAGEA/CSAG-palindrome during the evolutionary process of the primates, most of the substitutions are compensatory. Using maximum parsimony, it is demonstrated t...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2012.11.002
更新日期:2013-02-01 00:00:00
abstract::We present the mechanism of interaction of Wnt network module, which is responsible for periodic somitogenesis, with p53 regulatory network, which is one of the main regulators of various cellular functions, and switching of various oscillating states by investigating p53-Wnt model. The variation in Nutlin concentrati...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.07.014
更新日期:2015-12-01 00:00:00
abstract::Laminin-111 is a trimeric glycoprotein of the extracellular matrix (ECM) that holds a significant role in cell adhesion, migration and differentiation. Laminin-111 is the most studied laminin isoform, composed of three chains; α1, β1 and γ1. Phosphorylation is the most common eukaryotic post - translational modificati...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.04.012
更新日期:2019-06-01 00:00:00
abstract::Bio-entity name recognition is the key step for information extraction from biomedical literature. This paper presents a dictionary-based bio-entity name recognition approach. The approach expands the bio-entity name dictionary via the Abbreviation Definitions identifying algorithm, improves the recall rate through th...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2008.03.008
更新日期:2008-08-01 00:00:00
abstract::The Protein Structure Prediction (PSP) problem comprises, among other issues, forecasting the three-dimensional native structure of proteins using only their primary structure information. Most computational studies in this area use synthetic data instead of real biological data. However, the closer to the real-world,...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.107192
更新日期:2020-02-01 00:00:00
abstract::Proline is an important osmotic adjusting material greatly accumulated under drought stress and can help plant to adapt to osmotic stress. MicroRNAs (miRNAs) are small, endogenous RNAs that play important regulatory roles in plant development and stress response by negatively affecting gene expression at post-transcri...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2013.04.006
更新日期:2013-10-01 00:00:00
abstract::In this study, the structural and antioxidant behavior of the three lichen-derived natural compounds such as atranorin (AT), evernic acid (EV) and diffractaic acid (DF) has been investigated in the gas and water phase using both B3LYP and M06-2X functional level of density functional theory (DFT) with two different ba...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.03.009
更新日期:2019-06-01 00:00:00
abstract::Ecology is said by many to suffer for want of a central theory, such as Newton's laws of motion provide for classical mechanics or Schroedinger's wave equation provides for quantum physics. From among a plurality of contending laws to govern ecosystem behavior, the principle of increasing ascendency shows some early p...
journal_title:Computational biology and chemistry
pub_type: 杂志文章,评审
doi:10.1016/s1476-9271(03)00050-1
更新日期:2003-12-01 00:00:00
abstract::Human Leukocyte Antigens (HLA) are highly polymorphic proteins that play a key role in the immune system. HLA molecule is present on the cell membrane of antigen-presenting cells of the immune system and presents short peptides, originating from the proteins of invading pathogens or self-proteins, to the T-cell Recept...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.02.019
更新日期:2016-06-01 00:00:00
abstract::Methylenetetrahydrofolate reductase (MTHFR) is a key enzyme involved in folate metabolism and plays a central role in DNA methylation and biosynthesis. MTHFR mutations may alter the cellular folate supply which in turn affects nucleic acid synthesis, DNA methylation and chromosomal damage. The identification of number...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.02.022
更新日期:2018-06-01 00:00:00
abstract:BACKGROUND:The current availability of public protein-protein interaction (PPI) databases which are usually modelled as PPI networks has led to the rapid development of protein function prediction approaches. The existing network-based prediction approaches mainly focus on the topological similarities between immediate...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2013.01.002
更新日期:2013-06-01 00:00:00
abstract::3CL proteases (3CLpro) are only found in RNA viruses and have a central role in polyprotein processing during replication. Therefore, 3CLpro has emerged as promising drug target for therapeutic treatment of infections caused by Coronaviruses. In the light of the recent major outbreak of the SARS-CoV-2 virus and the co...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107351
更新日期:2020-10-01 00:00:00
abstract::Diabetes mellitus is clinically characterized by hyperglycemia. Though many studies have been done to understand the mechanism of Type 2 Diabetes (T2D), however, the complete network of diabetes and its associated disorders through polygenic involvement is still under debate. The present study designed to re-analyze p...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.01.010
更新日期:2019-04-01 00:00:00
abstract::Human intestinal absorption (HIA) of the drugs administered through the oral route constitutes an important criterion for the candidate molecules. The computational approach for predicting the HIA of molecules may potentiate the screening of new drugs. In this study, ensemble learning (EL) based qualitative and quanti...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.01.005
更新日期:2016-04-01 00:00:00
abstract::Annular structures have been observed experimentally in aggregates of polyglutamine-containing proteins and other proteins associated with diseases of the brain. Here we report the observation of annular structures in molecular-level simulations of large systems of model polyglutamine peptides. A system of 24 polyglut...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2006.01.003
更新日期:2006-06-01 00:00:00