Abstract:
:Structural class characterizes the overall folding type of a protein or its domain. Many methods have been proposed to improve the prediction accuracy of protein structural class in recent years, but it is still a challenge for the low-similarity sequences. In this study, we introduce a feature extraction technique based on auto cross covariance (ACC) transformation of position-specific score matrix (PSSM) to represent a protein sequence. Then support vector machine-recursive feature elimination (SVM-RFE) is adopted to select top K features according to their importance and these features are input to a support vector machine (SVM) to conduct the prediction. Performance evaluation of the proposed method is performed using the jackknife test on three low-similarity datasets, i.e., D640, 1189 and 25PDB. By means of this method, the overall accuracies of 97.2%, 96.2%, and 93.3% are achieved on these three datasets, which are higher than those of most existing methods. This suggests that the proposed method could serve as a very cost-effective tool for predicting protein structural class especially for low-similarity datasets.
journal_name
Comput Biol Chemjournal_title
Computational biology and chemistryauthors
Li X,Liu T,Tao P,Wang C,Chen Ldoi
10.1016/j.compbiolchem.2015.08.012subject
Has Abstractpub_date
2015-12-01 00:00:00pages
95-100eissn
1476-9271issn
1476-928Xpii
S1476-9271(15)30144-4journal_volume
59 Pt Apub_type
杂志文章abstract::Protein kinases are enzymes acting as a source of phosphate through ATP to regulate protein biological activities by phosphorylating groups of specific amino acids. For that reason, inhibiting protein kinases with an active small molecule plays a significant role in cancer treatment. To achieve this aim, computational...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107269
更新日期:2020-06-01 00:00:00
abstract::Various subsets of self-avoiding walks naturally appear when investigating existing methods designed to predict the 3D conformation of a protein of interest. Two such subsets, namely the folded and the unfoldable self-avoiding walks, are studied computationally in this article. We show that these two sets are equal an...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2013.10.001
更新日期:2013-12-01 00:00:00
abstract::Development in two-dimensional (2D) drug-delivery materials have quickly translated into biological and pharmacological fields. In this present work, pristine graphene (PG) and hexagonal boron nitride (h-BN) sheets are explored as a drug carrier for cytarabine (CYT) and clofarabine (CLF) anti-cancer drugs using densit...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107334
更新日期:2020-10-01 00:00:00
abstract::We present an algorithm for automatically predicting the topological family of any RNA three-way junction, given only the information from the secondary structure: the sequence and the Watson-Crick pairings. The parameters of the algorithm have been determined on a data set of 33 three-way junctions whose 3D conformat...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2011.11.001
更新日期:2012-04-01 00:00:00
abstract::With an increasing number of publicly available microarray datasets, it becomes attractive to borrow information from other relevant studies to have more reliable and powerful analysis of a given dataset. We do not assume that subjects in the current study and other relevant studies are drawn from the same population ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2005.04.002
更新日期:2005-06-01 00:00:00
abstract::Relative amino acid residue solvent accessibility values allow the quantitative comparison of atomic solvent-accessible surface areas in different residue types and physical environments in proteins and in protein structural alignments. Geometry-optimised tri-peptide structures in extended solvent-exposed reference co...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2014.11.007
更新日期:2015-02-01 00:00:00
abstract::The presence of repetitive or non-unique DNA persisting over sizable regions of a eukaryotic genome can hinder the genome's successful de novo assembly from short reads: ambiguities in assigning genome locations to the non-unique subsequences can result in premature termination of contigs and thus overfragmented assem...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.02.016
更新日期:2016-04-01 00:00:00
abstract::Epigallocatechin gallate, a flavonoid from Camellia sinensis possess various pharmacological activities such as anticancer, antimicrobial and antioxidant etc. Adenosine deaminase, (ADA), is a key enzyme involved in the purine metabolism, the inhibitors of which is being considered as highly promising candidate for the...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.107111
更新日期:2019-12-01 00:00:00
abstract::A computational study of the interaction of cardiac troponin I (cTnI) with its specific antibody and of that antibody with skeletal troponin I (sTnI), the principal interferon of cTnI, is carried out. Computational and simulation tools such as FTSite, FTMap, FTDock and pyDock are used to determine the binding sites of...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2019.04.002
更新日期:2019-06-01 00:00:00
abstract::Splice site recognition is among the most significant and challenging tasks in bioinformatics due to its key role in gene annotation. Effective prediction of splice site requires nucleotide encoding methods that reveal the characteristics of DNA sequences to provide appropriate features to serve as input of machine le...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.02.005
更新日期:2018-04-01 00:00:00
abstract::The metabolic rearrangements occurring in cancer cells can be effectively investigated with a Systems Biology approach supported by metabolic network modeling. We here present tissue-specific constraint-based core models for three different types of tumors (liver, breast and lung) that serve this purpose. The core mod...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2016.03.002
更新日期:2016-06-01 00:00:00
abstract::In order to extract phylogenetic information from DNA sequences, the new normalized k-word average relative distance is proposed in this paper. The proposed measure was tested by discriminate analysis and phylogenetic analysis. The phylogenetic trees based on the Manhattan distance measure are reconstructed with k ran...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2014.10.007
更新日期:2014-12-01 00:00:00
abstract::MicroRNAs (miRNAs) have been implicated in complex vertebrate developmental and pathological systems as a versatile class of molecules involved in the regulation of various biological processes and molecular pathways. To elucidate the role of miRNAs in human somatic cells, an understanding of the molecular framework r...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2009.09.003
更新日期:2009-12-01 00:00:00
abstract::It has recently been shown that cancer genes (oncogenes) tend to have heterogeneous expressions across disease samples. So it is reasonable to assume that in a microarray data only a subset of disease samples will be activated (often referred to as outliers), which presents some new challenges for statistical analysis...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2007.02.004
更新日期:2007-04-01 00:00:00
abstract::Detection of protein complexes is very important to understand the principles of cellular organization and function. Recently, large protein-protein interactions (PPIs) networks have become available using high-throughput experimental techniques. These networks make it possible to develop computational methods for pro...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.07.012
更新日期:2015-10-01 00:00:00
abstract::There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interact...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2012.12.003
更新日期:2013-04-01 00:00:00
abstract::G protein-coupled receptors (GPCRs), a large eukaryotic protein family, have proved difficult to comprehensively detect and functionally identify by homology searches and domain detection, because they are highly divergent and their sequences share strikingly little similarity. Transmembrane (TM) topology pattern anal...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2003.11.003
更新日期:2004-02-01 00:00:00
abstract::With the development of cancer research, various gene expression datasets containing cancer information show an explosive growth trend. In addition, due to the continuous maturity of single-cell RNA sequencing (scRNA-seq) technology, the protein information and pedigree information of a single cell are also continuous...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2020.107368
更新日期:2020-12-01 00:00:00
abstract::This article has been withdrawn at the request of the author(s) and/or editor. The Publisher apologizes for any inconvenience this may cause. The full Elsevier Policy on Article Withdrawal can be found at http://www.elsevier.com/locate/withdrawalpolicy. ...
journal_title:Computational biology and chemistry
pub_type: 撤回出版物
doi:10.1016/j.compbiolchem.2016.02.010
更新日期:2016-02-17 00:00:00
abstract::In this paper, a method for single individual haplotype (SIH) reconstruction using Asexual reproduction optimization (ARO) is proposed. Haplotypes, as a set of genetic variations in each chromosome, contain vital information such as the relationship between human genome and diseases. Finding haplotypes in diploid orga...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2017.12.005
更新日期:2018-02-01 00:00:00
abstract::We present the mechanism of interaction of Wnt network module, which is responsible for periodic somitogenesis, with p53 regulatory network, which is one of the main regulators of various cellular functions, and switching of various oscillating states by investigating p53-Wnt model. The variation in Nutlin concentrati...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.07.014
更新日期:2015-12-01 00:00:00
abstract::Interleukin-1β is a drug target in rheumatoid arthritis and several auto-immune disorders. In this study, a set of 48 compounds with the determined IC50 values were used for QSAR analysis by MOE. The QSAR model was developed by using training set of 41 compounds, based on 12 unique descriptors. Model was validated by ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.06.004
更新日期:2015-10-01 00:00:00
abstract::Ecology is said by many to suffer for want of a central theory, such as Newton's laws of motion provide for classical mechanics or Schroedinger's wave equation provides for quantum physics. From among a plurality of contending laws to govern ecosystem behavior, the principle of increasing ascendency shows some early p...
journal_title:Computational biology and chemistry
pub_type: 杂志文章,评审
doi:10.1016/s1476-9271(03)00050-1
更新日期:2003-12-01 00:00:00
abstract::Proteins physically interact with each other and form protein complexes to perform their biological functions. The prediction of protein complexes from protein-protein interaction (PPI) network is usually difficult when the complexes are overlapping with each other in a dense region of the network. To address the prob...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.03.012
更新日期:2018-06-01 00:00:00
abstract::The electrostatic (ES) energy of each residue was for the first time quantitatively evaluated in a flavin mononucleotide binding protein (FBP). A residue electrostatic energy (RES) was obtained as the sum of the ES energies between atoms in each residue and all other atoms in the FBP dimer using atomic coordinates obt...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.01.001
更新日期:2018-02-01 00:00:00
abstract::Often the quantitative data coming from proteomics and metabolomics studies have irregular distribution with a spike. None of the wide used methods for human QTL mapping are applicable to such traits. Researchers have to reduce the sample, excluding the spike, and analyze only continuous measurements. In this study, w...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2011.11.004
更新日期:2012-02-01 00:00:00
abstract::This paper describes an efficient algorithm based on a new concept called gene team for detecting conserved gene clusters among an arbitrary number of chromosomes. Within the clusters, neither the order of the genes nor their orientation need be conserved. In addition, insertion of foreign genes within the clusters ar...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/s1476-9271(02)00097-x
更新日期:2003-02-01 00:00:00
abstract::Optimally weighted fuzzy k-nearest neighbors (OWFKNN) algorithm has been used to predict proteins' subcellular locations based on their amino acid composition, in this paper. The datasets used consists of two species which are 997 prokaryotic and 2427 eukaryotic protein sequences. The overall prediction accuracy achie...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2008.07.011
更新日期:2008-12-01 00:00:00
abstract::Cell type identification from an unknown sample can often be done by comparing its gene expression profile against a gene expression database containing profiles of a large number of cell-types. This type of compendium-based cell-type identification strategy is particularly successful for human and mouse samples becau...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2018.10.003
更新日期:2018-12-01 00:00:00
abstract::Non-specific lipid transfer proteins (nsLTPs) are common allergens and they are particularly widespread within the plant kingdom. They have a highly conserved three-dimensional structure that generate a strong cross-reactivity among the members of this family. In the last years several web tools for the prediction of ...
journal_title:Computational biology and chemistry
pub_type: 杂志文章
doi:10.1016/j.compbiolchem.2015.11.006
更新日期:2016-02-01 00:00:00