当前位置： SCI文献检索 > BMC BIOINFORMATICS期刊下所有文献 > Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines.

Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines.

Abstract：

BACKGROUND:Protein-protein interaction (PPI) plays essential roles in cellular functions. The cost, time and other limitations associated with the current experimental methods have motivated the development of computational methods for predicting PPIs. As protein interactions generally occur via domains instead of the whole molecules, predicting domain-domain interaction (DDI) is an important step toward PPI prediction. Computational methods developed so far have utilized information from various sources at different levels, from primary sequences, to molecular structures, to evolutionary profiles. RESULTS:In this paper, we propose a computational method to predict DDI using support vector machines (SVMs), based on domains represented as interaction profile hidden Markov models (ipHMM) where interacting residues in domains are explicitly modeled according to the three dimensional structural information available at the Protein Data Bank (PDB). Features about the domains are extracted first as the Fisher scores derived from the ipHMM and then selected using singular value decomposition (SVD). Domain pairs are represented by concatenating their selected feature vectors, and classified by a support vector machine trained on these feature vectors. The method is tested by leave-one-out cross validation experiments with a set of interacting protein pairs adopted from the 3DID database. The prediction accuracy has shown significant improvement as compared to InterPreTS (Interaction Prediction through Tertiary Structure), an existing method for PPI prediction that also uses the sequences and complexes of known 3D structure. CONCLUSIONS:We show that domain-domain interaction prediction can be significantly enhanced by exploiting information inherent in the domain profiles via feature selection based on Fisher scores, singular value decomposition and supervised learning based on support vector machines. Datasets and source code are freely available on the web at http://liao.cis.udel.edu/pub/svdsvm. Implemented in Matlab and supported on Linux and MS Windows.

journal_name

BMC Bioinformatics

journal_title

BMC bioinformatics

authors

González AJ,Liao L

doi

10.1186/1471-2105-11-537

subject

Has Abstract

pub_date

2010-10-29 00:00:00

pages

537

issn

1471-2105

pii

1471-2105-11-537

journal_volume

pub_type

杂志文章

在线工具

Predicting domain-domain interaction based on domain profiles with feature selection and support vector machines.