当前位置： SCI文献检索 > BMC BIOINFORMATICS期刊下所有文献 > Bounded search for de novo identification of degenerate cis-regulatory elements.

Bounded search for de novo identification of degenerate cis-regulatory elements.

Abstract：

BACKGROUND:The identification of statistically overrepresented sequences in the upstream regions of coregulated genes should theoretically permit the identification of potential cis-regulatory elements. However, in practice many cis-regulatory elements are highly degenerate, precluding the use of an exhaustive word-counting strategy for their identification. While numerous methods exist for inferring base distributions using a position weight matrix, recent studies suggest that the independence assumptions inherent in the model, as well as the inability to reach a global optimum, limit this approach. RESULTS:In this paper, we report PRISM, a degenerate motif finder that leverages the relationship between the statistical significance of a set of binding sites and that of the individual binding sites. PRISM first identifies overrepresented, non-degenerate consensus motifs, then iteratively relaxes each one into a high-scoring degenerate motif. This approach requires no tunable parameters, thereby lending itself to unbiased performance comparisons. We therefore compare PRISM's performance against nine popular motif finders on 28 well-characterized S. cerevisiae regulons. PRISM consistently outperforms all other programs. Finally, we use PRISM to predict the binding sites of uncharacterized regulons. Our results support a proposed mechanism of action for the yeast cell-cycle transcription factor Stb1, whose binding site has not been determined experimentally. CONCLUSION:The relationship between statistical measures of the binding sites and the set as a whole leads to a simple means of identifying the diverse range of cis-regulatory elements to which a protein binds. This approach leverages the advantages of word-counting, in that position dependencies are implicitly accounted for and local optima are more easily avoided. While we sacrifice guaranteed optimality to prevent the exponential blowup of exhaustive search, we prove that the error is bounded and experimentally show that the performance is superior to other methods. A Java implementation of this algorithm can be downloaded from our web server at http://genie.dartmouth.edu/prism.

journal_name

BMC Bioinformatics

journal_title

BMC bioinformatics

authors

Carlson JM,Chakravarty A,Khetani RS,Gross RH

doi

10.1186/1471-2105-7-254

subject

Has Abstract

pub_date

2006-05-15 00:00:00

pages

254

issn

1471-2105

pii

1471-2105-7-254

journal_volume

pub_type

杂志文章

在线工具

Bounded search for de novo identification of degenerate cis-regulatory elements.