Abstract:
Analysis of large-scale gene expression studies usually begins with gene clustering. A ubiquitous problem is that different algorithms applied to the same data inevitably give different results, and the differences are often substantial, involving a quarter or more of the genes analyzed. This raises a series of important but nettlesome questions: How are different clustering results related to each other and to the underlying data structure? Is one clustering objectively superior to another? Which differences, if any, are likely candidates to be biologically important? A systematic and quantitative way to address these questions is needed, together with an effective way to integrate and leverage expression results with other kinds of large-scale data and annotations. We developed a mathematical and computational framework to help quantify, compare, visualize and interactively mine clusterings. We show that by coupling confusion matrices with appropriate metrics (linear assignment and normalized mutual information scores), one can quantify and map differences between clusterings. A version of receiver operator characteristic analysis proved effective for quantifying and visualizing cluster quality and overlap. These methods, plus a flexible library of clustering algorithms, can be called from a new expandable set of software tools called CompClust 1.0 (http://woldlab.caltech.edu/compClust/). CompClust also makes it possible to relate expression clustering patterns to DNA sequence motif occurrences, protein-DNA interaction measurements and various kinds of functional annotations. Test analyses used yeast cell cycle data and revealed data structure not obvious under all algorithms. These results were then integrated with transcription motif and global protein-DNA interaction data to identify G1 regulatory modules.
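As a rough illustration of the comparison idea in the abstract, the sketch below builds a confusion matrix between two clusterings of the same genes and scores it with a linear assignment score and a normalized mutual information (NMI) score. It does not use the CompClust API. All function names are hypothetical, the brute-force pairing search is only practical for a handful of clusters, and the symmetric NMI normalization is an assumption that may differ from the one used in the paper.

```python
from collections import Counter
from itertools import permutations
from math import log, sqrt


def confusion_matrix(a, b):
    """Count items shared by each (cluster in a, cluster in b) pair."""
    rows, cols = sorted(set(a)), sorted(set(b))
    counts = Counter(zip(a, b))
    return [[counts[(r, c)] for c in cols] for r in rows]


def linear_assignment_score(matrix):
    """Fraction of items covered by the best one-to-one cluster pairing.

    Brute force over permutations (an assumption made for brevity).
    """
    n_rows, n_cols = len(matrix), len(matrix[0])
    total = sum(map(sum, matrix))
    # Pad with zeros to a square matrix so every permutation is a pairing.
    size = max(n_rows, n_cols)
    padded = [[matrix[i][j] if i < n_rows and j < n_cols else 0
               for j in range(size)] for i in range(size)]
    best = max(sum(padded[i][p[i]] for i in range(size))
               for p in permutations(range(size)))
    return best / total


def normalized_mutual_information(matrix):
    """I(A;B) / sqrt(H(A) * H(B)), one of several common normalizations."""
    total = sum(map(sum, matrix))
    row_sums = [sum(row) for row in matrix]
    col_sums = [sum(col) for col in zip(*matrix)]
    mutual = sum((n / total) * log(n * total / (row_sums[i] * col_sums[j]))
                 for i, row in enumerate(matrix)
                 for j, n in enumerate(row) if n)
    h_a = -sum((n / total) * log(n / total) for n in row_sums if n)
    h_b = -sum((n / total) * log(n / total) for n in col_sums if n)
    if h_a == 0 or h_b == 0:
        # A single-cluster partition carries no information.
        return 1.0 if h_a == h_b else 0.0
    return mutual / sqrt(h_a * h_b)


# Two made-up clusterings of the same eight genes.
clustering_a = [0, 0, 0, 1, 1, 1, 2, 2]
clustering_b = [1, 1, 0, 0, 0, 0, 2, 2]
cm = confusion_matrix(clustering_a, clustering_b)
print(cm)
print(linear_assignment_score(cm))
print(normalized_mutual_information(cm))
```

Both scores reach 1 when the two clusterings agree up to a relabeling of the clusters. The off-diagonal cells of the confusion matrix show which genes move between clusters when the algorithm changes.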
journal_name: Nucleic Acids Res
journal_title: Nucleic acids research
authors: Hart CE, Sharenbroich L, Bornstein BJ, Trout D, King B, Mjolsness E, Wold BJ
doi: 10.1093/nar/gki536
keywords:
subject: Has Abstract
pub_date: 2005-05-10 00:00:00
pages: 2580-94
issue: 8
eissn: 1362-4962
issn: 0305-1048
pii: 33/8/2580
journal_volume: 33
pub_type: Journal article
abstract::The pattern of preferential DNA repair of UV-induced pyrimidine dimers was studied in repair-deficient Chinese hamster ovary (CHO) cells transfected with the human excision repair gene, ERCC-1. Repair efficiency was measured in the active dihydrofolate reductase (DHFR) gene and in its flanking, non-transcribed sequenc...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/16.15.7397
update_date:1988-08-11 00:00:00
abstract::During replication of nuclear ribosomal DNA (rDNA), clashes with the transcription apparatus can cause replication fork collapse and genomic instability. To avoid this problem, a replication fork barrier protein is situated downstream of rDNA, there preventing replication in the direction opposite rDNA transcription. ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkw302
update_date:2016-07-08 00:00:00
abstract::Human somatic cells have essentially no telomerase activity. Telomerase is linked to tumor genesis and is a valuable marker for malignant growth. Extreme paucity of the enzyme necessitated development of a PCR-based assay, 'telomeric repeat amplification protocol' (TRAP). Unfortunately, this method is not without dif...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/25.4.919
update_date:1997-02-15 00:00:00
abstract::Genomic meta-analysis to combine relevant and homogeneous studies has been widely applied, but the quality control (QC) and objective inclusion/exclusion criteria have been largely overlooked. Currently, the inclusion/exclusion criteria mostly depend on ad-hoc expert opinion or naïve threshold by sample size or platfo...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkr1071
update_date:2012-01-01 00:00:00
abstract::Coralyne is a small crescent-shaped molecule known to intercalate duplex and triplex DNA. We report that coralyne can cause the complete and irreversible disproportionation of duplex poly(dT)*poly(dA). That is, coralyne causes the strands of duplex poly(dT)*poly(dA) to repartition into equal molar equivalents of tripl...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/30.4.983
update_date:2002-02-15 00:00:00
abstract::DAP-like kinase (Dlk, also termed ZIP kinase) is a leucine zipper-containing serine/threonine-specific protein kinase with as yet unknown biological function(s). Interaction partners so far identified are either transcription factors or proteins that can support or counteract apoptosis. Thus, Dlk might be involved in ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/30.6.1408
update_date:2002-03-15 00:00:00
abstract::Digestion of isolated Friend erythroleukemic cell nuclei with DNase I under conditions which selectively destroy the DNA of transcriptionally "active" genes releases into the supernatant fraction proteins of the non-histone "High Mobility Group" (HMGs). Two of these, HMG-14 and HMG-17 (identified by solubility in trich...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/8.9.1947
update_date:1980-05-10 00:00:00
abstract::The transfer ribonucleic acids (tRNAs) of B. subtilis at different growth phases are examined for changes in the composition and the methylation of minor constituents. The composition of the tRNAs indicates about equal amounts of adenosine and uridine, and of guanosine and cytidine. About 3-4 residues are present as m...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/3.5.1249
update_date:1976-05-01 00:00:00
abstract::Using a powerful computer-assisted analysis strategy, a large-scale search of small nucleolar RNA (snoRNA) genes in the recently released draft sequence of the rice genome was carried out. This analysis identified 120 different box C/D snoRNA genes with a total of 346 gene variants, which were predicted to guide 135 2...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg373
update_date:2003-05-15 00:00:00
abstract::We have studied the effect of selenium on the expression of a cellular glutathione peroxidase, GSHPx-1, in transfected MCF-7 cells and in doxorubicin-resistant (Adrr) MCF-7 cells. A GSHPx-1 cDNA with a Rous Sarcoma virus promoter was transfected into a human mammary carcinoma cell line, MCF-7, which has very low endog...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/18.6.1531
update_date:1990-03-25 00:00:00
abstract::p16INK4a and p21WAF1, two major cyclin-dependent kinase inhibitors, are the products of two tumor suppressor genes that play important roles in various cellular metabolic pathways. p21WAF1 is up-regulated in response to different DNA damaging agents. While the activation of p21WAF1 is p53-dependent following γ-rays, th...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkl1075
update_date:2007-01-01 00:00:00
abstract::With the aid of a novel poly-dA tailing-partial restriction technique and S1-protection mapping, the 5' terminal coding sequence for the 40S precursor ribosomal RNA of Xenopus laevis has been exactly identified. Since the promoter sequence for the 40S RNA should lie close to its 5' terminal coding sequence, we are abl...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/6.12.3733
update_date:1979-08-24 00:00:00
abstract::Tumor formation is partially driven by DNA copy number changes, which are typically measured using array comparative genomic hybridization, SNP arrays and DNA sequencing platforms. Many techniques are available for detecting recurring aberrations across multiple tumor samples, including CMAR, STAC, GISTIC and KC-SMART...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkt155
update_date:2013-05-01 00:00:00
abstract::An oligonucleotide hybrid is described which possesses two triple helix forming oligonucleotides which have been connected by a flexible polymeric linker chain. As a prototype, binding of this class of oligonucleotide to duplex DNA has been studied using a segment of the HSV-1 D-glycoprotein promoter, which possesses ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/21.20.4810
update_date:1993-10-11 00:00:00
abstract::The TGGCA protein, the chicken homologue of HeLa cell NF-I, was purified to homogeneity from liver tissue by a procedure which includes preparative mobility shift electrophoresis (PMSE) as the final step. PMSE was here adjusted for the isolation of the TGGCA protein, but can be used as a general method to characterize...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/15.23.9707
update_date:1987-12-10 00:00:00
abstract::Pre-mRNA splicing is catalyzed by the spliceosome, a multi-megadalton ribonucleoprotein machine. Previous work from our laboratory revealed the splicing factor SRSF1 as a regulator of the SUMO pathway, leading us to explore a connection between this pathway and the splicing machinery. We show here that addition of a r...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkx213
update_date:2017-06-20 00:00:00
abstract::Backscattering interferometry (BSI) has been used to successfully monitor molecular interactions without labeling and with high sensitivity. These properties suggest that this approach might be useful for detecting biomarkers of infection. In this report, we identify interactions and characteristics of nucleic acid pr...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkt165
update_date:2013-05-01 00:00:00
abstract::The HpaR-mediated regulation of the hpa-meta operon (Pg promoter) of the 4-hydroxyphenylacetic acid catabolic pathway of Escherichia coli has been studied. The HpaR regulator was purified to homogeneity showing that it is able to bind selectively to 4-hydroxyphenylacetic, 3-hydroxyphenylacetic and 3,4-dihydroxyphenyla...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg851
update_date:2003-11-15 00:00:00
abstract::Plant mitochondrial genomes show much more evolutionary plasticity than those of animals. We analysed the first mitochondrial DNA (mtDNA) of a lycophyte, the quillwort Isoetes engelmannii, which is separated from seed plants by more than 350 million years of evolution. The Isoetes mtDNA is particularly rich in recombi...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkp532
update_date:2009-08-01 00:00:00
abstract::POGO-DB (http://pogo.ece.drexel.edu/) provides an easy platform for comparative microbial genomics. POGO-DB allows users to compare genomes using pre-computed metrics that were derived from extensive computationally intensive BLAST comparisons of >2000 microbes. These metrics include (i) average protein sequence ident...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkt1094
update_date:2014-01-01 00:00:00
abstract::The S-adenosylmethionine (AdoMet) analog sinefungin is a natural product antibiotic that inhibits nucleic acid methyltransferases and arrests the growth of unicellular eukarya and eukaryal viruses. The basis for the particular sensitivity of fungi and protozoa to sinefungin is not known. Here we report the isolation a...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkm817
update_date:2007-01-01 00:00:00
abstract::RNase III enzymes cleave double stranded (ds)RNA. This is an essential step for regulating the processing of mRNA, rRNA, snoRNA and other small RNAs, including siRNA and miRNA. Arabidopsis thaliana encodes nine RNase III: four DICER-LIKE (DCL) and five RNASE THREE LIKE (RTL). To better understand the molecular functio...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkx820
update_date:2017-11-16 00:00:00
abstract::DNA synthesis is a fundamental requirement for cell proliferation and DNA repair, but no single method can identify the location, direction and speed of replication forks with high resolution. Mammalian cells have the ability to incorporate thymidine analogs along with the natural A, T, G and C bases during DNA synthe...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkaa517
update_date:2020-09-04 00:00:00
abstract::The synthesis of oligonucleotides (ODNs) containing 5-(N-aminohexyl)carbamoyl-2'-O-methyluridine (D) is described, and thermal stability and resistance to enzymatic hydrolysis of the ODNs are compared with ODNs containing 5-(N-aminohexyl)carbamoyl-2'-deoxyuridine (H). The ODNs containing D and the complementary RNA de...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkg374
update_date:2003-05-15 00:00:00
abstract::The vertebrate genome is a mosaic of regions differing dramatically in their G + C content. Those regions with a high G + C content contain the expected number of CpG dinucleotides and we propose that following methylation these have been protected from deamination by the increased stability of the surrounding DNA dup...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/12.14.5869
update_date:1984-07-25 00:00:00
abstract::Several different computational problems have been solved using DNA as a medium. However, the DNA computations that have so far been carried out have examined a relatively small number of possible sequence solutions in order to find correct sequence solutions. We have encoded a search algorithm in DNA that required th...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/26.22.5203
update_date:1998-11-15 00:00:00
abstract::We report a crystal structure that shows an antibiotic that extracts a nucleobase from a DNA molecule 'caught in the act' after forming a covalent bond but before departing with the base. The structure of trioxacarcin A covalently bound to double-stranded d(AACCGGTT) was determined to 1.78 Å resolution by MAD phasing ...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkn245
update_date:2008-06-01 00:00:00
abstract::CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated) systems allow bacteria to adapt to infection by acquiring 'spacer' sequences from invader DNA into genomic CRISPR loci. Cas proteins use RNAs derived from these loci to target cognate sequences for destruction through CRISPR inter...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkv1259
update_date:2015-12-15 00:00:00
abstract::We have developed an improved and rapid genomic engineering procedure for the construction of custom-designed microorganisms. This method, which can be performed in 2 days, permits restructuring of the Escherichia coli genome via markerless deletion of selected genomic regions. The deletion process was mediated by a s...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/gkn359
update_date:2008-08-01 00:00:00
abstract::Previous studies have shown that the yeast Candida albicans encodes a unique seryl-tRNA(CAG) that should decode the leucine codon CUG as serine. However, in vitro translation of several different CUG-containing mRNAs in the presence of this unusual seryl-tRNA(CAG) results in an apparent increase in the molecular weight...
journal_title:Nucleic acids research
pub_type: Journal article
doi:10.1093/nar/23.9.1481
update_date:1995-05-11 00:00:00