Assessing clusters and motifs from gene expression data.

Abstract:

:Large-scale gene expression studies and genomic sequencing projects are providing vast amounts of information that can be used to identify or predict cellular regulatory processes. Genes can be clustered on the basis of the similarity of their expression profiles or function and these clusters are likely to contain genes that are regulated by the same transcription factors. Searches for cis-regulatory elements can then be undertaken in the noncoding regions of the clustered genes. However, it is necessary to assess the efficiency of both the gene clustering and the postulated regulatory motifs, as there are many difficulties associated with clustering and determining the functional relevance of matches to sequence motifs. We have developed a method to assess the potential functional significance of clusters and motifs based on the probability of finding a certain number of matches to a motif in all of the gene clusters. To avoid problems with threshold scores for a match, the top matches to a motif are taken in several sample sizes. Genes from a sample are then counted by the cluster in which they appear. The probability of observing these counts by chance is calculated using the hypergeometric distribution. Because of the multiple sample sizes, strong and weak matching motifs can be detected and refined and significant matches to motifs across cluster boundaries are observed as all clusters are considered. By applying this method to many motifs and to a cluster set of yeast genes, we detected a similarity between Swi Five Factor and forkhead proteins and suggest that the currently unidentified Swi Five Factor is one of the yeast forkhead proteins.

journal_name

Genome Res

journal_title

Genome research

authors

Jakt LM,Cao L,Cheah KS,Smith DK

doi

10.1101/gr.148301

subject

Has Abstract

pub_date

2001-01-01 00:00:00

pages

112-23

issue

1

eissn

1088-9051

issn

1549-5469

journal_volume

11

pub_type

杂志文章
  • Cell-type, allelic, and genetic signatures in the human pancreatic beta cell transcriptome.

    abstract::Elucidating the pathophysiology and molecular attributes of common disorders as well as developing targeted and effective treatments hinges on the study of the relevant cell type and tissues. Pancreatic beta cells within the islets of Langerhans are centrally involved in the pathogenesis of both type 1 and type 2 diab...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.150706.112

    authors: Nica AC,Ongen H,Irminger JC,Bosco D,Berney T,Antonarakis SE,Halban PA,Dermitzakis ET

    更新日期:2013-09-01 00:00:00

  • Whole-genome sequence assembly for mammalian genomes: Arachne 2.

    abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.828403

    authors: Jaffe DB,Butler J,Gnerre S,Mauceli E,Lindblad-Toh K,Mesirov JP,Zody MC,Lander ES

    更新日期:2003-01-01 00:00:00

  • Polycomb preferentially targets stalled promoters of coding and noncoding transcripts.

    abstract::The Polycomb group (PcG) and Trithorax group (TrxG) of proteins are required for stable and heritable maintenance of repressed and active gene expression states. Their antagonistic function on gene control, repression for PcG and activity for TrxG, is mediated by binding to chromatin and subsequent epigenetic modifica...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.114348.110

    authors: Enderle D,Beisel C,Stadler MB,Gerstung M,Athri P,Paro R

    更新日期:2011-02-01 00:00:00

  • A first version of the Caenorhabditis elegans Promoterome.

    abstract::An important aspect of the development of systems biology approaches in metazoans is the characterization of expression patterns of nearly all genes predicted from genome sequences. Such "localizome" maps should provide information on where (in what cells or tissues) and when (at what stage of development or under wha...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2497604

    authors: Dupuy D,Li QR,Deplancke B,Boxem M,Hao T,Lamesch P,Sequerra R,Bosak S,Doucette-Stamm L,Hope IA,Hill DE,Walhout AJ,Vidal M

    更新日期:2004-10-01 00:00:00

  • Genome function and nuclear architecture: from gene expression to nanoscience.

    abstract::Biophysical, chemical, and nanoscience approaches to the study of nuclear structure and activity have been developing recently and hold considerable promise. A selection of fundamental problems in genome organization and function are reviewed and discussed in the context of these new perspectives and approaches. Advan...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:10.1101/gr.946403

    authors: O'Brien TP,Bult CJ,Cremer C,Grunze M,Knowles BB,Langowski J,McNally J,Pederson T,Politz JC,Pombo A,Schmahl G,Spatz JP,van Driel R

    更新日期:2003-06-01 00:00:00

  • Sequential ChIP-bisulfite sequencing enables direct genome-scale investigation of chromatin and DNA methylation cross-talk.

    abstract::Cross-talk between DNA methylation and histone modifications drives the establishment of composite epigenetic signatures and is traditionally studied using correlative rather than direct approaches. Here, we present sequential ChIP-bisulfite-sequencing (ChIP-BS-seq) as an approach to quantitatively assess DNA methylat...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.133728.111

    authors: Brinkman AB,Gu H,Bartels SJ,Zhang Y,Matarese F,Simmer F,Marks H,Bock C,Gnirke A,Meissner A,Stunnenberg HG

    更新日期:2012-06-01 00:00:00

  • Allele-specific Holliday junction formation: a new mechanism of allelic discrimination for SNP scoring.

    abstract::We report here a new mechanism for allelic discrimination--allele-specific Holliday Junction formation. The Holliday Junction (HJ) is a unique DNA structure that can be formed in a sequence-nonspecific manner by routine PCR. To cause the PCR-based HJ formation to occur in an allele-specific manner, the PCR primers are...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.997703

    authors: Yang Q,Lishanski A,Yang W,Hatcher S,Seet H,Gregg JP

    更新日期:2003-07-01 00:00:00

  • Integrative analysis of genome-wide loss of heterozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer.

    abstract::Loss of heterozygosity (LOH) and copy number alteration (CNA) feature prominently in the somatic genomic landscape of tumors. As such, karyotypic aberrations in cancer genomes have been studied extensively to discover novel oncogenes and tumor-suppressor genes. Advances in sequencing technology have enabled the cost-e...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.137570.112

    authors: Ha G,Roth A,Lai D,Bashashati A,Ding J,Goya R,Giuliany R,Rosner J,Oloumi A,Shumansky K,Chin SF,Turashvili G,Hirst M,Caldas C,Marra MA,Aparicio S,Shah SP

    更新日期:2012-10-01 00:00:00

  • X chromosome cDNA microarray screening identifies a functional PLP2 promoter polymorphism enriched in patients with X-linked mental retardation.

    abstract::X-linked Mental Retardation (XLMR) occurs in 1 in 600 males and is highly genetically heterogeneous. We used a novel human X chromosome cDNA microarray (XCA) to survey the expression profile of X-linked genes in lymphoblasts of XLMR males. Genes with altered expression verified by Northern blot and/or quantitative PCR...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5336307

    authors: Zhang L,Jie C,Obie C,Abidi F,Schwartz CE,Stevenson RE,Valle D,Wang T

    更新日期:2007-05-01 00:00:00

  • Large-scale genome analysis of bovine commensal Escherichia coli reveals that bovine-adapted E. coli lineages are serving as evolutionary sources of the emergence of human intestinal pathogenic strains.

    abstract::How pathogens evolve their virulence to humans in nature is a scientific issue of great medical and biological importance. Shiga toxin (Stx)-producing Escherichia coli (STEC) and enteropathogenic E. coli (EPEC) are the major foodborne pathogens that can cause hemolytic uremic syndrome and infantile diarrhea, respectiv...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.249268.119

    authors: Arimizu Y,Kirino Y,Sato MP,Uno K,Sato T,Gotoh Y,Auvray F,Brugere H,Oswald E,Mainil JG,Anklam KS,Döpfer D,Yoshino S,Ooka T,Tanizawa Y,Nakamura Y,Iguchi A,Morita-Ishihara T,Ohnishi M,Akashi K,Hayashi T,Ogura Y

    更新日期:2019-09-01 00:00:00

  • Immune signatures correlate with L1 retrotransposition in gastrointestinal cancers.

    abstract::Long interspersed nuclear element-1 (LINE-1 or L1) retrotransposons are normally suppressed in somatic tissues mainly due to DNA methylation and antiviral defense. However, the mechanism to suppress L1s may be disrupted in cancers, thus allowing L1s to act as insertional mutagens and cause genomic rearrangement and in...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.231837.117

    authors: Jung H,Choi JK,Lee EA

    更新日期:2018-08-01 00:00:00

  • General gene movement off the X chromosome in the Drosophila genus.

    abstract::In Drosophila melanogaster, there is an excess of genes duplicated by retroposition from the X chromosome to the autosomes. Most of those retrogenes that originated on the X chromosome have testis expression pattern. These observations could be explained by natural selection favoring genes that avoided spermatogenesis...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.088609.108

    authors: Vibranovski MD,Zhang Y,Long M

    更新日期:2009-05-01 00:00:00

  • Functional DNA methylation differences between tissues, cell types, and across individuals discovered using the M&M algorithm.

    abstract::DNA methylation plays key roles in diverse biological processes such as X chromosome inactivation, transposable element repression, genomic imprinting, and tissue-specific gene expression. Sequencing-based DNA methylation profiling provides an unprecedented opportunity to map and compare complete DNA methylomes. This ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.156539.113

    authors: Zhang B,Zhou Y,Lin N,Lowdon RF,Hong C,Nagarajan RP,Cheng JB,Li D,Stevens M,Lee HJ,Xing X,Zhou J,Sundaram V,Elliott G,Gu J,Shi T,Gascard P,Sigaroudinia M,Tlsty TD,Kadlecek T,Weiss A,O'Geen H,Farnham PJ,Maire

    更新日期:2013-09-01 00:00:00

  • Massive reshaping of genome-nuclear lamina interactions during oncogene-induced senescence.

    abstract::Cellular senescence is a mechanism that virtually irreversibly suppresses the proliferative capacity of cells in response to various stress signals. This includes the expression of activated oncogenes, which causes Oncogene-Induced Senescence (OIS). A body of evidence points to the involvement in OIS of chromatin reor...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.225763.117

    authors: Lenain C,de Graaf CA,Pagie L,Visser NL,de Haas M,de Vries SS,Peric-Hupkes D,van Steensel B,Peeper DS

    更新日期:2017-10-01 00:00:00

  • Genomic analysis identifies association of Fusobacterium with colorectal carcinoma.

    abstract::The tumor microenvironment of colorectal carcinoma is a complex community of genomically altered cancer cells, nonneoplastic cells, and a diverse collection of microorganisms. Each of these components may contribute to carcinogenesis; however, the role of the microbiota is the least well understood. We have characteri...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.126573.111

    authors: Kostic AD,Gevers D,Pedamallu CS,Michaud M,Duke F,Earl AM,Ojesina AI,Jung J,Bass AJ,Tabernero J,Baselga J,Liu C,Shivdasani RA,Ogino S,Birren BW,Huttenhower C,Garrett WS,Meyerson M

    更新日期:2012-02-01 00:00:00

  • TATA is a modular component of synthetic promoters.

    abstract::The expression of most genes is regulated by multiple transcription factors. The interactions between transcription factors produce complex patterns of gene expression that are not always obvious from the arrangement of cis-regulatory elements in a promoter. One critical element of promoters is the TATA box, the docki...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106732.110

    authors: Mogno I,Vallania F,Mitra RD,Cohen BA

    更新日期:2010-10-01 00:00:00

  • The human obese (OB) gene: RNA expression pattern and mapping on the physical, cytogenetic, and genetic maps of chromosome 7.

    abstract::The recently identified mouse obese (ob) gene apparently encodes a secreted protein that may function in the signaling pathway of adipose tissue. Mutations in the mouse ob gene are associated with the early development of gross obesity. A detailed knowledge concerning the RNA expression pattern and precise genomic loc...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5.1.5

    authors: Green ED,Maffei M,Braden VV,Proenca R,DeSilva U,Zhang Y,Chua SC Jr,Leibel RL,Weissenbach J,Friedman JM

    更新日期:1995-08-01 00:00:00

  • Interaction between the X chromosome and an autosome regulates size sexual dimorphism in Portuguese Water Dogs.

    abstract::Size sexual dimorphism occurs in almost all mammals. In Portuguese Water Dogs, much of the difference in skeletal size between females and males is due to the interaction between a Quantitative Trait Locus (QTL) on the X-chromosome and a QTL linked to Insulin-like Growth Factor 1 (IGF-1) on the CFA 15 autosome. In fem...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3712705

    authors: Chase K,Carrier DR,Adler FR,Ostrander EA,Lark KG

    更新日期:2005-12-01 00:00:00

  • Realizing the potential of blockchain technologies in genomics.

    abstract::Genomics data introduce a substantial computational burden as well as data privacy and ownership issues. Data sets generated by high-throughput sequencing platforms require immense amounts of computational resources to align to reference genomes and to call and annotate genomic variants. This problem is even more pron...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.207464.116

    authors: Ozercan HI,Ileri AM,Ayday E,Alkan C

    更新日期:2018-09-01 00:00:00

  • Transcript assembly improves expression quantification of transposable elements in single-cell RNA-seq data.

    abstract::Transposable elements (TEs) are an integral part of the host transcriptome. TE-containing noncoding RNAs (ncRNAs) show considerable tissue specificity and play important roles during development, including stem cell maintenance and cell differentiation. Recent advances in single-cell RNA-seq (scRNA-seq) revolutionized...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.265173.120

    authors: Shao W,Wang T

    更新日期:2021-01-01 00:00:00

  • Fugu ESTs: new resources for transcription analysis and genome annotation.

    abstract::The draft Fugu rubripes genome was released in 2002, at which time relatively few cDNAs were available to aid in the annotation of genes. The data presented here describe the sequencing and analysis of 24,398 expressed sequence tags (ESTs) generated from 15 different adult and juvenile Fugu tissues, 74% of which match...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1691503

    authors: Clark MS,Edwards YJ,Peterson D,Clifton SW,Thompson AJ,Sasaki M,Suzuki Y,Kikuchi K,Watabe S,Kawakami K,Sugano S,Elgar G,Johnson SL

    更新日期:2003-12-01 00:00:00

  • Retroelement distributions in the human genome: variations associated with age and proximity to genes.

    abstract::Remnants of more than 3 million transposable elements, primarily retroelements, comprise nearly half of the human genome and have generated much speculation concerning their evolutionary significance. We have exploited the draft human genome sequence to examine the distributions of retroelements on a genome-wide scale...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.388902

    authors: Medstrand P,van de Lagemaat LN,Mager DL

    更新日期:2002-10-01 00:00:00

  • Evolution of gene order in the genomes of two related yeast species.

    abstract::Changes in gene order between the genomes of two related yeast species, Saccharomyces cerevisiae and Saccharomyces bayanus var. uvarum were studied. From the dataset of a previous low coverage sequencing of the S. bayanus var. uvarum genome, 35 different synteny breakpoints between neighboring genes and two cases of l...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.212701

    authors: Fischer G,Neuvéglise C,Durrens P,Gaillardin C,Dujon B

    更新日期:2001-12-01 00:00:00

  • Murine single-cell RNA-seq reveals cell-identity- and tissue-specific trajectories of aging.

    abstract::Aging is a pleiotropic process affecting many aspects of mammalian physiology. Mammals are composed of distinct cell type identities and tissue environments, but the influence of these cell identities and environments on the trajectory of aging in individual cells remains unclear. Here, we performed single-cell RNA-se...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.253880.119

    authors: Kimmel JC,Penland L,Rubinstein ND,Hendrickson DG,Kelley DR,Rosenthal AZ

    更新日期:2019-12-01 00:00:00

  • Dynamic effects of interacting genes underlying rice flowering-time phenotypic plasticity and global adaptation.

    abstract::The phenotypic variation of living organisms is shaped by genetics, environment, and their interaction. Understanding phenotypic plasticity under natural conditions is hindered by the apparently complex environment and the interacting genes and pathways. Herein, we report findings from the dissection of rice flowering...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.255703.119

    authors: Guo T,Mu Q,Wang J,Vanous AE,Onogi A,Iwata H,Li X,Yu J

    更新日期:2020-05-01 00:00:00

  • Reconstructing complex regions of genomes using long-read sequencing technology.

    abstract::Obtaining high-quality sequence continuity of complex regions of recent segmental duplication remains one of the major challenges of finishing genome assemblies. In the human and mouse genomes, this was achieved by targeting large-insert clones using costly and laborious capillary-based sequencing approaches. Sanger s...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.168450.113

    authors: Huddleston J,Ranade S,Malig M,Antonacci F,Chaisson M,Hon L,Sudmant PH,Graves TA,Alkan C,Dennis MY,Wilson RK,Turner SW,Korlach J,Eichler EE

    更新日期:2014-04-01 00:00:00

  • De novo rates and selection of large copy number variation.

    abstract::While copy number variation (CNV) is an active area of research, de novo mutation rates within human populations are not well characterized. By focusing on large (>100 kbp) events, we estimate the rate of de novo CNV formation in humans by analyzing 4394 transmissions from human pedigrees with and without neurocogniti...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.107680.110

    authors: Itsara A,Wu H,Smith JD,Nickerson DA,Romieu I,London SJ,Eichler EE

    更新日期:2010-11-01 00:00:00

  • A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae.

    abstract::Recent computational and experimental work suggests that functional modules underlie much of cellular physiology and are a useful unit of cellular organization from the perspective of systems biology. Because interactions among modules can give rise to higher-level properties that are essential to cellular function, a...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3847105

    authors: Petti AA,Church GM

    更新日期:2005-09-01 00:00:00

  • Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss.

    abstract::The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd fami...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.5.449

    authors: Robertson HM

    更新日期:1998-05-01 00:00:00

  • Prioritizing candidate disease genes by network-based boosting of genome-wide association data.

    abstract::Network "guilt by association" (GBA) is a proven approach for identifying novel disease genes based on the observation that similar mutational phenotypes arise from functionally related genes. In principle, this approach could account even for nonadditive genetic interactions, which underlie the synergistic combinatio...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.118992.110

    authors: Lee I,Blom UM,Wang PI,Shim JE,Marcotte EM

    更新日期:2011-07-01 00:00:00