Cluster analysis of replicated alternative polyadenylation data using canonical correlation analysis.

Abstract:

BACKGROUND:Alternative polyadenylation (APA) has emerged as a pervasive mechanism that contributes to transcriptome complexity and the dynamics of gene regulation. The rapidly growing volume of whole-genome poly(A) site data generated by 3' end sequencing under various conditions provides a valuable data source for the study of APA-related gene expression. Cluster analysis is a powerful technique for investigating the association structure among genes. However, conventional gene clustering methods are not suitable for APA-related data because they neither consider the information on poly(A) sites within each gene (e.g., location, abundance, and number) nor measure the association among poly(A) sites between two genes.
RESULTS:Here we propose a computational framework, named PASCCA, for clustering genes from replicated or unreplicated poly(A) site data using canonical correlation analysis (CCA). PASCCA incorporates multiple layers of gene expression data from both the poly(A) site level and the gene level, and takes into account the number of replicates and the variability within each experimental group. Moreover, PASCCA characterizes poly(A) sites in several ways, including abundance and relative usage, which exploits the advantages of 3' end deep sequencing in quantifying APA sites. On both real and synthetic poly(A) site data sets, cluster analysis demonstrates that PASCCA outperforms other widely used distance measures under five performance metrics: connectivity, the Dunn index, average distance, average distance between means, and the biological homogeneity index. We also used PASCCA to infer APA-specific gene modules from recently published poly(A) site data of rice and discovered several distinct functional gene modules. We have made PASCCA available as an easy-to-use R package for APA-related gene expression analyses, including the characterization of poly(A) sites, quantification of association between genes, and clustering of genes.
CONCLUSIONS:By providing a better treatment of the noise inherent in repeated measurements and taking into account multiple layers of poly(A) site data, PASCCA could serve as a general tool for clustering and analyzing APA-specific gene expression data. PASCCA could be used to elucidate the dynamic interplay of genes and their APA sites across various biological conditions from emerging 3' end sequencing data, and thereby help address complex biological phenomena.
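
The general idea of measuring gene-to-gene association through CCA can be illustrated with a minimal sketch. This is not the PASCCA implementation and does not use the PASCCA R package; it only assumes that each gene is represented by a samples-by-sites matrix of poly(A) site abundances, and it ignores the replicate and variability handling described in the abstract. The function names `first_canonical_correlation` and `cca_distance_matrix`, and the choice of one minus the largest canonical correlation as the distance, are hypothetical and chosen for illustration.

```python
import numpy as np


def _orthonormal_basis(m, tol=1e-10):
    """Orthonormal basis of the column space of m, dropping near-null directions."""
    u, s, _ = np.linalg.svd(m, full_matrices=False)
    if s.size == 0 or s[0] <= 0:
        return u[:, :0]
    return u[:, s > tol * s[0]]


def first_canonical_correlation(x, y):
    """Largest canonical correlation between two genes.

    x: (n_samples, p_sites) array for one gene.
    y: (n_samples, q_sites) array for another gene (same samples, same order).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Center every poly(A) site profile across samples.
    x = x - x.mean(axis=0)
    y = y - y.mean(axis=0)
    qx = _orthonormal_basis(x)
    qy = _orthonormal_basis(y)
    # A gene whose sites are all constant carries no usable signal.
    if qx.shape[1] == 0 or qy.shape[1] == 0:
        return 0.0
    # Singular values of qx^T qy are the canonical correlations.
    s = np.linalg.svd(qx.T @ qy, compute_uv=False)
    return float(np.clip(s[0], 0.0, 1.0))


def cca_distance_matrix(genes):
    """Pairwise distance 1 - rho for a dict mapping gene name -> site matrix."""
    names = list(genes)
    d = np.zeros((len(names), len(names)))
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            rho = first_canonical_correlation(genes[names[i]], genes[names[j]])
            d[i, j] = d[j, i] = 1.0 - rho
    return names, d


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    gene_a = rng.normal(size=(40, 2))                        # 2 poly(A) sites
    gene_b = gene_a @ np.array([[2.0, 1.0], [0.0, 3.0]])     # linearly tied to gene_a
    gene_c = rng.normal(size=(40, 3))                        # unrelated, 3 sites
    names, d = cca_distance_matrix({"geneA": gene_a, "geneB": gene_b, "geneC": gene_c})
    print(names)
    print(np.round(d, 3))
```

The resulting matrix can be passed to any distance-based clustering routine. Note that canonical correlations are only informative when the number of samples clearly exceeds the combined number of poly(A) sites in the two genes; with too few samples the largest canonical correlation approaches 1 regardless of the data.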

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Ye W,Long Y,Ji G,Su Y,Ye P,Fu H,Wu X

doi

10.1186/s12864-019-5433-7

subject

Has Abstract

pub_date

2019-01-22 00:00:00

pages

75

issue

1

issn

1471-2164

pii

10.1186/s12864-019-5433-7

journal_volume

20

pub_type

Journal Article
  • Discovery and profiling of small RNAs responsive to stress conditions in the plant pathogen Pectobacterium atrosepticum.

    abstract:BACKGROUND:Small RNAs (sRNAs) have emerged as important regulatory molecules and have been studied in several bacteria. However, to date, there have been no whole-transcriptome studies on sRNAs in any of the Soft Rot Enterobacteriaceae (SRE) group of pathogens. Although the main ecological niches for these pathogens ar...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2376-0

    authors: Kwenda S,Gorshkov V,Ramesh AM,Naidoo S,Rubagotti E,Birch PR,Moleleki LN

    updated:2016-01-12 00:00:00

  • Correction to: RNA-sequencing reveals positional memory of multipotent mesenchymal stromal cells from oral and maxillofacial tissue transcriptomes.

    abstract:An amendment to this paper has been published and can be accessed via the original article. ...

    journal_title:BMC genomics

    pub_type: Journal Article, Published Erratum

    doi:10.1186/s12864-020-06939-7

    authors: Onizuka S,Yamazaki Y,Park SJ,Sugimoto T,Sone Y,Sjöqvist S,Usui M,Takeda A,Nakai K,Nakashima K,Iwata T

    updated:2020-08-11 00:00:00

  • In silico identification and comparative analysis of differentially expressed genes in human and mouse tissues.

    abstract:BACKGROUND:Screening for differentially expressed genes on the genomic scale and comparative analysis of the expression profiles of orthologous genes between species to study gene function and regulation are becoming increasingly feasible. Expressed sequence tags (ESTs) are an excellent source of data for such studies ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-86

    authors: Pao SY,Lin WL,Hwang MJ

    updated:2006-04-21 00:00:00

  • High-throughput sequencing of Astrammina rara: sampling the giant genome of a giant foraminiferan protist.

    abstract:BACKGROUND:Foraminiferan protists, which are significant players in most marine ecosystems, are also genetic innovators, harboring unique modifications to proteins that make up the basic eukaryotic cell machinery. Despite their ecological and evolutionary importance, foraminiferan genomes are poorly understood due to t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-169

    authors: Habura A,Hou Y,Reilly AA,Bowser SS

    updated:2011-03-31 00:00:00

  • The genome of Ensifer alkalisoli YIC4027 provides insights for host specificity and environmental adaptations.

    abstract:BACKGROUND:Ensifer alkalisoli YIC4027, a recently characterized nitrogen-fixing bacterium of the genus Ensifer, has been isolated from root nodules of the host plant Sesbania cannabina. This plant is widely used as green manure and for soil remediation. E. alkalisoli YIC4027 can grow in saline-alkaline soils and is a n...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6004-7

    authors: Dang X,Xie Z,Liu W,Sun Y,Liu X,Zhu Y,Staehelin C

    updated:2019-08-12 00:00:00

  • RNAseq expression analysis of resistant and susceptible mice after influenza A virus infection identifies novel genes associated with virus replication and important for host resistance to infection.

    abstract:BACKGROUND:The host response to influenza A infections is strongly influenced by host genetic factors. Animal models of genetically diverse mouse strains are well suited to identify host genes involved in severe pathology, viral replication and immune responses. Here, we have utilized a dual RNAseq approach that allowe...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1867-8

    authors: Wilk E,Pandey AK,Leist SR,Hatesuer B,Preusse M,Pommerenke C,Wang J,Schughart K

    updated:2015-09-02 00:00:00

  • Accumulation of interspersed and sex-specific repeats in the non-recombining region of papaya sex chromosomes.

    abstract:BACKGROUND:The papaya Y chromosome has undergone a degenerative expansion from its ancestral autosome, as a consequence of recombination suppression in the sex determining region of the sex chromosomes. The non-recombining feature led to the accumulation of repetitive sequences in the male- or hermaphrodite-specific re...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-335

    authors: Na JK,Wang J,Ming R

    updated:2014-05-04 00:00:00

  • Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples.

    abstract:BACKGROUND:Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. RESULTS:We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogast...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-509

    authors: Liu Z,Venkatesh SS,Maley CC

    updated:2008-10-30 00:00:00

  • The gonadal transcriptome of the unisexual Amazon molly Poecilia formosa in comparison to its sexual ancestors, Poecilia mexicana and Poecilia latipinna.

    abstract:BACKGROUND:The unisexual Amazon molly (Poecilia formosa) originated from a hybridization between two sexual species, the sailfin molly (Poecilia latipinna) and the Atlantic molly (Poecilia mexicana). The Amazon molly reproduces clonally via sperm-dependent parthenogenesis (gynogenesis), in which the sperm of closely re...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4382-2

    authors: Schedina IM,Groth D,Schlupp I,Tiedemann R

    updated:2018-01-03 00:00:00

  • Transcriptomic analysis reveals the gene expression profile that specifically responds to IBA during adventitious rooting in mung bean seedlings.

    abstract:BACKGROUND:Auxin plays a critical role in inducing adventitious rooting in many plants. Indole-3-butyric acid (IBA) is the most widely employed auxin for adventitious rooting. However, the molecular mechanisms by which auxin regulate the process of adventitious rooting are less well known. RESULTS:The RNA-Seq data ana...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2372-4

    authors: Li SW,Shi RF,Leng Y,Zhou Y

    updated:2016-01-12 00:00:00

  • Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.

    abstract:BACKGROUND:High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequenc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-16

    authors: Greminger MP,Stölting KN,Nater A,Goossens B,Arora N,Bruggmann R,Patrignani A,Nussberger B,Sharma R,Kraus RH,Ambu LN,Singleton I,Chikhi L,van Schaik CP,Krützen M

    updated:2014-01-10 00:00:00

  • Overlap between eQTL and QTL associated with production traits and fertility in dairy cattle.

    abstract:BACKGROUND:Identifying causative mutations or genes through which quantitative trait loci (QTL) act has proven very difficult. Using information such as gene expression may help to identify genes and mutations underlying QTL. Our objective was to identify regions associated both with production traits or fertility and ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5656-7

    authors: van den Berg I,Hayes BJ,Chamberlain AJ,Goddard ME

    updated:2019-04-15 00:00:00

  • A catalog of CasX genome editing sites in common model organisms.

    abstract:DpbCasX, also called Cas12e, is an RNA-guided DNA endonuclease isolated from Deltaproteobacteria. In this paper I characterized the CasX-compatible genome editing sites in the reference genomes of yeast (Saccharomyces cerevisiae), flatworms (Caenorhabditis elegans), flies (Drosophila melanogaster), zebrafish (Danio re...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5924-6

    authors: Roberson EDO

    updated:2019-06-27 00:00:00

  • Multiple actions of lysophosphatidic acid on fibroblasts revealed by transcriptional profiling.

    abstract:BACKGROUND:Lysophosphatidic acid (LPA) is a lipid mediator that acts through specific G protein-coupled receptors to stimulate the proliferation, migration and survival of many cell types. LPA signaling has been implicated in development, wound healing and cancer. While LPA signaling pathways have been studied extensiv...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-387

    authors: Stortelers C,Kerkhoven R,Moolenaar WH

    updated:2008-08-14 00:00:00

  • Comparative genomics of Eucalyptus and Corymbia reveals low rates of genome structural rearrangement.

    abstract:BACKGROUND:Previous studies suggest genome structure is largely conserved between Eucalyptus species. However, it is unknown if this conservation extends to more divergent eucalypt taxa. We performed comparative genomics between the eucalypt genera Eucalyptus and Corymbia. Our results will facilitate transfer of genomi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3782-7

    authors: Butler JB,Vaillancourt RE,Potts BM,Lee DJ,King GJ,Baten A,Shepherd M,Freeman JS

    updated:2017-05-22 00:00:00

  • Hyper-expansion of large DNA segments in the genome of kuruma shrimp, Marsupenaeus japonicus.

    abstract:BACKGROUND:Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Ma...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-141

    authors: Koyama T,Asakawa S,Katagiri T,Shimizu A,Fagutao FF,Mavichak R,Santos MD,Fuji K,Sakamoto T,Kitakado T,Kondo H,Shimizu N,Aoki T,Hirono I

    updated:2010-02-26 00:00:00

  • Comparative genomics and evolution of the HSP90 family of genes across all kingdoms of organisms.

    abstract:BACKGROUND:HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-7-156

    authors: Chen B,Zhong D,Monteiro A

    updated:2006-06-17 00:00:00

  • Exploiting combinatorial cultivation conditions to infer transcriptional regulation.

    abstract:BACKGROUND:Regulatory networks often employ the model that attributes changes in gene expression levels, as observed across different cellular conditions, to changes in the activity of transcription factors (TFs). Although the actual conditions that trigger a change in TF activity should form an integral part of the ge...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-25

    authors: Knijnenburg TA,de Winde JH,Daran JM,Daran-Lapujade P,Pronk JT,Reinders MJ,Wessels LF

    updated:2007-01-22 00:00:00

  • Multifunctional polyketide synthase genes identified by genomic survey of the symbiotic dinoflagellate, Symbiodinium minutum.

    abstract:BACKGROUND:Dinoflagellates are unicellular marine and freshwater eukaryotes. They possess large nuclear genomes (1.5-245 gigabases) and produce structurally unique and biologically active polyketide secondary metabolites. Although polyketide biosynthesis is well studied in terrestrial and freshwater organisms, only rec...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2195-8

    authors: Beedessee G,Hisata K,Roy MC,Satoh N,Shoguchi E

    updated:2015-11-14 00:00:00

  • Patterned sequence in the transcriptome of vascular plants.

    abstract:BACKGROUND:Microsatellites (repeated subsequences based on motifs of one to six nucleotides) are widely used as codominant genetic markers because of their frequent polymorphism and relative selective neutrality. Minisatellites are repeats of motifs having seven or more nucleotides. The large number of EST sequences no...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-173

    authors: Crane CF

    updated:2007-06-15 00:00:00

  • Genome-wide identification of novel intergenic enhancer-like elements: implications in the regulation of transcription in Plasmodium falciparum.

    abstract:BACKGROUND:The molecular mechanisms of transcriptional regulation are poorly understood in Plasmodium falciparum. In addition, most of the genes in Plasmodium falciparum are transcriptionally poised and only a handful of cis-regulatory elements are known to operate in transcriptional regulation. Here, we employed an ep...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4052-4

    authors: Ubhe S,Rawat M,Verma S,Anamika K,Karmodiya K

    updated:2017-08-23 00:00:00

  • TICdb: a collection of gene-mapped translocation breakpoints in cancer.

    abstract:BACKGROUND:Despite the importance of chromosomal translocations in the initiation and/or progression of cancer, a comprehensive catalog of translocation breakpoints in which these are precisely located on the reference sequence of the human genome is not available at present. DESCRIPTION:We have created a database tha...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-33

    authors: Novo FJ,de Mendíbil IO,Vizmanos JL

    updated:2007-01-26 00:00:00

  • Widespread Alu repeat-driven expansion of consensus DR2 retinoic acid response elements during primate evolution.

    abstract:BACKGROUND:Nuclear receptors are hormone-regulated transcription factors whose signaling controls numerous aspects of development and physiology. Many receptors recognize DNA hormone response elements formed by direct repeats of RGKTCA motifs separated by 1 to 5 bp (DR1-DR5). Although many known such response elements ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-23

    authors: Laperriere D,Wang TT,White JH,Mader S

    updated:2007-01-19 00:00:00

  • Identification and evolutionary analysis of the nucleolar proteome of Giardia lamblia.

    abstract:BACKGROUND:The nucleoli, including their proteomes, of higher eukaryotes have been extensively studied, while few studies about the nucleoli of the lower eukaryotes - protists were reported. Giardia lamblia, a protist with the controversy of whether it is an extreme primitive eukaryote or just a highly evolved parasite...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6679-9

    authors: Feng JM,Yang CL,Tian HF,Wang JX,Wen JF

    updated:2020-03-30 00:00:00

  • Visualizing spatiotemporal dynamics of apoptosis after G1 arrest by human T cell leukemia virus type 1 Tax and insights into gene expression changes using microarray-based gene expression analysis.

    abstract:BACKGROUND:Human T cell leukemia virus type 1 (HTLV-1) Tax is a potent activator of viral and cellular gene expression that interacts with a number of cellular proteins. Many reports show that Tax is capable of regulating cell cycle progression and apoptosis both positively and negatively. However, it still remains to ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-275

    authors: Arainga M,Murakami H,Aida Y

    updated:2012-06-22 00:00:00

  • Genome-wide study on genetic diversity and phylogeny of five species in the genus Cervus.

    abstract:BACKGROUND:Previous investigations of phylogeny in Cervus recovered many clades without whole genomic support. METHODS:In this study, the genetic diversity and phylogeny of 5 species (21 subspecies/populations from C. unicolor, C. albirostris, C. nippon, C. elaphus and C. eldii) in the genus Cervus were analyzed using...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5785-z

    authors: Hu P,Shao Y,Xu J,Wang T,Li Y,Liu H,Rong M,Su W,Chen B,Cui S,Cui X,Yang F,Tamate H,Xing X

    updated:2019-05-17 00:00:00

  • Genome-wide association analysis identified splicing single nucleotide polymorphism in CFLAR predictive of triptolide chemo-sensitivity.

    abstract:BACKGROUND:Triptolide is a therapeutic diterpenoid derived from the Chinese herb Tripterygium wilfordii Hook f. Triptolide has been shown to induce apoptosis by activation of pro-apoptotic proteins, inhibiting NFkB and c-KIT pathways, suppressing the Jak2 transcription, activating MAPK8/JNK signaling and modulating the...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1614-1

    authors: Chauhan L,Jenkins GD,Bhise N,Feldberg T,Mitra-Ghosh T,Fridley BL,Lamba JK

    updated:2015-06-30 00:00:00

  • A genomics-based systems approach towards drug repositioning for rheumatoid arthritis.

    abstract:BACKGROUND:Rheumatoid arthritis (RA) is a chronic autoimmune disease characterized by inflammation and destruction of synovial joints. RA affects up to 1 % of the population worldwide. Currently, there are no drugs that can cure RA or achieve sustained remission. The unknown cause of the disease represents a significan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2910-0

    authors: Xu R,Wang Q

    updated:2016-08-22 00:00:00

  • Reconstruction of temporal activity of microRNAs from gene expression data in breast cancer cell line.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are small non-coding RNAs that regulate genes at the post-transcriptional level in spatiotemporal manner. Several miRNAs are identified as prognostic and diagnostic markers in many human cancers. Estimation of the temporal activities of the miRNAs is an important step in the way to underst...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2260-3

    authors: Jayavelu ND,Bar N

    updated:2015-12-18 00:00:00

  • Identification and functional analysis of early gene expression induced by circadian light-resetting in Drosophila.

    abstract:BACKGROUND:The environmental light-dark cycle is the dominant cue that maintains 24-h biological rhythms in multicellular organisms. In Drosophila, light entrainment is mediated by the photosensitive protein CRYPTOCHROME, but the role and extent of transcription regulation in light resetting of the dipteran clock is ye...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1787-7

    authors: Adewoye AB,Kyriacou CP,Tauber E

    updated:2015-08-01 00:00:00