Abstract:
BACKGROUND: Alternative polyadenylation (APA) has emerged as a pervasive mechanism that contributes to transcriptome complexity and the dynamics of gene regulation. The current flood of whole-genome poly(A) site data generated by 3' end sequencing under various conditions provides a valuable resource for studying APA-related gene expression. Cluster analysis is a powerful technique for investigating the association structure among genes; however, conventional gene clustering methods are not suitable for APA-related data because they neither consider the information on poly(A) sites within each gene (e.g., location, abundance, and number) nor measure the association among poly(A) sites between two genes.
RESULTS: Here we propose a computational framework, named PASCCA, for clustering genes from replicated or unreplicated poly(A) site data using canonical correlation analysis (CCA). PASCCA incorporates multiple layers of gene expression data from both the poly(A) site level and the gene level, and takes into account the number of replicates and the variability within each experimental group. Moreover, PASCCA characterizes poly(A) sites in several ways, including abundance and relative usage, which exploits the advantages of 3' end deep sequencing in quantifying APA sites. Using both real and synthetic poly(A) site data sets, the cluster analysis demonstrates that PASCCA outperforms other widely used distance measures under five performance metrics: connectivity, the Dunn index, average distance, average distance between means, and the biological homogeneity index. We also used PASCCA to infer APA-specific gene modules from recently published poly(A) site data of rice and discovered some distinct functional gene modules. We have made PASCCA available as an easy-to-use R package for APA-related gene expression analyses, including the characterization of poly(A) sites, quantification of association between genes, and clustering of genes.
CONCLUSIONS: By providing a better treatment of the noise inherent in repeated measurements and taking into account multiple layers of poly(A) site data, PASCCA could serve as a general tool for clustering and analyzing APA-specific gene expression data. PASCCA could be used to elucidate the dynamic interplay of genes and their APA sites across various biological conditions from emerging 3' end sequencing data, and thereby help address complex biological phenomena.
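The abstract describes characterizing poly(A) sites by abundance and relative usage, and measuring the association between two genes with canonical correlation analysis. The following is a minimal Python sketch of that general idea, not the PASCCA R package's implementation or API: all function names (`relative_usage`, `first_canonical_correlation`, `cca_distance`) are hypothetical, the input layout (poly(A) sites as rows, samples as columns) is an assumption, and replicate handling and within-group variability, which the abstract says PASCCA accounts for, are not modeled here.

```python
import numpy as np


def relative_usage(counts):
    """Per-sample relative usage of each poly(A) site (rows: sites, columns: samples)."""
    counts = np.atleast_2d(np.asarray(counts, dtype=float))
    totals = counts.sum(axis=0, keepdims=True)
    usage = np.zeros_like(counts)
    # Samples in which the gene has no reads keep a usage of 0 for every site.
    np.divide(counts, totals, out=usage, where=totals > 0)
    return usage


def _orthonormal_basis(matrix, tol=1e-10):
    """Orthonormal basis of the column space, dropping near-zero directions."""
    u, s, _ = np.linalg.svd(matrix, full_matrices=False)
    if s.size == 0 or s[0] <= tol:
        return u[:, :0]
    return u[:, s > tol * s[0]]


def first_canonical_correlation(gene_a, gene_b):
    """Largest canonical correlation between two genes' site-by-sample matrices.

    Samples are treated as observations and poly(A) sites as variables, so the
    two genes may have different numbers of sites but must share the samples.
    """
    a = np.atleast_2d(np.asarray(gene_a, dtype=float)).T  # samples x sites
    b = np.atleast_2d(np.asarray(gene_b, dtype=float)).T
    a = a - a.mean(axis=0)
    b = b - b.mean(axis=0)
    qa = _orthonormal_basis(a)
    qb = _orthonormal_basis(b)
    if qa.shape[1] == 0 or qb.shape[1] == 0:
        return 0.0  # a gene with no variation across samples carries no signal
    # Canonical correlations are the singular values of Qa^T Qb.
    rho = np.linalg.svd(qa.T @ qb, compute_uv=False)
    return float(np.clip(rho[0], 0.0, 1.0))


def cca_distance(gene_a, gene_b):
    """Dissimilarity in [0, 1]; an illustrative choice, not PASCCA's definition."""
    return 1.0 - first_canonical_correlation(gene_a, gene_b)


if __name__ == "__main__":
    # Made-up counts: gene A has two poly(A) sites, gene B has three, six samples each.
    gene_a = [[10, 12, 30, 28, 5, 6], [40, 38, 20, 22, 45, 44]]
    gene_b = [[3, 4, 9, 8, 2, 2], [7, 6, 1, 2, 8, 9], [5, 5, 5, 6, 4, 5]]
    print(cca_distance(relative_usage(gene_a), relative_usage(gene_b)))
```

A pairwise matrix of such distances could then be passed to any standard clustering routine. Note that when a gene has nearly as many poly(A) sites as there are samples, the first canonical correlation approaches 1 regardless of the data, so a sketch like this is only informative when samples clearly outnumber sites.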
journal_name:BMC Genomics
journal_title:BMC genomics
authors:Ye W, Long Y, Ji G, Su Y, Ye P, Fu H, Wu X
doi:10.1186/s12864-019-5433-7
subject:Has Abstract
pub_date:2019-01-22 00:00:00
pages:75
issue:1
issn:1471-2164
pii:10.1186/s12864-019-5433-7
journal_volume:20
pub_type: Journal article

Related literature
BMC GENOMICS literature collection
abstract:BACKGROUND:Small RNAs (sRNAs) have emerged as important regulatory molecules and have been studied in several bacteria. However, to date, there have been no whole-transcriptome studies on sRNAs in any of the Soft Rot Enterobacteriaceae (SRE) group of pathogens. Although the main ecological niches for these pathogens ar...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-016-2376-0
update_date:2016-01-12 00:00:00
abstract:An amendment to this paper has been published and can be accessed via the original article. ...
journal_title:BMC genomics
pub_type: Journal article, Published erratum
doi:10.1186/s12864-020-06939-7
update_date:2020-08-11 00:00:00
abstract:BACKGROUND:Screening for differentially expressed genes on the genomic scale and comparative analysis of the expression profiles of orthologous genes between species to study gene function and regulation are becoming increasingly feasible. Expressed sequence tags (ESTs) are an excellent source of data for such studies ...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-7-86
update_date:2006-04-21 00:00:00
abstract:BACKGROUND:Foraminiferan protists, which are significant players in most marine ecosystems, are also genetic innovators, harboring unique modifications to proteins that make up the basic eukaryotic cell machinery. Despite their ecological and evolutionary importance, foraminiferan genomes are poorly understood due to t...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-12-169
update_date:2011-03-31 00:00:00
abstract:BACKGROUND:Ensifer alkalisoli YIC4027, a recently characterized nitrogen-fixing bacterium of the genus Ensifer, has been isolated from root nodules of the host plant Sesbania cannabina. This plant is widely used as green manure and for soil remediation. E. alkalisoli YIC4027 can grow in saline-alkaline soils and is a n...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-019-6004-7
update_date:2019-08-12 00:00:00
abstract:BACKGROUND:The host response to influenza A infections is strongly influenced by host genetic factors. Animal models of genetically diverse mouse strains are well suited to identify host genes involved in severe pathology, viral replication and immune responses. Here, we have utilized a dual RNAseq approach that allowe...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-015-1867-8
update_date:2015-09-02 00:00:00
abstract:BACKGROUND:The papaya Y chromosome has undergone a degenerative expansion from its ancestral autosome, as a consequence of recombination suppression in the sex determining region of the sex chromosomes. The non-recombining feature led to the accumulation of repetitive sequences in the male- or hermaphrodite-specific re...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-15-335
update_date:2014-05-04 00:00:00
abstract:BACKGROUND:Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. RESULTS:We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogast...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-9-509
update_date:2008-10-30 00:00:00
abstract:BACKGROUND:The unisexual Amazon molly (Poecilia formosa) originated from a hybridization between two sexual species, the sailfin molly (Poecilia latipinna) and the Atlantic molly (Poecilia mexicana). The Amazon molly reproduces clonally via sperm-dependent parthenogenesis (gynogenesis), in which the sperm of closely re...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-017-4382-2
update_date:2018-01-03 00:00:00
abstract:BACKGROUND:Auxin plays a critical role in inducing adventitious rooting in many plants. Indole-3-butyric acid (IBA) is the most widely employed auxin for adventitious rooting. However, the molecular mechanisms by which auxin regulate the process of adventitious rooting are less well known. RESULTS:The RNA-Seq data ana...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-016-2372-4
update_date:2016-01-12 00:00:00
abstract:BACKGROUND:High-throughput sequencing has opened up exciting possibilities in population and conservation genetics by enabling the assessment of genetic variation at genome-wide scales. One approach to reduce genome complexity, i.e. investigating only parts of the genome, is reduced-representation library (RRL) sequenc...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-15-16
update_date:2014-01-10 00:00:00
abstract:BACKGROUND:Identifying causative mutations or genes through which quantitative trait loci (QTL) act has proven very difficult. Using information such as gene expression may help to identify genes and mutations underlying QTL. Our objective was to identify regions associated both with production traits or fertility and ...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-019-5656-7
update_date:2019-04-15 00:00:00
abstract:DpbCasX, also called Cas12e, is an RNA-guided DNA endonuclease isolated from Deltaproteobacteria. In this paper I characterized the CasX-compatible genome editing sites in the reference genomes of yeast (Saccharomyces cerevisiae), flatworms (Caenorhabditis elegans), flies (Drosophila melanogaster), zebrafish (Danio re...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-019-5924-6
update_date:2019-06-27 00:00:00
abstract:BACKGROUND:Lysophosphatidic acid (LPA) is a lipid mediator that acts through specific G protein-coupled receptors to stimulate the proliferation, migration and survival of many cell types. LPA signaling has been implicated in development, wound healing and cancer. While LPA signaling pathways have been studied extensiv...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-9-387
update_date:2008-08-14 00:00:00
abstract:BACKGROUND:Previous studies suggest genome structure is largely conserved between Eucalyptus species. However, it is unknown if this conservation extends to more divergent eucalypt taxa. We performed comparative genomics between the eucalypt genera Eucalyptus and Corymbia. Our results will facilitate transfer of genomi...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-017-3782-7
update_date:2017-05-22 00:00:00
abstract:BACKGROUND:Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Ma...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-11-141
update_date:2010-02-26 00:00:00
abstract:BACKGROUND:HSP90 proteins are essential molecular chaperones involved in signal transduction, cell cycle control, stress management, and folding, degradation, and transport of proteins. HSP90 proteins have been found in a variety of organisms suggesting that they are ancient and conserved. In this study we investigate ...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-7-156
update_date:2006-06-17 00:00:00
abstract:BACKGROUND:Regulatory networks often employ the model that attributes changes in gene expression levels, as observed across different cellular conditions, to changes in the activity of transcription factors (TFs). Although the actual conditions that trigger a change in TF activity should form an integral part of the ge...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-8-25
update_date:2007-01-22 00:00:00
abstract:BACKGROUND:Dinoflagellates are unicellular marine and freshwater eukaryotes. They possess large nuclear genomes (1.5-245 gigabases) and produce structurally unique and biologically active polyketide secondary metabolites. Although polyketide biosynthesis is well studied in terrestrial and freshwater organisms, only rec...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-015-2195-8
update_date:2015-11-14 00:00:00
abstract:BACKGROUND:Microsatellites (repeated subsequences based on motifs of one to six nucleotides) are widely used as codominant genetic markers because of their frequent polymorphism and relative selective neutrality. Minisatellites are repeats of motifs having seven or more nucleotides. The large number of EST sequences no...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-8-173
update_date:2007-06-15 00:00:00
abstract:BACKGROUND:The molecular mechanisms of transcriptional regulation are poorly understood in Plasmodium falciparum. In addition, most of the genes in Plasmodium falciparum are transcriptionally poised and only a handful of cis-regulatory elements are known to operate in transcriptional regulation. Here, we employed an ep...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-017-4052-4
update_date:2017-08-23 00:00:00
abstract:BACKGROUND:Despite the importance of chromosomal translocations in the initiation and/or progression of cancer, a comprehensive catalog of translocation breakpoints in which these are precisely located on the reference sequence of the human genome is not available at present. DESCRIPTION:We have created a database tha...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-8-33
update_date:2007-01-26 00:00:00
abstract:BACKGROUND:Nuclear receptors are hormone-regulated transcription factors whose signaling controls numerous aspects of development and physiology. Many receptors recognize DNA hormone response elements formed by direct repeats of RGKTCA motifs separated by 1 to 5 bp (DR1-DR5). Although many known such response elements ...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-8-23
update_date:2007-01-19 00:00:00
abstract:BACKGROUND:The nucleoli, including their proteomes, of higher eukaryotes have been extensively studied, while few studies about the nucleoli of the lower eukaryotes - protists were reported. Giardia lamblia, a protist with the controversy of whether it is an extreme primitive eukaryote or just a highly evolved parasite...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-020-6679-9
update_date:2020-03-30 00:00:00
abstract:BACKGROUND:Human T cell leukemia virus type 1 (HTLV-1) Tax is a potent activator of viral and cellular gene expression that interacts with a number of cellular proteins. Many reports show that Tax is capable of regulating cell cycle progression and apoptosis both positively and negatively. However, it still remains to ...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/1471-2164-13-275
update_date:2012-06-22 00:00:00
abstract:BACKGROUND:Previous investigations of phylogeny in Cervus recovered many clades without whole genomic support. METHODS:In this study, the genetic diversity and phylogeny of 5 species (21 subspecies/populations from C. unicolor, C. albirostris, C. nippon, C. elaphus and C. eldii) in the genus Cervus were analyzed using...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-019-5785-z
update_date:2019-05-17 00:00:00
abstract:BACKGROUND:Triptolide is a therapeutic diterpenoid derived from the Chinese herb Tripterygium wilfordii Hook f. Triptolide has been shown to induce apoptosis by activation of pro-apoptotic proteins, inhibiting NFkB and c-KIT pathways, suppressing the Jak2 transcription, activating MAPK8/JNK signaling and modulating the...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-015-1614-1
update_date:2015-06-30 00:00:00
abstract:BACKGROUND:Rheumatoid arthritis (RA) is a chronic autoimmune disease characterized by inflammation and destruction of synovial joints. RA affects up to 1 % of the population worldwide. Currently, there are no drugs that can cure RA or achieve sustained remission. The unknown cause of the disease represents a significan...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-016-2910-0
update_date:2016-08-22 00:00:00
abstract:BACKGROUND:MicroRNAs (miRNAs) are small non-coding RNAs that regulate genes at the post-transcriptional level in spatiotemporal manner. Several miRNAs are identified as prognostic and diagnostic markers in many human cancers. Estimation of the temporal activities of the miRNAs is an important step in the way to underst...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-015-2260-3
update_date:2015-12-18 00:00:00
abstract:BACKGROUND:The environmental light-dark cycle is the dominant cue that maintains 24-h biological rhythms in multicellular organisms. In Drosophila, light entrainment is mediated by the photosensitive protein CRYPTOCHROME, but the role and extent of transcription regulation in light resetting of the dipteran clock is ye...
journal_title:BMC genomics
pub_type: Journal article
doi:10.1186/s12864-015-1787-7
update_date:2015-08-01 00:00:00