SCOPE++: sequence classification of homoPolymer emissions.

Abstract:

BACKGROUND:mRNA polyadenylation, the addition of a poly(A) tail to the 3'-end of pre-mRNA, is a process critical to gene expression and regulation in eukaryotes. To understand the molecular mechanisms governing polyadenylation and other relevant biological processes, it is important to identify these poly(A) tails accurately in transcriptome sequencing data and differentiate them from artificial adapter sequences added in the sequencing process. But the annotation of these tails is complicated by the presence of sequencing errors and post-transcriptional modifications. While determining that a tail is present in a given transcript fragment is straight-forward, these obfuscations make the problem of boundary identification a challenge; conventional seed-and-extend algorithms struggle to accurately identify these poly(A) tail end-points. Further, all existing tools that we are aware of focus exclusively on the trimming of poly(A) tails, failing to provide the detailed information needed for studying the polyadenylation process. RESULTS:We have created SCOPE++, an open-source tool for finding the precise border of poly(A) tails and other homopolymers in raw mRNA sequence reads. Based on a Hidden Markov Model (HMM) approach, SCOPE++ accurately identifies specific homopolymer sequences in error-prone EST/cDNA data or RNA-Seq data at a speed appropriate for large sequence sets. CONCLUSIONS:We demonstrate that our tool can precisely identify poly(A) tails with near perfect accuracy at the speed required for high-throughput applications, providing a valuable resource for polyadenylation research.

journal_name

Genomics

journal_title

Genomics

authors

Morton JT,Abrudan P,Figueroa N,Liang C,Karro JE

doi

10.1016/j.ygeno.2014.07.005

subject

Has Abstract

pub_date

2014-09-01 00:00:00

pages

157-62

issue

3

eissn

0888-7543

issn

1089-8646

pii

S0888-7543(14)00120-7

journal_volume

104

pub_type

杂志文章

相关文献

GENOMICS文献大全
  • Chromosomal mapping of five highly conserved murine homologues of the Drosophila RING finger gene seven-in-absentia.

    abstract::Seven-in-absentia (sina) is epistatic to all other known genes in the sevenless-ras signaling pathway, which mediates R7 photoreceptor formation in the Drosophila eye. The murine genome contains several closely related sina homologues (Siah1A-D, Siah2) that are also likely to participate in ras signaling. As part of a...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4642

    authors: Holloway AJ,Della NG,Fletcher CF,Largespada DA,Copeland NG,Jenkins NA,Bowtell DD

    更新日期:1997-04-15 00:00:00

  • Stage-specific transcriptomic analysis of the model cestode Hymenolepis microstoma.

    abstract::Most parasitic flatworms go through different life stages with important physiological and morphological changes. In this work, we used a transcriptomic approach to analyze the main life-stages of the model tapeworm Hymenolepis microstoma (eggs, cysticercoids, and adults). Our results showed massive transcriptomic cha...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2021.01.005

    authors: Preza M,Calvelo J,Langleib M,Hoffmann F,Castillo E,Koziol U,Iriarte A

    更新日期:2021-01-21 00:00:00

  • Evidence that the SRY protein is encoded by a single exon on the human Y chromosome.

    abstract::To facilitate studies of the SRY gene, a 4741-bp portion of the sex-determining region of the human Y chromosome was sequenced and characterized. Two RNAs were found to hybridize to this genomic segment, one transcript deriving from SRY and the second cross-hybridizing to a pseudogene located 2.5 kb 5' of the SRY open...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1993.1395

    authors: Behlke MA,Bogan JS,Beer-Romero P,Page DC

    更新日期:1993-09-01 00:00:00

  • CLONEPLACER: a software tool for simulating contig formation for ordered shotgun sequencing.

    abstract::This communication describes a software tool that enables one to simulate large-scale regional mapping using an ordered shotgun sequencing approach. The analysis routines that are provided yield an estimate of the depth of coverage of the physical map, the largest contig formed, and the number of gaps remaining at any...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80057-s

    authors: Singh GB,Krawetz SA

    更新日期:1995-01-20 00:00:00

  • Structure, chromosomal location, and expression pattern of three mouse genes homologous to the human MAGE genes.

    abstract::The human MAGE1 gene directs the expression of an antigen recognized on a melanoma by autologous cytolytic T lymphocytes. MAGE1 belongs to a family of genes that are expressed in a number of tumors of various histological types but not in normal tissues except testis. The MAGE genes are arranged in two groups that are...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1995.1108

    authors: De Backer O,Verheyden AM,Martin B,Godelaine D,De Plaen E,Brasseur R,Avner P,Boon T

    更新日期:1995-07-01 00:00:00

  • Identification of two evolutionarily conserved and functional regulatory elements in intron 2 of the human BRCA1 gene.

    abstract::Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in in...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2005.05.006

    authors: Wardrop SL,Brown MA,kConFab Investigators.

    更新日期:2005-09-01 00:00:00

  • Structural organization and chromosomal localization of Hyal2, a gene encoding a lysosomal hyaluronidase.

    abstract::The human HYAL2 gene encodes a lysosomal hyaluronidase that is related to the testicular PH-20 hyaluronidase. Regions conserved in these proteins have been used to design PCR primers suitable for the isolation of a fragment of the murine Hyal2 gene. This fragment was used to isolate the Hyal2 cDNA from a cDNA library....

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1998.5472

    authors: Strobl B,Wechselberger C,Beier DR,Lepperdinger G

    更新日期:1998-10-15 00:00:00

  • Random forests for genomic data analysis.

    abstract::Random forests (RF) is a popular tree-based ensemble machine learning tool that is highly data adaptive, applies to "large p, small n" problems, and is able to account for correlation as well as interactions among features. This makes RF particularly appealing for high-dimensional genomic data analysis. In this articl...

    journal_title:Genomics

    pub_type: 杂志文章,评审

    doi:10.1016/j.ygeno.2012.04.003

    authors: Chen X,Ishwaran H

    更新日期:2012-06-01 00:00:00

  • cDNA and genomic cloning of human palmitoyl-protein thioesterase (PPT), the enzyme defective in infantile neuronal ceroid lipofuscinosis.

    abstract::Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. We recently reported mutations in PPT in patients with infantile neuronal ceroid lipofuscinosis (INCL), a severe neurodegenerative disorder (J. Vesa et al., 1995, Nature 376: 58...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.0292

    authors: Schriner JE,Yi W,Hofmann SL

    更新日期:1996-06-15 00:00:00

  • The human glutamate dehydrogenase gene family: gene organization and structural characterization.

    abstract::Glutamate dehydrogenase is a mitochondrially located, key metabolic enzyme. In addition to its general metabolic role, GLUD is important in neurotransmission. Significant alterations in GLUD enzymatic activity have been associated with certain neurodegenerative human disorders. Although a single species of human GLUD ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1993.1152

    authors: Michaelidis TM,Tzimagiorgis G,Moschonas NK,Papamatheakis J

    更新日期:1993-04-01 00:00:00

  • A large duplicated area in the polycystic kidney disease 1 (PKD1) region of chromosome 16 is prone to rearrangement.

    abstract::An area of 500 kb at the proximal end of the polycystic kidney disease 1 (PKD1) region has been mapped in detail, with 260 kb cloned in cosmids. The area cloned from normal individuals contains two homologous but divergent regions each of 75 kb, including the previously described marker 26-6. Pulsed-field gel electrop...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1994.1507

    authors: Harris PC,Thomas S,MacCarthy AB,Stallings RL,Breuning MH,Jenne DE,Fink TM,Buckle VJ,Ratcliffe PJ,Ward CJ

    更新日期:1994-09-15 00:00:00

  • Paralogy mapping: identification of a region in the human MHC triplicated onto human chromosomes 1 and 9 allows the prediction and isolation of novel PBX and NOTCH loci.

    abstract::The human genome contains a group of gene families whose members map within the same regions of chromosomes 1, 6, and 9. The number of gene families involved and their pronounced clustering to the same areas of the genome indicate that their mapping relationship is nonrandom. By combining mapping data and sequence inf...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.0328

    authors: Katsanis N,Fitzgibbon J,Fisher EM

    更新日期:1996-07-01 00:00:00

  • Characterization of the complete mitochondrial genome of Uca lacteus and comparison with other Brachyuran crabs.

    abstract::Brachyuran crabs comprise the most species-rich clade among the crustacean order Decapoda and are divided into several major superfamilies. However, the monophyly of the superfamilies Ocypodoidea and Grapsoidea in their current compositions within the Brachyura remains inconclusive. In this study, the complete mitocho...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2019.06.004

    authors: Wang Z,Shi X,Guo H,Tang D,Bai Y,Wang Z

    更新日期:2020-01-01 00:00:00

  • Genome-wide analysis of expression modes and DNA methylation status at sense-antisense transcript loci in mouse.

    abstract::The functionality of sense-antisense transcripts (SATs), although widespread throughout the mammalian genome, is largely unknown. Here, we analyzed the SATs expression and its associated promoter DNA methylation status by surveying 12 tissues of mice to gain insights into the relationship between expression and DNA me...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2010.08.007

    authors: Watanabe Y,Numata K,Murata S,Osada Y,Saito R,Nakaoka H,Yamamoto N,Watanabe K,Kato H,Abe K,Kiyosawa H

    更新日期:2010-12-01 00:00:00

  • Identification and characterization of differentially expressed genes in the rice root following exogenous application of spermidine during salt stress.

    abstract::Salinity is a major limiting factor in crop production. Exogenous spermidine (spd) effectively ameliorates salt injury, though the underlying molecular mechanism is poorly understood. We have used a suppression subtractive hybridization method to construct a cDNA library that has identified up-regulated genes from ric...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2020.07.011

    authors: Saha J,Giri K,Roy S

    更新日期:2020-11-01 00:00:00

  • Characterization of the mouse apolipoprotein Apoa-1/Apoc-3 gene locus: genomic, mRNA, and protein sequences with comparisons to other species.

    abstract::In this report we present the genomic, cDNA, and predicted protein sequences for mouse apolipoproteins A-I and CIII, as well as sequence comparisons with other species. The genes for these apolipoproteins are within 2.5 kb of each other and convergently transcribed. The almost 9 kb of genomic sequence presented extend...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/s0888-7543(05)80133-8

    authors: Januzzi JL,Azrolan N,O'Connell A,Aalto-Setälä K,Breslow JL

    更新日期:1992-12-01 00:00:00

  • Genes regulating the serotonin metabolic pathway in the brain stem and their role in the etiopathogenesis of the sudden infant death syndrome.

    abstract::Genotypes and allelic frequencies of TPH2, 5-HTTLPR, the 5-HTT (SLC6A4) intron 2 variable-number tandem repeat (VNTR) region, and the MAOA VNTR region were determined in brain-stem samples of 20 "genuine" SIDS cases and compared with results obtained from 150 healthy controls. The SNP G1463A responsible for 80% functi...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2008.01.010

    authors: Nonnis Marzano F,Maldini M,Filonzi L,Lavezzi AM,Parmigiani S,Magnani C,Bevilacqua G,Matturri L

    更新日期:2008-06-01 00:00:00

  • The mouse neurofibromatosis type 2 gene maps to chromosome 11.

    abstract::Neurofibromatosis type 2 (NF2) is a dominantly inherited disease characterized by the development of bilateral vestibular schwannomas and meningiomas, which together represent 30% of primary brain tumors. The NF2 gene, which has recently been isolated, maps to the long arm of human chromosome 22. Using recombinant inb...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1994.1291

    authors: Claudio JO,Malo D,Rouleau GA

    更新日期:1994-05-15 00:00:00

  • Genetic basis of neural tube defects: the mouse gene loop-tail maps to a region of chromosome 1 syntenic with human 1q21-q23.

    abstract::A genetic basis for neural tube defects (NTD) is rarely doubted, but the genes involved have not yet been identified. This is partly due to a lack of suitable families on which to perform linkage analysis. An alternative approach is to use the many mouse genes that cause NTD as a means of isolating their human homolog...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80165-i

    authors: Stanier P,Henson JN,Eddleston J,Moore GE,Copp AJ

    更新日期:1995-04-10 00:00:00

  • Amplification of the E2F1 transcription factor gene in the HEL erythroleukemia cell line.

    abstract::The E2F transcription factor plays an important regulatory role in cell proliferation, mediating the expression of genes whose products are essential for inducing resting cells to enter the cell cycle and synthesize DNA. To investigate the possible involvement of E2F in hematopoietic malignancies, we isolated genomic ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(95)80118-6

    authors: Saito M,Helin K,Valentine MB,Griffith BB,Willman CL,Harlow E,Look AT

    更新日期:1995-01-01 00:00:00

  • The proximity of DNA sequences in interphase cell nuclei is correlated to genomic distance and permits ordering of cosmids spanning 250 kilobase pairs.

    abstract::The physical distance between DNA sequences in interphase nuclei was determined using eight cosmids containing fragments of the Chinese hamster genome that span 273 kb surrounding the dihydrofolate reductase (DHFR) gene. The distance between these sequences at the molecular level has been determined previously by rest...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(89)90112-2

    authors: Trask B,Pinkel D,van den Engh G

    更新日期:1989-11-01 00:00:00

  • DNA-methylation dependent regulation of embryo-specific 5S ribosomal DNA cluster transcription in adult tissues of sea urchin Paracentrotus lividus.

    abstract::We have previously reported a molecular and cytogenetic characterization of three different 5S rDNA clusters in the sea urchin Paracentrotus lividus and recently, demonstrated the presence of high heterogeneity in functional 5S rRNA. In this paper, we show some important distinctive data on 5S rRNA transcription for t...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2013.08.001

    authors: Bellavia D,Dimarco E,Naselli F,Caradonna F

    更新日期:2013-10-01 00:00:00

  • miRNA biomarkers for predicting overall survival outcomes for head and neck squamous cell carcinoma.

    abstract::Head and neck squamous cell carcinoma (HNSCC) is a malignant tumor of the upper aerodigestive tract. The loss and gain of miRNA function promote cancer development through various mechanisms. RNA sequencing (RNA-seq) and miRNAs sequencing data from the Cancer Genome Atlas (TCGA) was used to show the dysfunctional miRN...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2020.12.002

    authors: Wu ZH,Zhong Y,Zhou T,Xiao HJ

    更新日期:2021-01-01 00:00:00

  • Distribution of DNA methylation, CpGs, and CpG islands in human isochores.

    abstract::DNA methylation is a major epigenetic modification of the genome that affects basic biological functions, such as gene expression and cell development. We used the human genome sequences and the DNA methylation data that are available in order to establish a map of the levels of GC and methylation in isochores. We als...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2009.09.006

    authors: Varriale A,Bernardi G

    更新日期:2010-01-01 00:00:00

  • Fractalkine shares signal sequence with TARC: gene structures and expression profiles of two chemokine genes.

    abstract::In the process of cloning the gene (Scyd1) encoding the mouse CX3C chemokine fractalkine, we identified a novel cDNA that encodes a chimeric molecule termed fracTARC. This molecule is a variant form of the mouse CC chemokine, TARC (for thymus- and activation-regulated chemokine), bearing the fractalkine signal sequenc...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2001.6585

    authors: Hiroyama T,Iwama A,Nakamura Y,Nakauchi H

    更新日期:2001-07-01 00:00:00

  • Practicability of detecting somatic point mutation from RNA high throughput sequencing data.

    abstract::Traditionally, somatic mutations are detected by examining DNA sequence. The maturity of sequencing technology has allowed researchers to screen for somatic mutations in the whole genome. Increasingly, researchers have become interested in identifying somatic mutations through RNAseq data. With this motivation, we eva...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2016.03.006

    authors: Sheng Q,Zhao S,Li CI,Shyr Y,Guo Y

    更新日期:2016-05-01 00:00:00

  • Gene expression profiling in livers of mice after acute inhibition of beta-oxidation.

    abstract::Inborn errors of mitochondrial beta-oxidation cause ectopic fat accumulation, particularly in the liver. Fatty liver is associated with insulin resistance and predisposes to hepatic fibrosis. The factors underlying the pathophysiological consequences of hepatic fat accumulation have remained poorly defined. Gene expre...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2007.08.004

    authors: van der Leij FR,Bloks VW,Grefhorst A,Hoekstra J,Gerding A,Kooi K,Gerbens F,te Meerman G,Kuipers F

    更新日期:2007-12-01 00:00:00

  • Molecular cloning and characterization of a novel human CC chemokine, SCYA26.

    abstract::By searching the Expressed Sequence Tag database, a full-length cDNA for a novel human CC chemokine was cloned. This cDNA encoded a 94-amino-acid protein with a putative signal peptide of 26 amino acids. The deduced mature protein had the four conserved cysteine residues characteristic of CC chemokines and showed 44% ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1999.5837

    authors: Guo RF,Ward PA,Hu SM,McDuffie JE,Huber-Lang M,Shi MM

    更新日期:1999-06-15 00:00:00

  • Isolation of a novel human homologue of the gene coding for echinoderm microtubule-associated protein (EMAP) from the Usher syndrome type 1a locus at 14q32.

    abstract::Usher syndrome type 1 (USH1) is an autosomal recessive, genetically heterogeneous disorder causing severe congenital deafness, retinitis pigmentosa, and vestibular dysfunction. The USHla locus located on 14q32 has been linked to the genetic markers D14S250 and D14S78. Using D14S250 and D14S78, we have isolated two non...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4779

    authors: Eudy JD,Ma-Edmonds M,Yao SF,Talmadge CB,Kelley PM,Weston MD,Kimberling WJ,Sumegi J

    更新日期:1997-07-01 00:00:00

  • Genetic variants associated with primary open angle glaucoma in Indian population.

    abstract::Glaucoma is a very common disorder of the eye wherein the disturbance of the structural or functional integrity of the optic nerve causes characteristic atrophic changes in the optic nerve, which may lead to specific visual field defects over time. Primary open angle glaucoma (POAG) is most frequent among the three pr...

    journal_title:Genomics

    pub_type: 杂志文章,评审

    doi:10.1016/j.ygeno.2016.11.003

    authors: Kumar S,Malik MA,K S,Sihota R,Kaur J

    更新日期:2017-01-01 00:00:00