Abstract:
BACKGROUND:mRNA polyadenylation, the addition of a poly(A) tail to the 3'-end of pre-mRNA, is a process critical to gene expression and regulation in eukaryotes. To understand the molecular mechanisms governing polyadenylation and other relevant biological processes, it is important to identify these poly(A) tails accurately in transcriptome sequencing data and differentiate them from artificial adapter sequences added in the sequencing process. But the annotation of these tails is complicated by the presence of sequencing errors and post-transcriptional modifications. While determining that a tail is present in a given transcript fragment is straight-forward, these obfuscations make the problem of boundary identification a challenge; conventional seed-and-extend algorithms struggle to accurately identify these poly(A) tail end-points. Further, all existing tools that we are aware of focus exclusively on the trimming of poly(A) tails, failing to provide the detailed information needed for studying the polyadenylation process. RESULTS:We have created SCOPE++, an open-source tool for finding the precise border of poly(A) tails and other homopolymers in raw mRNA sequence reads. Based on a Hidden Markov Model (HMM) approach, SCOPE++ accurately identifies specific homopolymer sequences in error-prone EST/cDNA data or RNA-Seq data at a speed appropriate for large sequence sets. CONCLUSIONS:We demonstrate that our tool can precisely identify poly(A) tails with near perfect accuracy at the speed required for high-throughput applications, providing a valuable resource for polyadenylation research.
journal_name
Genomicsjournal_title
Genomicsauthors
Morton JT,Abrudan P,Figueroa N,Liang C,Karro JEdoi
10.1016/j.ygeno.2014.07.005subject
Has Abstractpub_date
2014-09-01 00:00:00pages
157-62issue
3eissn
0888-7543issn
1089-8646pii
S0888-7543(14)00120-7journal_volume
104pub_type
杂志文章相关文献
GENOMICS文献大全abstract::Seven-in-absentia (sina) is epistatic to all other known genes in the sevenless-ras signaling pathway, which mediates R7 photoreceptor formation in the Drosophila eye. The murine genome contains several closely related sina homologues (Siah1A-D, Siah2) that are also likely to participate in ras signaling. As part of a...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1997.4642
更新日期:1997-04-15 00:00:00
abstract::Most parasitic flatworms go through different life stages with important physiological and morphological changes. In this work, we used a transcriptomic approach to analyze the main life-stages of the model tapeworm Hymenolepis microstoma (eggs, cysticercoids, and adults). Our results showed massive transcriptomic cha...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2021.01.005
更新日期:2021-01-21 00:00:00
abstract::To facilitate studies of the SRY gene, a 4741-bp portion of the sex-determining region of the human Y chromosome was sequenced and characterized. Two RNAs were found to hybridize to this genomic segment, one transcript deriving from SRY and the second cross-hybridizing to a pseudogene located 2.5 kb 5' of the SRY open...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1993.1395
更新日期:1993-09-01 00:00:00
abstract::This communication describes a software tool that enables one to simulate large-scale regional mapping using an ordered shotgun sequencing approach. The analysis routines that are provided yield an estimate of the depth of coverage of the physical map, the largest contig formed, and the number of gaps remaining at any...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/0888-7543(95)80057-s
更新日期:1995-01-20 00:00:00
abstract::The human MAGE1 gene directs the expression of an antigen recognized on a melanoma by autologous cytolytic T lymphocytes. MAGE1 belongs to a family of genes that are expressed in a number of tumors of various histological types but not in normal tissues except testis. The MAGE genes are arranged in two groups that are...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1995.1108
更新日期:1995-07-01 00:00:00
abstract::Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in in...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2005.05.006
更新日期:2005-09-01 00:00:00
abstract::The human HYAL2 gene encodes a lysosomal hyaluronidase that is related to the testicular PH-20 hyaluronidase. Regions conserved in these proteins have been used to design PCR primers suitable for the isolation of a fragment of the murine Hyal2 gene. This fragment was used to isolate the Hyal2 cDNA from a cDNA library....
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1998.5472
更新日期:1998-10-15 00:00:00
abstract::Random forests (RF) is a popular tree-based ensemble machine learning tool that is highly data adaptive, applies to "large p, small n" problems, and is able to account for correlation as well as interactions among features. This makes RF particularly appealing for high-dimensional genomic data analysis. In this articl...
journal_title:Genomics
pub_type: 杂志文章,评审
doi:10.1016/j.ygeno.2012.04.003
更新日期:2012-06-01 00:00:00
abstract::Palmitoyl-protein thioesterase (PPT) is a small glycoprotein that removes palmitate groups from cysteine residues in lipid-modified proteins. We recently reported mutations in PPT in patients with infantile neuronal ceroid lipofuscinosis (INCL), a severe neurodegenerative disorder (J. Vesa et al., 1995, Nature 376: 58...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1996.0292
更新日期:1996-06-15 00:00:00
abstract::Glutamate dehydrogenase is a mitochondrially located, key metabolic enzyme. In addition to its general metabolic role, GLUD is important in neurotransmission. Significant alterations in GLUD enzymatic activity have been associated with certain neurodegenerative human disorders. Although a single species of human GLUD ...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1993.1152
更新日期:1993-04-01 00:00:00
abstract::An area of 500 kb at the proximal end of the polycystic kidney disease 1 (PKD1) region has been mapped in detail, with 260 kb cloned in cosmids. The area cloned from normal individuals contains two homologous but divergent regions each of 75 kb, including the previously described marker 26-6. Pulsed-field gel electrop...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1994.1507
更新日期:1994-09-15 00:00:00
abstract::The human genome contains a group of gene families whose members map within the same regions of chromosomes 1, 6, and 9. The number of gene families involved and their pronounced clustering to the same areas of the genome indicate that their mapping relationship is nonrandom. By combining mapping data and sequence inf...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1996.0328
更新日期:1996-07-01 00:00:00
abstract::Brachyuran crabs comprise the most species-rich clade among the crustacean order Decapoda and are divided into several major superfamilies. However, the monophyly of the superfamilies Ocypodoidea and Grapsoidea in their current compositions within the Brachyura remains inconclusive. In this study, the complete mitocho...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2019.06.004
更新日期:2020-01-01 00:00:00
abstract::The functionality of sense-antisense transcripts (SATs), although widespread throughout the mammalian genome, is largely unknown. Here, we analyzed the SATs expression and its associated promoter DNA methylation status by surveying 12 tissues of mice to gain insights into the relationship between expression and DNA me...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2010.08.007
更新日期:2010-12-01 00:00:00
abstract::Salinity is a major limiting factor in crop production. Exogenous spermidine (spd) effectively ameliorates salt injury, though the underlying molecular mechanism is poorly understood. We have used a suppression subtractive hybridization method to construct a cDNA library that has identified up-regulated genes from ric...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2020.07.011
更新日期:2020-11-01 00:00:00
abstract::In this report we present the genomic, cDNA, and predicted protein sequences for mouse apolipoproteins A-I and CIII, as well as sequence comparisons with other species. The genes for these apolipoproteins are within 2.5 kb of each other and convergently transcribed. The almost 9 kb of genomic sequence presented extend...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/s0888-7543(05)80133-8
更新日期:1992-12-01 00:00:00
abstract::Genotypes and allelic frequencies of TPH2, 5-HTTLPR, the 5-HTT (SLC6A4) intron 2 variable-number tandem repeat (VNTR) region, and the MAOA VNTR region were determined in brain-stem samples of 20 "genuine" SIDS cases and compared with results obtained from 150 healthy controls. The SNP G1463A responsible for 80% functi...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2008.01.010
更新日期:2008-06-01 00:00:00
abstract::Neurofibromatosis type 2 (NF2) is a dominantly inherited disease characterized by the development of bilateral vestibular schwannomas and meningiomas, which together represent 30% of primary brain tumors. The NF2 gene, which has recently been isolated, maps to the long arm of human chromosome 22. Using recombinant inb...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1994.1291
更新日期:1994-05-15 00:00:00
abstract::A genetic basis for neural tube defects (NTD) is rarely doubted, but the genes involved have not yet been identified. This is partly due to a lack of suitable families on which to perform linkage analysis. An alternative approach is to use the many mouse genes that cause NTD as a means of isolating their human homolog...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/0888-7543(95)80165-i
更新日期:1995-04-10 00:00:00
abstract::The E2F transcription factor plays an important regulatory role in cell proliferation, mediating the expression of genes whose products are essential for inducing resting cells to enter the cell cycle and synthesize DNA. To investigate the possible involvement of E2F in hematopoietic malignancies, we isolated genomic ...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/0888-7543(95)80118-6
更新日期:1995-01-01 00:00:00
abstract::The physical distance between DNA sequences in interphase nuclei was determined using eight cosmids containing fragments of the Chinese hamster genome that span 273 kb surrounding the dihydrofolate reductase (DHFR) gene. The distance between these sequences at the molecular level has been determined previously by rest...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/0888-7543(89)90112-2
更新日期:1989-11-01 00:00:00
abstract::We have previously reported a molecular and cytogenetic characterization of three different 5S rDNA clusters in the sea urchin Paracentrotus lividus and recently, demonstrated the presence of high heterogeneity in functional 5S rRNA. In this paper, we show some important distinctive data on 5S rRNA transcription for t...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2013.08.001
更新日期:2013-10-01 00:00:00
abstract::Head and neck squamous cell carcinoma (HNSCC) is a malignant tumor of the upper aerodigestive tract. The loss and gain of miRNA function promote cancer development through various mechanisms. RNA sequencing (RNA-seq) and miRNAs sequencing data from the Cancer Genome Atlas (TCGA) was used to show the dysfunctional miRN...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2020.12.002
更新日期:2021-01-01 00:00:00
abstract::DNA methylation is a major epigenetic modification of the genome that affects basic biological functions, such as gene expression and cell development. We used the human genome sequences and the DNA methylation data that are available in order to establish a map of the levels of GC and methylation in isochores. We als...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2009.09.006
更新日期:2010-01-01 00:00:00
abstract::In the process of cloning the gene (Scyd1) encoding the mouse CX3C chemokine fractalkine, we identified a novel cDNA that encodes a chimeric molecule termed fracTARC. This molecule is a variant form of the mouse CC chemokine, TARC (for thymus- and activation-regulated chemokine), bearing the fractalkine signal sequenc...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.2001.6585
更新日期:2001-07-01 00:00:00
abstract::Traditionally, somatic mutations are detected by examining DNA sequence. The maturity of sequencing technology has allowed researchers to screen for somatic mutations in the whole genome. Increasingly, researchers have become interested in identifying somatic mutations through RNAseq data. With this motivation, we eva...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2016.03.006
更新日期:2016-05-01 00:00:00
abstract::Inborn errors of mitochondrial beta-oxidation cause ectopic fat accumulation, particularly in the liver. Fatty liver is associated with insulin resistance and predisposes to hepatic fibrosis. The factors underlying the pathophysiological consequences of hepatic fat accumulation have remained poorly defined. Gene expre...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1016/j.ygeno.2007.08.004
更新日期:2007-12-01 00:00:00
abstract::By searching the Expressed Sequence Tag database, a full-length cDNA for a novel human CC chemokine was cloned. This cDNA encoded a 94-amino-acid protein with a putative signal peptide of 26 amino acids. The deduced mature protein had the four conserved cysteine residues characteristic of CC chemokines and showed 44% ...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1999.5837
更新日期:1999-06-15 00:00:00
abstract::Usher syndrome type 1 (USH1) is an autosomal recessive, genetically heterogeneous disorder causing severe congenital deafness, retinitis pigmentosa, and vestibular dysfunction. The USHla locus located on 14q32 has been linked to the genetic markers D14S250 and D14S78. Using D14S250 and D14S78, we have isolated two non...
journal_title:Genomics
pub_type: 杂志文章
doi:10.1006/geno.1997.4779
更新日期:1997-07-01 00:00:00
abstract::Glaucoma is a very common disorder of the eye wherein the disturbance of the structural or functional integrity of the optic nerve causes characteristic atrophic changes in the optic nerve, which may lead to specific visual field defects over time. Primary open angle glaucoma (POAG) is most frequent among the three pr...
journal_title:Genomics
pub_type: 杂志文章,评审
doi:10.1016/j.ygeno.2016.11.003
更新日期:2017-01-01 00:00:00