Synthetic spike-in standards for RNA-seq experiments.

Abstract:

:High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2(20) concentration range as spike-in controls to measure sensitivity, accuracy, and biases in RNA-seq experiments as well as to derive standard curves for quantifying the abundance of transcripts. We observed linearity between read density and RNA input over the entire detection range and excellent agreement between replicates, but we observed significantly larger imprecision than expected under pure Poisson sampling errors. We use the control RNAs to directly measure reproducible protocol-dependent biases due to GC content and transcript length as well as stereotypic heterogeneity in coverage across transcripts correlated with position relative to RNA termini and priming sequence bias. These effects lead to biased quantification for short transcripts and individual exons, which is a serious problem for measurements of isoform abundances, but that can partially be corrected using appropriate models of bias. By using the control RNAs, we derive limits for the discovery and detection of rare transcripts in RNA-seq experiments. By using data collected as part of the model organism and human Encyclopedia of DNA Elements projects (ENCODE and modENCODE), we demonstrate that external RNA controls are a useful resource for evaluating sensitivity and accuracy of RNA-seq experiments for transcriptome discovery and quantification. These quality metrics facilitate comparable analysis across different samples, protocols, and platforms.

journal_name

Genome Res

journal_title

Genome research

authors

Jiang L,Schlesinger F,Davis CA,Zhang Y,Li R,Salit M,Gingeras TR,Oliver B

doi

10.1101/gr.121095.111

subject

Has Abstract

pub_date

2011-09-01 00:00:00

pages

1543-51

issue

9

eissn

1088-9051

issn

1549-5469

pii

gr.121095.111

journal_volume

21

pub_type

杂志文章
  • Retroelement distributions in the human genome: variations associated with age and proximity to genes.

    abstract::Remnants of more than 3 million transposable elements, primarily retroelements, comprise nearly half of the human genome and have generated much speculation concerning their evolutionary significance. We have exploited the draft human genome sequence to examine the distributions of retroelements on a genome-wide scale...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.388902

    authors: Medstrand P,van de Lagemaat LN,Mager DL

    更新日期:2002-10-01 00:00:00

  • Genomic organization of the sex-determining and adjacent regions of the sex chromosomes of medaka.

    abstract::Sequencing of the human Y chromosome has uncovered the peculiarities of the genomic organization of a heterogametic sex chromosome of old evolutionary age, and has led to many insights into the evolutionary changes that occurred during its long history. We have studied the genomic organization of the medaka fish Y chr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5016106

    authors: Kondo M,Hornung U,Nanda I,Imai S,Sasaki T,Shimizu A,Asakawa S,Hori H,Schmid M,Shimizu N,Schartl M

    更新日期:2006-07-01 00:00:00

  • The pig X and Y Chromosomes: structure, sequence, and evolution.

    abstract::We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which a...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.188839.114

    authors: Skinner BM,Sargent CA,Churcher C,Hunt T,Herrero J,Loveland JE,Dunn M,Louzada S,Fu B,Chow W,Gilbert J,Austin-Guest S,Beal K,Carvalho-Silva D,Cheng W,Gordon D,Grafham D,Hardy M,Harley J,Hauser H,Howden P,Howe K,

    更新日期:2016-01-01 00:00:00

  • The effect of translocation-induced nuclear reorganization on gene expression.

    abstract::Translocations are known to affect the expression of genes at the breakpoints and, in the case of unbalanced translocations, alter the gene copy number. However, a comprehensive understanding of the functional impact of this class of variation is lacking. Here, we have studied the effect of balanced chromosomal rearra...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.103622.109

    authors: Harewood L,Schütz F,Boyle S,Perry P,Delorenzi M,Bickmore WA,Reymond A

    更新日期:2010-05-01 00:00:00

  • Ancestry-agnostic estimation of DNA sample contamination from sequence reads.

    abstract::Detecting and estimating DNA sample contamination are important steps to ensure high-quality genotype calls and reliable downstream analysis. Existing methods rely on population allele frequency information for accurate estimation of contamination rates. Correctly specifying population allele frequencies for each indi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.246934.118

    authors: Zhang F,Flickinger M,Taliun SAG,InPSYght Psychiatric Genetics Consortium.,Abecasis GR,Scott LJ,McCaroll SA,Pato CN,Boehnke M,Kang HM

    更新日期:2020-02-01 00:00:00

  • A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes.

    abstract::Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important compone...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.5.551

    authors: Devine SE,Chissoe SL,Eby Y,Wilson RK,Boeke JD

    更新日期:1997-05-01 00:00:00

  • DNA profiling of B chromosomes from the yellow-necked mouse Apodemus flavicollis (Rodentia, Mammalia).

    abstract::Using AP-PCR-based DNA profiling we examined some structural features of B chromosomes from yellow-necked mice Apodemus flavicollis. Mice harboring one, two, or three or lacking B chromosomes were examined. Chromosomal structure was scanned for variant bands by using a series of arbitrary primers and from these, infor...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:

    authors: Tanic N,Dedovic N,Vujosevic M,Dimitrijevic B

    更新日期:2000-01-01 00:00:00

  • High resolution mapping of modified DNA nucleobases using excision repair enzymes.

    abstract::The incorporation and creation of modified nucleobases in DNA have profound effects on genome function. We describe methods for mapping positions and local content of modified DNA nucleobases in genomic DNA. We combined in vitro nucleobase excision with massively parallel DNA sequencing (Excision-seq) to determine the...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.174052.114

    authors: Bryan DS,Ransom M,Adane B,York K,Hesselberth JR

    更新日期:2014-09-01 00:00:00

  • Methylation analysis of a marsupial X-linked CpG island by bisulfite genomic sequencing.

    abstract::Paternal X chromosome inactivation occurs in rodent extraembryonic membranes and in all tissues of marsupials. Methylation of CpG islands occurs on the inactive X in eutherians and is considered to be a stabilizing mechanism. The only previous study of a marsupial X-linked CpG island was of the G6PD gene of the Virgin...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.2.114

    authors: Loebel DA,Johnston PG

    更新日期:1996-02-01 00:00:00

  • Genomic localization of RNA binding proteins reveals links between pre-mRNA processing and transcription.

    abstract::Pre-mRNA processing often occurs in coordination with transcription thereby coupling these two key regulatory events. As such, many proteins involved in mRNA processing associate with the transcriptional machinery and are in proximity to DNA. This proximity allows for the mapping of the genomic associations of RNA bin...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5211806

    authors: Swinburne IA,Meyer CA,Liu XS,Silver PA,Brodsky AS

    更新日期:2006-07-01 00:00:00

  • Prioritizing candidate disease genes by network-based boosting of genome-wide association data.

    abstract::Network "guilt by association" (GBA) is a proven approach for identifying novel disease genes based on the observation that similar mutational phenotypes arise from functionally related genes. In principle, this approach could account even for nonadditive genetic interactions, which underlie the synergistic combinatio...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.118992.110

    authors: Lee I,Blom UM,Wang PI,Shim JE,Marcotte EM

    更新日期:2011-07-01 00:00:00

  • Species-specific class I gene expansions formed the telomeric 1 mb of the mouse major histocompatibility complex.

    abstract::We have determined the complete sequence of 951,695 bp from the class I region of H2, the mouse major histocompatibility complex (Mhc) from strain 129/Sv (haplotype bc). The sequence contains 26 genes. The sequence spans from the last 50 kb of the H2-T region, including 2 class I genes and 3 class I pseudogenes, and i...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.975303

    authors: Takada T,Kumánovics A,Amadou C,Yoshino M,Jones EP,Athanasiou M,Evans GA,Fischer Lindahl K

    更新日期:2003-04-01 00:00:00

  • Characterization of the RNA content of chromatin.

    abstract::Noncoding RNA (ncRNA) constitutes a significant portion of the mammalian transcriptome. Emerging evidence suggests that it regulates gene expression in cis or trans by modulating the chromatin structure. To uncover the functional role of ncRNA in chromatin organization, we deep sequenced chromatin-associated RNAs (CAR...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.103473.109

    authors: Mondal T,Rasmussen M,Pandey GK,Isaksson A,Kanduri C

    更新日期:2010-07-01 00:00:00

  • A comprehensive survey of 3' animal miRNA modification events and a possible role for 3' adenylation in modulating miRNA targeting effectiveness.

    abstract::Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after k...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106054.110

    authors: Burroughs AM,Ando Y,de Hoon MJ,Tomaru Y,Nishibu T,Ukekawa R,Funakoshi T,Kurokawa T,Suzuki H,Hayashizaki Y,Daub CO

    更新日期:2010-10-01 00:00:00

  • A chromosome-level assembly of the Atlantic herring genome-detection of a supergene and other signals of selection.

    abstract::The Atlantic herring is a model species for exploring the genetic basis for ecological adaptation, due to its huge population size and extremely low genetic differentiation at selectively neutral loci. However, such studies have so far been hampered because of a highly fragmented genome assembly. Here, we deliver a ch...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.253435.119

    authors: Pettersson ME,Rochus CM,Han F,Chen J,Hill J,Wallerman O,Fan G,Hong X,Xu Q,Zhang H,Liu S,Liu X,Haggerty L,Hunt T,Martin FJ,Flicek P,Bunikis I,Folkvord A,Andersson L

    更新日期:2019-11-01 00:00:00

  • Parente2: a fast and accurate method for detecting identity by descent.

    abstract::Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.173641.114

    authors: Rodriguez JM,Bercovici S,Huang L,Frostig R,Batzoglou S

    更新日期:2015-02-01 00:00:00

  • Genome-wide parent-of-origin DNA methylation analysis reveals the intricacies of human imprinting and suggests a germline methylation-independent mechanism of establishment.

    abstract::Differential methylation between the two alleles of a gene has been observed in imprinted regions, where the methylation of one allele occurs on a parent-of-origin basis, the inactive X-chromosome in females, and at those loci whose methylation is driven by genetic variants. We have extensively characterized imprinted...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.164913.113

    authors: Court F,Tayama C,Romanelli V,Martin-Trujillo A,Iglesias-Platas I,Okamura K,Sugahara N,Simón C,Moore H,Harness JV,Keirstead H,Sanchez-Mut JV,Kaneki E,Lapunzina P,Soejima H,Wake N,Esteller M,Ogata T,Hata K,Nakabayashi

    更新日期:2014-04-01 00:00:00

  • H3K27me3 forms BLOCs over silent genes and intergenic regions and specifies a histone banding pattern on a mouse autosomal chromosome.

    abstract::In mammals, genome-wide chromatin maps and immunofluorescence studies show that broad domains of repressive histone modifications are present on pericentromeric and telomeric repeats and on the inactive X chromosome. However, only a few autosomal loci such as silent Hox gene clusters have been shown to lie in broad do...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.080861.108

    authors: Pauler FM,Sloane MA,Huang R,Regha K,Koerner MV,Tamir I,Sommer A,Aszodi A,Jenuwein T,Barlow DP

    更新日期:2009-02-01 00:00:00

  • Whole-genome sequence assembly for mammalian genomes: Arachne 2.

    abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.828403

    authors: Jaffe DB,Butler J,Gnerre S,Mauceli E,Lindblad-Toh K,Mesirov JP,Zody MC,Lander ES

    更新日期:2003-01-01 00:00:00

  • Differential divergence of three human pseudoautosomal genes and their mouse homologs: implications for sex chromosome evolution.

    abstract::The human pseudoautosomal region 1 (PAR1) is essential for meiotic pairing and recombination, and its deletion causes male sterility. Comparative studies of human and mouse pseudoautosomal genes are valuable in charting the evolution of this interesting region, but have been limited by the paucity of genes conserved b...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.197001

    authors: Gianfrancesco F,Sanges R,Esposito T,Tempesta S,Rao E,Rappold G,Archidiacono N,Graves JA,Forabosco A,D'Urso M

    更新日期:2001-12-01 00:00:00

  • Complete genomic sequence and analysis of the prion protein gene region from three mammalian species.

    abstract::The prion protein (PrP), first identified in scrapie-infected rodents, is encoded by a single exon of a single-copy chromosomal gene. In addition to the protein-coding exon, PrP genes in mammals contain one or two 5'-noncoding exons. To learn more about the genomic organization of regions surrounding the PrP exons, we...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.10.1022

    authors: Lee IY,Westaway D,Smit AF,Wang K,Seto J,Chen L,Acharya C,Ankener M,Baskin D,Cooper C,Yao H,Prusiner SB,Hood LE

    更新日期:1998-10-01 00:00:00

  • Genome-wide analyses of alternative splicing in plants: opportunities and challenges.

    abstract::Alternative splicing (AS) creates multiple mRNA transcripts from a single gene. While AS is known to contribute to gene regulation and proteome diversity in animals, the study of its importance in plants is in its early stages. However, recently available plant genome and transcript sequence data sets are enabling a g...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:10.1101/gr.053678.106

    authors: Barbazuk WB,Fu Y,McGinnis KM

    更新日期:2008-09-01 00:00:00

  • Molecular cloning and RARE cleavage mapping of human 2p, 6q, 8q, 12q, and 18q telomeres.

    abstract::Large terminal fragments of human chromosomes 2p, 6p, 8q, 12q, and 18q were cloned using yeast artificial chromosomes (YACs). RecA-assisted restriction endonuclease (RARE) cleavage analysis of genomic DNA samples from II unrelated individuals using YAC-derived probes confirmed the telomeric localizations of the half-Y...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5.3.225

    authors: Macina RA,Morii K,Hu XL,Negorev DG,Spais C,Ruthig LA,Riethman HC

    更新日期:1995-10-01 00:00:00

  • A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion.

    abstract::Organisms with large genomes contain vast amounts of repetitive DNA sequences, much of which is composed of retrotransposons. Amplification of retrotransposons has been postulated to be a major mechanism increasing genome size and leading to "genomic obesity." To gain insights into the relation between retrotransposon...

    journal_title:Genome research

    pub_type: 评论,杂志文章

    doi:10.1101/gr.10.7.908

    authors: Shirasu K,Schulman AH,Lahaye T,Schulze-Lefert P

    更新日期:2000-07-01 00:00:00

  • Genomic evolution, patterns of global dissemination, and interspecies transmission of human and simian T-cell leukemia/lymphotropic viruses.

    abstract::Using both env and long terminal repeat (LTR) sequences, with maximal representation of genetic diversity within primate strains, we revise and expand the unique evolutionary history of human and simian T-cell leukemia/lymphotropic viruses (HTLV/STLV). Based on the robust application of three different phylogenetic al...

    journal_title:Genome research

    pub_type: 杂志文章,评审

    doi:

    authors: Slattery JP,Franchini G,Gessain A

    更新日期:1999-06-01 00:00:00

  • Assessing clusters and motifs from gene expression data.

    abstract::Large-scale gene expression studies and genomic sequencing projects are providing vast amounts of information that can be used to identify or predict cellular regulatory processes. Genes can be clustered on the basis of the similarity of their expression profiles or function and these clusters are likely to contain ge...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.148301

    authors: Jakt LM,Cao L,Cheah KS,Smith DK

    更新日期:2001-01-01 00:00:00

  • Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    abstract::Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.185892.114

    authors: Fungtammasan A,Ananda G,Hile SE,Su MS,Sun C,Harris R,Medvedev P,Eckert K,Makova KD

    更新日期:2015-05-01 00:00:00

  • Spidey: a tool for mRNA-to-genomic alignments.

    abstract::We have developed a computer program that aligns spliced sequences to genomic sequences, using local alignment algorithms and heuristics to put together a global spliced alignment. Spidey can produce reliable alignments quickly, even when confronted with noise from alternative splicing, polymorphisms, sequencing error...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.195301

    authors: Wheelan SJ,Church DM,Ostell JM

    更新日期:2001-11-01 00:00:00

  • The evolution of evolvability in microRNA target sites in vertebrates.

    abstract::The lack of long-term evolutionary conservation of microRNA (miRNA) target sites appears to contradict many analyses of their functions. Several hypotheses have been offered, but an attractive one-that the conservation may be a function of taxonomic hierarchy (vertebrates, mammals, primates, etc.)-has rarely been disc...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.148916.112

    authors: Xu J,Zhang R,Shen Y,Liu G,Lu X,Wu CI

    更新日期:2013-11-01 00:00:00

  • Distinct contributions of DNA methylation and histone acetylation to the genomic occupancy of transcription factors.

    abstract::Epigenetic modifications on chromatin play important roles in regulating gene expression. Although chromatin states are often governed by multilayered structure, how individual pathways contribute to gene expression remains poorly understood. For example, DNA methylation is known to regulate transcription factor bindi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.257576.119

    authors: Cusack M,King HW,Spingardi P,Kessler BM,Klose RJ,Kriaucionis S

    更新日期:2020-10-01 00:00:00