Abstract:
:High-throughput sequencing of cDNA (RNA-seq) is a widely deployed transcriptome profiling and annotation technique, but questions about the performance of different protocols and platforms remain. We used a newly developed pool of 96 synthetic RNAs with various lengths, and GC content covering a 2(20) concentration range as spike-in controls to measure sensitivity, accuracy, and biases in RNA-seq experiments as well as to derive standard curves for quantifying the abundance of transcripts. We observed linearity between read density and RNA input over the entire detection range and excellent agreement between replicates, but we observed significantly larger imprecision than expected under pure Poisson sampling errors. We use the control RNAs to directly measure reproducible protocol-dependent biases due to GC content and transcript length as well as stereotypic heterogeneity in coverage across transcripts correlated with position relative to RNA termini and priming sequence bias. These effects lead to biased quantification for short transcripts and individual exons, which is a serious problem for measurements of isoform abundances, but that can partially be corrected using appropriate models of bias. By using the control RNAs, we derive limits for the discovery and detection of rare transcripts in RNA-seq experiments. By using data collected as part of the model organism and human Encyclopedia of DNA Elements projects (ENCODE and modENCODE), we demonstrate that external RNA controls are a useful resource for evaluating sensitivity and accuracy of RNA-seq experiments for transcriptome discovery and quantification. These quality metrics facilitate comparable analysis across different samples, protocols, and platforms.
journal_name
Genome Resjournal_title
Genome researchauthors
Jiang L,Schlesinger F,Davis CA,Zhang Y,Li R,Salit M,Gingeras TR,Oliver Bdoi
10.1101/gr.121095.111subject
Has Abstractpub_date
2011-09-01 00:00:00pages
1543-51issue
9eissn
1088-9051issn
1549-5469pii
gr.121095.111journal_volume
21pub_type
杂志文章相关文献
GENOME RESEARCH文献大全abstract::Remnants of more than 3 million transposable elements, primarily retroelements, comprise nearly half of the human genome and have generated much speculation concerning their evolutionary significance. We have exploited the draft human genome sequence to examine the distributions of retroelements on a genome-wide scale...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.388902
更新日期:2002-10-01 00:00:00
abstract::Sequencing of the human Y chromosome has uncovered the peculiarities of the genomic organization of a heterogametic sex chromosome of old evolutionary age, and has led to many insights into the evolutionary changes that occurred during its long history. We have studied the genomic organization of the medaka fish Y chr...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5016106
更新日期:2006-07-01 00:00:00
abstract::We have generated an improved assembly and gene annotation of the pig X Chromosome, and a first draft assembly of the pig Y Chromosome, by sequencing BAC and fosmid clones from Duroc animals and incorporating information from optical mapping and fiber-FISH. The X Chromosome carries 1033 annotated genes, 690 of which a...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.188839.114
更新日期:2016-01-01 00:00:00
abstract::Translocations are known to affect the expression of genes at the breakpoints and, in the case of unbalanced translocations, alter the gene copy number. However, a comprehensive understanding of the functional impact of this class of variation is lacking. Here, we have studied the effect of balanced chromosomal rearra...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.103622.109
更新日期:2010-05-01 00:00:00
abstract::Detecting and estimating DNA sample contamination are important steps to ensure high-quality genotype calls and reliable downstream analysis. Existing methods rely on population allele frequency information for accurate estimation of contamination rates. Correctly specifying population allele frequencies for each indi...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.246934.118
更新日期:2020-02-01 00:00:00
abstract::Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important compone...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.7.5.551
更新日期:1997-05-01 00:00:00
abstract::Using AP-PCR-based DNA profiling we examined some structural features of B chromosomes from yellow-necked mice Apodemus flavicollis. Mice harboring one, two, or three or lacking B chromosomes were examined. Chromosomal structure was scanned for variant bands by using a series of arbitrary primers and from these, infor...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:2000-01-01 00:00:00
abstract::The incorporation and creation of modified nucleobases in DNA have profound effects on genome function. We describe methods for mapping positions and local content of modified DNA nucleobases in genomic DNA. We combined in vitro nucleobase excision with massively parallel DNA sequencing (Excision-seq) to determine the...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.174052.114
更新日期:2014-09-01 00:00:00
abstract::Paternal X chromosome inactivation occurs in rodent extraembryonic membranes and in all tissues of marsupials. Methylation of CpG islands occurs on the inactive X in eutherians and is considered to be a stabilizing mechanism. The only previous study of a marsupial X-linked CpG island was of the G6PD gene of the Virgin...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.6.2.114
更新日期:1996-02-01 00:00:00
abstract::Pre-mRNA processing often occurs in coordination with transcription thereby coupling these two key regulatory events. As such, many proteins involved in mRNA processing associate with the transcriptional machinery and are in proximity to DNA. This proximity allows for the mapping of the genomic associations of RNA bin...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5211806
更新日期:2006-07-01 00:00:00
abstract::Network "guilt by association" (GBA) is a proven approach for identifying novel disease genes based on the observation that similar mutational phenotypes arise from functionally related genes. In principle, this approach could account even for nonadditive genetic interactions, which underlie the synergistic combinatio...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.118992.110
更新日期:2011-07-01 00:00:00
abstract::We have determined the complete sequence of 951,695 bp from the class I region of H2, the mouse major histocompatibility complex (Mhc) from strain 129/Sv (haplotype bc). The sequence contains 26 genes. The sequence spans from the last 50 kb of the H2-T region, including 2 class I genes and 3 class I pseudogenes, and i...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.975303
更新日期:2003-04-01 00:00:00
abstract::Noncoding RNA (ncRNA) constitutes a significant portion of the mammalian transcriptome. Emerging evidence suggests that it regulates gene expression in cis or trans by modulating the chromatin structure. To uncover the functional role of ncRNA in chromatin organization, we deep sequenced chromatin-associated RNAs (CAR...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.103473.109
更新日期:2010-07-01 00:00:00
abstract::Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after k...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.106054.110
更新日期:2010-10-01 00:00:00
abstract::The Atlantic herring is a model species for exploring the genetic basis for ecological adaptation, due to its huge population size and extremely low genetic differentiation at selectively neutral loci. However, such studies have so far been hampered because of a highly fragmented genome assembly. Here, we deliver a ch...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.253435.119
更新日期:2019-11-01 00:00:00
abstract::Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographi...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.173641.114
更新日期:2015-02-01 00:00:00
abstract::Differential methylation between the two alleles of a gene has been observed in imprinted regions, where the methylation of one allele occurs on a parent-of-origin basis, the inactive X-chromosome in females, and at those loci whose methylation is driven by genetic variants. We have extensively characterized imprinted...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.164913.113
更新日期:2014-04-01 00:00:00
abstract::In mammals, genome-wide chromatin maps and immunofluorescence studies show that broad domains of repressive histone modifications are present on pericentromeric and telomeric repeats and on the inactive X chromosome. However, only a few autosomal loci such as silent Hox gene clusters have been shown to lie in broad do...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.080861.108
更新日期:2009-02-01 00:00:00
abstract::We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal change...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.828403
更新日期:2003-01-01 00:00:00
abstract::The human pseudoautosomal region 1 (PAR1) is essential for meiotic pairing and recombination, and its deletion causes male sterility. Comparative studies of human and mouse pseudoautosomal genes are valuable in charting the evolution of this interesting region, but have been limited by the paucity of genes conserved b...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.197001
更新日期:2001-12-01 00:00:00
abstract::The prion protein (PrP), first identified in scrapie-infected rodents, is encoded by a single exon of a single-copy chromosomal gene. In addition to the protein-coding exon, PrP genes in mammals contain one or two 5'-noncoding exons. To learn more about the genomic organization of regions surrounding the PrP exons, we...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.8.10.1022
更新日期:1998-10-01 00:00:00
abstract::Alternative splicing (AS) creates multiple mRNA transcripts from a single gene. While AS is known to contribute to gene regulation and proteome diversity in animals, the study of its importance in plants is in its early stages. However, recently available plant genome and transcript sequence data sets are enabling a g...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:10.1101/gr.053678.106
更新日期:2008-09-01 00:00:00
abstract::Large terminal fragments of human chromosomes 2p, 6p, 8q, 12q, and 18q were cloned using yeast artificial chromosomes (YACs). RecA-assisted restriction endonuclease (RARE) cleavage analysis of genomic DNA samples from II unrelated individuals using YAC-derived probes confirmed the telomeric localizations of the half-Y...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5.3.225
更新日期:1995-10-01 00:00:00
abstract::Organisms with large genomes contain vast amounts of repetitive DNA sequences, much of which is composed of retrotransposons. Amplification of retrotransposons has been postulated to be a major mechanism increasing genome size and leading to "genomic obesity." To gain insights into the relation between retrotransposon...
journal_title:Genome research
pub_type: 评论,杂志文章
doi:10.1101/gr.10.7.908
更新日期:2000-07-01 00:00:00
abstract::Using both env and long terminal repeat (LTR) sequences, with maximal representation of genetic diversity within primate strains, we revise and expand the unique evolutionary history of human and simian T-cell leukemia/lymphotropic viruses (HTLV/STLV). Based on the robust application of three different phylogenetic al...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:
更新日期:1999-06-01 00:00:00
abstract::Large-scale gene expression studies and genomic sequencing projects are providing vast amounts of information that can be used to identify or predict cellular regulatory processes. Genes can be clustered on the basis of the similarity of their expression profiles or function and these clusters are likely to contain ge...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.148301
更新日期:2001-01-01 00:00:00
abstract::Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.185892.114
更新日期:2015-05-01 00:00:00
abstract::We have developed a computer program that aligns spliced sequences to genomic sequences, using local alignment algorithms and heuristics to put together a global spliced alignment. Spidey can produce reliable alignments quickly, even when confronted with noise from alternative splicing, polymorphisms, sequencing error...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.195301
更新日期:2001-11-01 00:00:00
abstract::The lack of long-term evolutionary conservation of microRNA (miRNA) target sites appears to contradict many analyses of their functions. Several hypotheses have been offered, but an attractive one-that the conservation may be a function of taxonomic hierarchy (vertebrates, mammals, primates, etc.)-has rarely been disc...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.148916.112
更新日期:2013-11-01 00:00:00
abstract::Epigenetic modifications on chromatin play important roles in regulating gene expression. Although chromatin states are often governed by multilayered structure, how individual pathways contribute to gene expression remains poorly understood. For example, DNA methylation is known to regulate transcription factor bindi...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.257576.119
更新日期:2020-10-01 00:00:00