Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

Abstract:

With read lengths of currently up to 2 × 300 bp, high throughput and low sequencing costs, Illumina's MiSeq is becoming one of the most utilized sequencing platforms worldwide. The platform is manageable and affordable even for smaller labs. This enables quick turnaround on a broad range of applications such as targeted gene sequencing, metagenomics, small genome sequencing and clinical molecular diagnostics. However, Illumina error profiles are still poorly understood and programs are therefore not designed for the idiosyncrasies of Illumina data. A better knowledge of the error patterns is essential for sequence analysis and vital if we are to draw valid conclusions. Studying true genetic variation in a population sample is fundamental for understanding diseases, evolution and origin. We conducted a large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data. We tested state-of-the-art library preparation methods for amplicon sequencing and showed that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns. Furthermore, we tested the efficiency of various error correction strategies and identified quality trimming (Sickle) combined with error correction (BayesHammer) followed by read overlapping (PANDAseq) as the most successful approach, reducing substitution error rates on average by 93%.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Schirmer M,Ijaz UZ,D'Amore R,Hall N,Sloan WT,Quince C

doi

10.1093/nar/gku1341

subject

Has Abstract

pub_date

2015-03-31 00:00:00

pages

e37

issue

6

eissn

1362-4962

issn

0305-1048

pii

gku1341

journal_volume

43

pub_type

Journal Article
  • Characterization of a novel T lymphocyte protein which binds to a site related to steroid/thyroid hormone receptor response elements in the negative regulatory sequence of the human immunodeficiency virus long terminal repeat.

    abstract::We have previously identified a T lymphocyte protein which binds to a site within the LTR of the human immunodeficiency virus type 1 (HIV-1) and exerts an inhibitory effect on virus gene expression. The palindromic site (site B) recognized by this protein is related to the palindromic binding sites of members of the s...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/20.20.5429

    authors: Orchard K,Lang G,Collins M,Latchman D

    update_date:1992-10-25 00:00:00

  • Interaction of N-acetyl-phenylalanyl-tRNAPhe with 70S ribosomes of Escherichia coli.

    abstract::The interaction of N-Acetyl-Phe-tRNAPhe with 70S ribosomes is a reversible process in the absence as well as in the presence of messenger. The equilibrium binding constants of these interactions were measured at different magnesium concentrations and temperatures and thermodynamical quantities computed. The entha...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/5.10.3871

    authors: Odinzov VB,Kirillov SV

    update_date:1978-10-01 00:00:00

  • ArrayXPath: mapping and visualizing microarray gene-expression data with integrated biological pathway resources using Scalable Vector Graphics.

    abstract::Biological pathways can provide key information on the organization of biological systems. ArrayXPath (http://www.snubi.org/software/ArrayXPath/) is a web-based service for mapping and visualizing microarray gene-expression data for integrated biological pathway resources using Scalable Vector Graphics (SVG). By integ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh476

    authors: Chung HJ,Kim M,Park CH,Kim J,Kim JH

    update_date:2004-07-01 00:00:00

  • Defective chromatin recruitment and retention of NHEJ core components in human tumor cells expressing a Cyclin E fragment.

    abstract::Exposure to genotoxic agents, such as ionizing radiation (IR), produces double-strand breaks, repaired predominantly in mammalian cells by non-homologous end-joining (NHEJ). Ku70 was identified as an interacting partner of a proteolytic Cyclin E (CycE) fragment, p18CycE. p18CycE endogenous generation during IR-induced...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkt812

    authors: Chatterjee P,Plesca D,Mazumder S,Boutros J,Yannone SM,Almasan A

    update_date:2013-12-01 00:00:00

  • Characterization of RNA helicase A as component of STAT6-dependent enhanceosome.

    abstract::Signal transducer and activator of transcription 6 (STAT6) is a regulator of transcription for interleukin-4 (IL-4)-induced genes. The ability of STAT6 to activate transcription depends on functional interaction with other transcription factors and coactivators. We have characterized the mechanism of STAT6-mediated tr...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkl539

    authors: Välineva T,Yang J,Silvennoinen O

    update_date:2006-01-01 00:00:00

  • Determination of DNA cooperativity factor.

    abstract::The paper presents measurements of the difference in the melting temperature of a colE1 DNA region when it is located inside the DNA helix and at its end. A direct comparison of calculations based on the rigorous theory of helix-coil transition with experimental data for 0.2 M Na+ (the conditions for fully reversible m...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/9.20.5469

    authors: Amirikyan BR,Vologodskii AV,Lyubchenko YuL

    update_date:1981-10-24 00:00:00

  • Semisynthesis of site-specifically succinylated histone reveals that succinylation regulates nucleosome unwrapping rate and DNA accessibility.

    abstract::Posttranslational modifications (PTMs) of histones represent a crucial regulatory mechanism of nucleosome and chromatin dynamics in various DNA-based cellular processes, such as replication, transcription and DNA damage repair. Lysine succinylation (Ksucc) is a newly identified histone PTM, but its regulation and f...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkaa663

    authors: Jing Y,Ding D,Tian G,Kwan KCJ,Liu Z,Ishibashi T,Li XD

    update_date:2020-09-25 00:00:00

  • Molecular characterization of Drosophila NELF.

    abstract::NELF and DSIF act together to inhibit transcription elongation in vitro, and are implicated in causing promoter proximal pausing on the hsp70 gene in Drosophila. Here, further characterization of Drosophila NELF is provided. Drosophila NELF has four subunits similar to subunits of human NELF. The amino acid sequences ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gki274

    authors: Wu CH,Lee C,Fan R,Smith MJ,Yamaguchi Y,Handa H,Gilmour DS

    update_date:2005-03-01 00:00:00

  • The Radiation Hybrid Database.

    abstract::Since July 1995, the European Bioinformatics Institute (EBI) has maintained RHdb (http://www.ebi.ac.uk/RHdb/RHdb.html), a public database for radiation hybrid data. Radiation hybrid mapping is an important technique for determining high resolution maps. Recently, CORBA access has been added to RHdb. The EBI is an Out...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/26.1.102

    authors: Lijnzaad P,Helgesen C,Rodriguez-Tomé P

    update_date:1998-01-01 00:00:00

  • Monitoring denaturation behaviour and comparative stability of DNA triple helices using oligonucleotide-gold nanoparticle conjugates.

    abstract::Gold nanoparticle labels, combined with UV-visible optical absorption spectroscopic methods, are employed to probe the temperature-dependent solution properties of DNA triple helices. By using oligonucleotide-nanoparticle conjugates to characterize triplex denaturation, for the first time triplex to duplex melting tra...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gnh065

    authors: Murphy D,Eritja R,Redmond G

    update_date:2004-04-23 00:00:00

  • An N-terminal clamp restrains the motor domains of the bacterial transcription-repair coupling factor Mfd.

    abstract::Motor proteins that translocate on nucleic acids are key players in gene expression and maintenance. While the function of these proteins is diverse, they are driven by highly conserved core motor domains. In transcription-coupled DNA repair, motor activity serves to remove RNA polymerase stalled on damaged DNA, makin...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkp680

    authors: Murphy MN,Gong P,Ralto K,Manelyte L,Savery NJ,Theis K

    update_date:2009-10-01 00:00:00

  • The Microbial Genomes Atlas (MiGA) webserver: taxonomic and gene diversity analysis of Archaea and Bacteria at the whole genome level.

    abstract::The small subunit ribosomal RNA gene (16S rRNA) has been successfully used to catalogue and study the diversity of prokaryotic species and communities but it offers limited resolution at the species and finer levels, and cannot represent the whole-genome diversity and fluidity. To overcome these limitations, we introd...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gky467

    authors: Rodriguez-R LM,Gunturu S,Harvey WT,Rosselló-Mora R,Tiedje JM,Cole JR,Konstantinidis KT

    update_date:2018-07-02 00:00:00

  • Inter-individual variation of DNA methylation and its implications for large-scale epigenome mapping.

    abstract::Genomic DNA methylation profiles exhibit substantial variation within the human population, with important functional implications for gene regulation. So far little is known about the characteristics and determinants of DNA methylation variation among healthy individuals. We performed bioinformatic analysis of high-r...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkn122

    authors: Bock C,Walter J,Paulsen M,Lengauer T

    update_date:2008-06-01 00:00:00

  • A novel endonuclease IV post-PCR genotyping system.

    abstract::Here we describe a novel endonuclease IV (Endo IV) based assay utilizing a substrate that mimics the abasic lesions that normally occur in double-stranded DNA. The three component substrate is characterized by single-stranded DNA target, an oligonucleotide probe, separated from a helper oligonucleotide by a one base g...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkl679

    authors: Kutyavin IV,Milesi D,Belousov Y,Podyminogin M,Vorobiev A,Gorn V,Lukhtanov EA,Vermeulen NM,Mahoney W

    update_date:2006-01-01 00:00:00

  • Incorporation of 2'-amido-nucleosides in oligodeoxynucleotides and oligoribonucleotides as a model for 2'-linked conjugates.

    abstract::The functionalisation of oligodeoxynucleotides and oligoribonucleotides by incorporation of 2'-amido-2'-deoxyribonucleosides, possibly containing a reporter group via the 2'-amido bond, was examined. Therefore 2'-acetamido-ribonucleosides containing a small methyl group at the 2'-amido bond were synthesized as model c...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/23.1.51

    authors: Hendrix C,Devreese B,Rozenski J,van Aerschot A,De Bruyn A,Van Beeumen J,Herdewijn P

    update_date:1995-01-11 00:00:00

  • Exploiting post-transcriptional regulation to probe RNA structures in vivo via fluorescence.

    abstract::While RNA structures have been extensively characterized in vitro, very few techniques exist to probe RNA structures inside cells. Here, we have exploited mechanisms of post-transcriptional regulation to synthesize fluorescence-based probes that assay RNA structures in vivo. Our probing system involves the co-expressi...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gku1191

    authors: Sowa SW,Vazquez-Anderson J,Clark CA,De La Peña R,Dunn K,Fung EK,Khoury MJ,Contreras LM

    update_date:2015-01-01 00:00:00

  • The N-terminus of Prp1 (Prp6/U5-102 K) is essential for spliceosome activation in vivo.

    abstract::The spliceosomal protein Prp1 (Prp6/U5-102 K) is necessary for the integrity of pre-catalytic spliceosomal complexes. We have identified a novel regulatory function for Prp1. Expression of mutations in the N-terminus of Prp1 leads to the accumulation of pre-catalytic spliceosomal complexes containing the five snRNAs U...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkp1155

    authors: Lützelberger M,Bottner CA,Schwelnus W,Zock-Emmenthal S,Razanau A,Käufer NF

    update_date:2010-03-01 00:00:00

  • Cloning and characterization of the C. elegans histidyl-tRNA synthetase gene.

    abstract::In this paper, we report the cloning and sequencing of the C. elegans histidyl-tRNA synthetase gene. The complete genomic sequence, and most of the cDNA sequence, of this gene is now determined. The gene size including flanking and coding regions is 2230 nucleotides long. Three small introns (45-50 bp long) are found ...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/21.18.4344

    authors: Amaar YG,Baillie DL

    update_date:1993-09-11 00:00:00

  • SCOP database in 2004: refinements integrate structure and sequence family data.

    abstract::The Structural Classification of Proteins (SCOP) database is a comprehensive ordering of all proteins of known structure, according to their evolutionary and structural relationships. Protein domains in SCOP are hierarchically classified into families, superfamilies, folds and classes. The continual accumulation of se...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkh039

    authors: Andreeva A,Howorth D,Brenner SE,Hubbard TJ,Chothia C,Murzin AG

    update_date:2004-01-01 00:00:00

  • mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

    abstract::High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such a...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gku370

    authors: Hach F,Sarrafi I,Hormozdiari F,Alkan C,Eichler EE,Sahinalp SC

    update_date:2014-07-01 00:00:00

  • Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/I1-I2 regulatory elements.

    abstract::Based on parameters governing promoter activity and using regulatory elements of the lac, ara and tet operon, transcription control sequences were composed which permit the regulation in Escherichia coli of several gene activities independently and quantitatively. The novel promoter PLtetO-1 allows the regulation of ge...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/25.6.1203

    authors: Lutz R,Bujard H

    update_date:1997-03-15 00:00:00

  • DNA polymerase I and a protein complex bind specifically to E. coli palindromic unit highly repetitive DNA: implications for bacterial chromosome organization.

    abstract::Starting from a crude E. coli extract, two activities which specifically protect highly repetitive bacterial DNA sequences (called PU for Palindromic Unit or REP for Repetitive Extragenic Palindromic sequence) against a digestion with Exonuclease III have been purified. We show that one of these activities is due to t...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/18.13.3941

    authors: Gilson E,Perrin D,Hofnung M

    update_date:1990-07-11 00:00:00

  • A thermostable endonuclease III homolog from the archaeon Pyrobaculum aerophilum.

    abstract::Pyrimidine adducts in cellular DNA arise from modification of the pyrimidine 5,6-double bond by oxidation, reduction or hydration. The biological outcome includes increased mutation rate and potential lethality. A major DNA N-glycosylase responsible for the excision of modified pyrimidine bases is the base excision r...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/29.3.604

    authors: Yang H,Phan IT,Fitz-Gibbon S,Shivji MK,Wood RD,Clendenin WM,Hyman EC,Miller JH

    update_date:2001-02-01 00:00:00

  • The protein kinase TOUSLED facilitates RNAi in Arabidopsis.

    abstract::RNA silencing is an evolutionarily conserved mechanism triggered by double-stranded RNA that is processed into 21- to 24-nt small interfering (si)RNA or micro (mi)RNA by RNaseIII-like enzymes called Dicers. Gene regulations by RNA silencing have fundamental implications in a large number of biological processes that i...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gku422

    authors: Uddin MN,Dunoyer P,Schott G,Akhter S,Shi C,Lucas WJ,Voinnet O,Kim JY

    update_date:2014-07-01 00:00:00

  • Investigation of Mg2+- and temperature-dependent folding of the hairpin ribozyme by photo-crosslinking: effects of photo-crosslinker tether length and chemistry.

    abstract::We have used photo-crosslinking to investigate the structure and dynamics of four-way junction hairpin ribozyme constructs. Four phenylazide photo-crosslinkers were coupled to 2'-NH2-modified U+2 in the substrate and irradiated at different Mg2+ concentrations and temperatures. Consistent with the role of divalent met...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gki237

    authors: Borda EJ,Sigurdsson ST

    update_date:2005-02-18 00:00:00

  • Synthesis and restriction enzyme analysis of oligodeoxyribonucleotides containing the anti-cancer drug 2',2'-difluoro-2'-deoxycytidine.

    abstract::The anti-cancer drug 2',2'-difluoro-2'-deoxycytidine (dFdC) is internally incorporated into DNA in vitro. To determine the effects of this incorporation on DNA structure and function, the beta-cyanoethyl phosphoramidite of dFdC was synthesized and oligodeoxyribonucleotides containing dFdC were made using automated sol...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/20.7.1763

    authors: Richardson FC,Richardson KK,Kroin JS,Hertel LW

    update_date:1992-04-11 00:00:00

  • The distribution of active RNA polymerase II along the transcribed region is gene-specific and controlled by elongation factors.

    abstract::In order to study the intragenic profiles of active transcription, we determined the relative levels of active RNA polymerase II present at the 3'- and 5'-ends of 261 yeast genes by run-on. The results obtained indicate that the 3'/5' run-on ratio varies among the genes studied by over 12 log(2) units. This ratio seem...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gkq215

    authors: Rodríguez-Gil A,García-Martínez J,Pelechano V,Muñoz-Centeno Mde L,Geli V,Pérez-Ortín JE,Chávez S

    update_date:2010-08-01 00:00:00

  • Automated selection of aptamers against protein targets translated in vitro: from gene to aptamer.

    abstract::Reagents for proteome research must of necessity be generated by high throughput methods. Aptamers are potentially useful as reagents to identify and quantitate individual proteins, yet are currently produced for the most part by manual selection procedures. We have developed automated selection methods, but must stil...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/gnf107

    authors: Cox JC,Hayhurst A,Hesselberth J,Bayer TS,Georgiou G,Ellington AD

    update_date:2002-10-15 00:00:00

  • Sequences preceding the minimal promoter of the Xenopus somatic 5S RNA gene increase binding efficiency for transcription factors.

    abstract::Sequences preceding the minimal promoter play a role in the differential expression of the Xenopus somatic and oocyte-type 5S RNA genes. In this report, the somatic sequences between -32 and +37 are shown to increase transcriptional activity in microinjected embryos, yet have little to no effect in microinjected oocyt...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:

    authors: Reynolds WF

    update_date:1989-11-25 00:00:00

  • Self-assembly of DNA-streptavidin nanostructures and their use as reagents in immuno-PCR.

    abstract::The self-assembly of bis-biotinylated double-stranded DNA and the tetravalent biotin-binding protein streptavidin (STV) have been studied by non-denaturing gel electrophoresis and atomic force microscopy (AFM). The rapid self-assembly reproducibly generated populations of individual oligomeric complexes. Most striking...

    journal_title:Nucleic acids research

    pub_type: Journal Article

    doi:10.1093/nar/27.23.4553

    authors: Niemeyer CM,Adler M,Pignataro B,Lenhert S,Gao S,Chi L,Fuchs H,Blohm D

    update_date:1999-12-01 00:00:00