Efficient assembly of nanopore reads via highly accurate and intact error correction.

Abstract:

:Long nanopore reads are advantageous in de novo genome assembly. However, nanopore reads usually have broad error distribution and high-error-rate subsequences. Existing error correction tools cannot correct nanopore reads efficiently and effectively. Most methods trim high-error-rate subsequences during error correction, which reduces both the length of the reads and contiguity of the final assembly. Here, we develop an error correction, and de novo assembly tool designed to overcome complex errors in nanopore reads. We propose an adaptive read selection and two-step progressive method to quickly correct nanopore reads to high accuracy. We introduce a two-stage assembler to utilize the full length of nanopore reads. Our tool achieves superior performance in both error correction and de novo assembling nanopore reads. It requires only 8122 hours to assemble a 35X coverage human genome and achieves a 2.47-fold improvement in NG50. Furthermore, our assembly of the human WERI cell line shows an NG50 of 22 Mbp. The high-quality assembly of nanopore reads can significantly reduce false positives in structure variation detection.

journal_name

Nat Commun

journal_title

Nature communications

authors

Chen Y,Nie F,Xie SQ,Zheng YF,Dai Q,Bray T,Wang YX,Xing JF,Huang ZJ,Wang DP,He LJ,Luo F,Wang JX,Liu YZ,Xiao CL

doi

10.1038/s41467-020-20236-7

subject

Has Abstract

pub_date

2021-01-04 00:00:00

pages

60

issue

1

issn

2041-1723

pii

10.1038/s41467-020-20236-7

journal_volume

12

pub_type

杂志文章
  • Topologically robust sound propagation in an angular-momentum-biased graphene-like resonator lattice.

    abstract::Topological insulators do not allow conduction in the bulk, yet they support edge modes that travel along the boundary only in one direction, determined by the carried electron spin, with inherent robustness to defects and disorder. Topological insulators have inspired analogues in photonics and optics, in which one-w...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms9260

    authors: Khanikaev AB,Fleury R,Mousavi SH,Alù A

    更新日期:2015-10-06 00:00:00

  • Visualization of a ferromagnetic metallic edge state in manganite strips.

    abstract::Recently, broken symmetry effect induced edge states in two-dimensional electronic systems have attracted great attention. However, whether edge states may exist in strongly correlated oxides is not yet known. In this work, using perovskite manganites as prototype systems, we demonstrate that edge states do exist in s...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms7179

    authors: Du K,Zhang K,Dong S,Wei W,Shao J,Niu J,Chen J,Zhu Y,Lin H,Yin X,Liou SH,Yin L,Shen J

    更新日期:2015-02-04 00:00:00

  • Rethinking Indian monsoon rainfall prediction in the context of recent global warming.

    abstract::Prediction of Indian summer monsoon rainfall (ISMR) is at the heart of tropical climate prediction. Despite enormous progress having been made in predicting ISMR since 1886, the operational forecasts during recent decades (1989-2012) have little skill. Here we show, with both dynamical and physical-empirical models, t...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms8154

    authors: Wang B,Xiang B,Li J,Webster PJ,Rajeevan MN,Liu J,Ha KJ

    更新日期:2015-05-18 00:00:00

  • DG-CA3 circuitry mediates hippocampal representations of latent information.

    abstract::Survival in complex environments necessitates a flexible navigation system that incorporates memory of recent behavior and associations. Yet, how the hippocampal spatial circuit represents latent information independent of sensory inputs and future goals has not been determined. To address this, we image the activity ...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-020-16825-1

    authors: Keinath AT,Nieto-Posadas A,Robinson JC,Brandon MP

    更新日期:2020-06-15 00:00:00

  • An atlas of DNA methylomes in porcine adipose and muscle tissues.

    abstract::It is evident that epigenetic factors, especially DNA methylation, have essential roles in obesity development. Here, using pig as a model, we investigate the systematic association between DNA methylation and obesity. We sample eight variant adipose and two distinct ske...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms1854

    authors: Li M,Wu H,Luo Z,Xia Y,Guan J,Wang T,Gu Y,Chen L,Zhang K,Ma J,Liu Y,Zhong Z,Nie J,Zhou S,Mu Z,Wang X,Qu J,Jing L,Wang H,Huang S,Yi N,Wang Z,Xi D,Wang J,Yin G,Wang L,Li N,Jiang Z,Lang Q,Xiao H,Ji

    更新日期:2012-05-22 00:00:00

  • Regulation of the NaV1.5 cytoplasmic domain by calmodulin.

    abstract::Voltage-gated sodium channels (Na(v)) underlie the rapid upstroke of action potentials in excitable tissues. Binding of channel-interactive proteins is essential for controlling fast and long-term inactivation. In the structure of the complex of the carboxy-terminal portion of Na(v)1.5 (CTNa(v)1.5) with calmodulin (Ca...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms6126

    authors: Gabelli SB,Boto A,Kuhns VH,Bianchet MA,Farinelli F,Aripirala S,Yoder J,Jakoncic J,Tomaselli GF,Amzel LM

    更新日期:2014-11-05 00:00:00

  • TCR microclusters form spatially segregated domains and sequentially assemble in calcium-dependent kinetic steps.

    abstract::Engagement of the T cell receptor (TCR) by stimulatory ligand results in the rapid formation of microclusters at sites of T cell activation. Whereas microclusters have been studied extensively using confocal microscopy, the spatial and kinetic relationships of their signaling components have not been well characterize...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-018-08064-2

    authors: Yi J,Balagopalan L,Nguyen T,McIntire KM,Samelson LE

    更新日期:2019-01-17 00:00:00

  • Graphene etching on SiC grains as a path to interstellar polycyclic aromatic hydrocarbons formation.

    abstract::Polycyclic aromatic hydrocarbons as well as other organic molecules appear among the most abundant observed species in interstellar space and are key molecules to understanding the prebiotic roots of life. However, their existence and abundance in space remain a puzzle. Here we present a new top-down route to form pol...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms4054

    authors: Merino P,Švec M,Martinez JI,Jelinek P,Lacovig P,Dalmiglio M,Lizzit S,Soukiassian P,Cernicharo J,Martin-Gago JA

    更新日期:2014-01-01 00:00:00

  • Highly efficient multiplex human T cell engineering without double-strand breaks using Cas9 base editors.

    abstract::The fusion of genome engineering and adoptive cellular therapy holds immense promise for the treatment of genetic disease and cancer. Multiplex genome engineering using targeted nucleases can be used to increase the efficacy and broaden the application of such therapies but carries safety risks associated with uninten...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-13007-6

    authors: Webber BR,Lonetree CL,Kluesner MG,Johnson MJ,Pomeroy EJ,Diers MD,Lahr WS,Draper GM,Slipek NJ,Smeester BA,Lovendahl KN,McElroy AN,Gordon WR,Osborn MJ,Moriarity BS

    更新日期:2019-11-19 00:00:00

  • GWAS for urinary sodium and potassium excretion highlights pathways shared with cardiovascular traits.

    abstract::Urinary sodium and potassium excretion are associated with blood pressure (BP) and cardiovascular disease (CVD). The exact biological link between these traits is yet to be elucidated. Here, we identify 50 loci for sodium and 13 for potassium excretion in a large-scale genome-wide association study (GWAS) on urinary s...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-11451-y

    authors: Pazoki R,Evangelou E,Mosen-Ansorena D,Pinto RC,Karaman I,Blakeley P,Gill D,Zuber V,Elliott P,Tzoulaki I,Dehghan A

    更新日期:2019-08-13 00:00:00

  • Determining the rotation direction in pulsars.

    abstract::Pulsars are rotating neutron stars emitting lighthouse-like beams. Owing to their unique properties, pulsars are a unique astrophysical tool to test general relativity, inform on matter in extreme conditions, and probe galactic magnetic fields. Understanding pulsar physics and emission mechanisms is critical to these ...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-11243-4

    authors: Gueroult R,Shi Y,Rax JM,Fisch NJ

    更新日期:2019-07-19 00:00:00

  • Author Correction: Adaptive individual variation in phenological responses to perceived predation levels.

    abstract::An amendment to this paper has been published and can be accessed via a link at the top of the paper. ...

    journal_title:Nature communications

    pub_type: 已发布勘误

    doi:10.1038/s41467-019-13715-z

    authors: Abbey-Lee RN,Dingemanse NJ

    更新日期:2019-12-06 00:00:00

  • GWAS for Interleukin-1β levels in gingival crevicular fluid identifies IL37 variants in periodontal inflammation.

    abstract::There is no agnostic GWAS evidence for the genetic control of IL-1β expression in periodontal disease. Here we report a GWAS for "high" gingival crevicular fluid IL-1β expression among 4910 European-American adults and identify association signals in the IL37 locus. rs3811046 at this locus (p = 3.3 × 10-22) is associa...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-018-05940-9

    authors: Offenbacher S,Jiao Y,Kim SJ,Marchesan J,Moss KL,Jing L,Divaris K,Bencharit S,Agler CS,Morelli T,Zhang S,Sun L,Seaman WT,Cowley D,Barros SP,Beck JD,Munz M,Schaefer AS,North KE

    更新日期:2018-09-11 00:00:00

  • Nonreciprocal charge transport at topological insulator/superconductor interface.

    abstract::Topological superconductor is attracting growing interest for its potential application to topological quantum computation. The superconducting proximity effect on the topological insulator surface state is one promising way to yield topological superconductivity. The superconductivity realized at the interface betwee...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-10658-3

    authors: Yasuda K,Yasuda H,Liang T,Yoshimi R,Tsukazaki A,Takahashi KS,Nagaosa N,Kawasaki M,Tokura Y

    更新日期:2019-06-21 00:00:00

  • An unusual endo-selective C-H hydroarylationof norbornene by the Rh(I)-catalyzed reactionof benzamides.

    abstract::Hydroarylation is an environmentally attractive strategy which incorporates all of the atoms contained in the substrates into the desired products. Almost all the hydroarylations of norbornene reported to date involve an exo-selective reaction. Here we show the endo-selective hydroarylation of norbornene in the Rh(I)-...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-017-01531-2

    authors: Shibata K,Natsui S,Tobisu M,Fukumoto Y,Chatani N

    更新日期:2017-11-13 00:00:00

  • Cretaceous stem chondrichthyans survived the end-Permian mass extinction.

    abstract::Cladodontomorph sharks are Palaeozoic stem chondrichthyans thought to go extinct at the end-Permian mass extinction. This extinction preceded the diversification of euselachians, including modern sharks. Here we describe an outer-platform cladodontomorph shark tooth assemblage from the Early Cretaceous of southern Fra...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms3669

    authors: Guinot G,Adnet S,Cavin L,Cappetta H

    更新日期:2013-01-01 00:00:00

  • Ultrasensitive visual read-out of nucleic acids using electrocatalytic fluid displacement.

    abstract::Diagnosis of disease outside of sophisticated laboratories urgently requires low-cost, user-friendly devices. Disposable, instrument-free testing devices are used for home and physician office testing, but are limited in applicability to a small class of highly abundant analytes. Direct, unambiguous visual read-out is...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms7978

    authors: Besant JD,Das J,Burgess IB,Liu W,Sargent EH,Kelley SO

    更新日期:2015-04-22 00:00:00

  • Identification of key sequence features required for microRNA biogenesis in plants.

    abstract::MicroRNAs (miRNAs) are endogenous small RNAs of ∼21 nt that regulate multiple biological pathways in multicellular organisms. They derive from longer transcripts that harbor an imperfect stem-loop structure. In plants, the ribonuclease type III DICER-LIKE1 assisted by accessory proteins cleaves the precursor to releas...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-020-19129-6

    authors: Rojas AML,Drusin SI,Chorostecki U,Mateos JL,Moro B,Bologna NG,Bresso EG,Schapire A,Rasia RM,Moreno DM,Palatnik JF

    更新日期:2020-10-21 00:00:00

  • Flexible and twistable non-volatile memory cell array with all-organic one diode-one resistor architecture.

    abstract::Flexible organic memory devices are one of the integral components for future flexible organic electronics. However, high-density all-organic memory cell arrays on malleable substrates without cross-talk have not been demonstrated because of difficulties in their fabrication and relatively poor performances to date. H...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms3707

    authors: Ji Y,Zeigler DF,Lee DS,Choi H,Jen AK,Ko HC,Kim TW

    更新日期:2013-01-01 00:00:00

  • Voltage tunability of single-spin states in a quantum dot.

    abstract::Single spins in the solid state offer a unique opportunity to store and manipulate quantum information, and to perform quantum-enhanced sensing of local fields and charges. Optical control of these systems using techniques developed in atomic physics has yet to exploit all the advantages of the solid state. Here we de...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms2519

    authors: Bennett AJ,Pooley MA,Cao Y,Sköld N,Farrer I,Ritchie DA,Shields AJ

    更新日期:2013-01-01 00:00:00

  • Allosteric cross-talk in chromatin can mediate drug-drug synergy.

    abstract::Exploitation of drug-drug synergism and allostery could yield superior therapies by capitalizing on the immensely diverse, but highly specific, potential associated with the biological macromolecular landscape. Here we describe a drug-drug synergy mediated by allosteric cross-talk in chromatin, whereby the binding of ...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms14860

    authors: Adhireksan Z,Palermo G,Riedel T,Ma Z,Muhammad R,Rothlisberger U,Dyson PJ,Davey CA

    更新日期:2017-03-30 00:00:00

  • Electron-vibration coupling induced renormalization in the photoemission spectrum of diamondoids.

    abstract::The development of theories and methods devoted to the accurate calculation of the electronic quasi-particle states and levels of molecules, clusters and solids is of prime importance to interpret the experimental data. These quantum systems are often modelled by using the Born-Oppenheimer approximation where the coup...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms11327

    authors: Gali A,Demján T,Vörös M,Thiering G,Cannuccia E,Marini A

    更新日期:2016-04-22 00:00:00

  • FLOWERING LOCUS C in monocots and the tandem origin of angiosperm-specific MADS-box genes.

    abstract::MADS-domain transcription factors have been shown to act as key repressors or activators of the transition to flowering and as master regulators of reproductive organ identities. Despite their important roles in plant development, the origin of several MADS-box subfamilies has remained enigmatic so far. Here we demons...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms3280

    authors: Ruelens P,de Maagd RA,Proost S,Theißen G,Geuten K,Kaufmann K

    更新日期:2013-01-01 00:00:00

  • Mesopelagic fishes dominate otolith record of past two millennia in the Santa Barbara Basin.

    abstract::The mesopelagic (200-1000 m) separates the productive upper ocean from the deep ocean, yet little is known of its long-term dynamics despite recent research that suggests fishes of this zone likely dominate global fish biomass and contribute to the downward flux of carbon. Here we show that mesopelagic fishes dominate...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-12600-z

    authors: Jones WA,Checkley DM Jr

    更新日期:2019-10-08 00:00:00

  • Carbon monoxide in an extremely metal-poor galaxy.

    abstract::Extremely metal-poor galaxies with metallicity below 10% of the solar value in the local universe are the best analogues to investigating the interstellar medium at a quasi-primitive environment in the early universe. In spite of the ongoing formation of stars in these galaxies, the presence of molecular gas (which is...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms13789

    authors: Shi Y,Wang J,Zhang ZY,Gao Y,Hao CN,Xia XY,Gu Q

    更新日期:2016-12-09 00:00:00

  • Quadruple bonding between iron and boron in the BFe(CO)3- complex.

    abstract::While main group elements have four valence orbitals accessible for bonding, quadruple bonding to main group elements is extremely rare. Here we report that main group element boron is able to form quadruple bonding interactions with iron in the BFe(CO)3- anion complex, which has been revealed by quantum chemical inve...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-12767-5

    authors: Chi C,Wang JQ,Hu HS,Zhang YY,Li WL,Meng L,Luo M,Zhou M,Li J

    更新日期:2019-10-17 00:00:00

  • The formation and evolution of Titan's winter polar vortex.

    abstract::Saturn's largest moon Titan has a substantial nitrogen-methane atmosphere, with strong seasonal effects, including formation of winter polar vortices. Following Titan's 2009 northern spring equinox, peak solar heating moved to the northern hemisphere, initiating south-polar subsidence and winter polar vortex formation...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-017-01839-z

    authors: Teanby NA,Bézard B,Vinatier S,Sylvestre M,Nixon CA,Irwin PGJ,de Kok RJ,Calcutt SB,Flasar FM

    更新日期:2017-11-21 00:00:00

  • Oscillatory surface rheotaxis of swimming E. coli bacteria.

    abstract::Bacterial contamination of biological channels, catheters or water resources is a major threat to public health, which can be amplified by the ability of bacteria to swim upstream. The mechanisms of this 'rheotaxis', the reorientation with respect to flow gradients, are still poorly understood. Here, we follow individ...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-019-11360-0

    authors: Mathijssen AJTM,Figueroa-Morales N,Junot G,Clément É,Lindner A,Zöttl A

    更新日期:2019-07-31 00:00:00

  • Extreme oceanographic forcing and coastal response due to the 2015-2016 El Niño.

    abstract::The El Niño-Southern Oscillation is the dominant mode of interannual climate variability across the Pacific Ocean basin, with influence on the global climate. The two end members of the cycle, El Niño and La Niña, force anomalous oceanographic conditions and coastal response along the Pacific margin, exposing many hea...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/ncomms14365

    authors: Barnard PL,Hoover D,Hubbard DM,Snyder A,Ludka BC,Allan J,Kaminsky GM,Ruggiero P,Gallien TW,Gabel L,McCandless D,Weiner HM,Cohn N,Anderson DL,Serafin KA

    更新日期:2017-02-14 00:00:00

  • A streamlined pipeline for multiplexed quantitative site-specific N-glycoproteomics.

    abstract::Regulation of protein N-glycosylation is essential in human cells. However, large-scale, accurate, and site-specific quantification of glycosylation is still technically challenging. We here introduce SugarQuant, an integrated mass spectrometry-based pipeline comprising protein aggregation capture (PAC)-based sample p...

    journal_title:Nature communications

    pub_type: 杂志文章

    doi:10.1038/s41467-020-19052-w

    authors: Fang P,Ji Y,Silbern I,Doebele C,Ninov M,Lenz C,Oellerich T,Pan KT,Urlaub H

    更新日期:2020-10-19 00:00:00