Separation and assembly of deep sequencing data into discrete sub-population genomes.

Abstract:

:Sequence heterogeneity is a common characteristic of RNA viruses that is often referred to as sub-populations or quasispecies. Traditional techniques used for assembly of short sequence reads produced by deep sequencing, such as de-novo assemblers, ignore the underlying diversity. Here, we introduce a novel algorithm that simultaneously assembles discrete sequences of multiple genomes present in populations. Using in silico data we were able to detect populations at as low as 0.1% frequency with complete global genome reconstruction and in a single sample detected 16 resolved sequences with no mismatches. We also applied the algorithm to high throughput sequencing data obtained for viruses present in sewage samples and successfully detected multiple sub-populations and recombination events in these diverse mixtures. High sensitivity of the algorithm also enables genomic analysis of heterogeneous pathogen genomes from patient samples and accurate detection of intra-host diversity, enabling not just basic research in personalized medicine but also accurate diagnostics and monitoring drug therapies, which are critical in clinical and regulatory decision-making process.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Karagiannis K,Simonyan V,Chumakov K,Mazumder R

doi

10.1093/nar/gkx755

subject

Has Abstract

pub_date

2017-11-02 00:00:00

pages

10989-11003

issue

19

eissn

0305-1048

issn

1362-4962

pii

4096350

journal_volume

45

pub_type

杂志文章
  • Activation of cryptic 3' splice sites within introns of cellular genes following gene entrapment.

    abstract::Gene trap vectors developed for genome-wide mutagenesis can be used to study factors governing the expression of exons inserted throughout the genome. For example, entrapment vectors consisting of a partial 3'-terminal exon [i.e. a neomycin resistance gene (Neo), a poly(A) site, but no 3' splice site] were typically e...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh604

    authors: Osipovich AB,White-Grindley EK,Hicks GG,Roshon MJ,Shaffer C,Moore JH,Ruley HE

    更新日期:2004-05-20 00:00:00

  • The kinetic properties of cruciform extrusion are determined by DNA base-sequence.

    abstract::The extrusion kinetics of two cruciforms derived from unrelated DNA sequences differ markedly. Kinetic barriers exist for both reactions, necessitating elevated temperatures before extrusion proceeds at measureable speeds, but the dependence upon temperature and ionic strength is quite different for the two sequences....

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.5.1443

    authors: Lilley DM

    更新日期:1985-03-11 00:00:00

  • The regulatory region of the human plasminogen activator inhibitor type-1 (PAI-1) gene.

    abstract::The human gene for plasminogen activator inhibitor type-1 (PAI-1) has been isolated and its promoter region characterized. PAI-1 regulation by glucocorticoids, transforming growth factor-beta (TGF-beta) and the phorbol ester PMA is shown to be exerted at the promoter level. A fragment spanning 805 nucleotides of the 5...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/16.7.2805

    authors: Riccio A,Lund LR,Sartorio R,Lania A,Andreasen PA,Danø K,Blasi F

    更新日期:1988-04-11 00:00:00

  • An improved method for photofootprinting yeast genes in vivo using Taq polymerase.

    abstract::We have developed an improved method for photofootprinting in vivo which utilizes the thermostable DNA polymerase from T. aquaticus (Taq) in a primer extension assay. UV light is used to introduce photoproducts into the genomic DNA of intact yeast cells. The photoproducts are then detected and mapped at the nucleotide...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/17.1.171

    authors: Axelrod JD,Majors J

    更新日期:1989-01-11 00:00:00

  • RB signaling prevents replication-dependent DNA double-strand breaks following genotoxic insult.

    abstract::Cell cycle checkpoints induced by DNA damage play an integral role in preservation of genomic stability by allowing cells to limit the propagation of deleterious mutations. The retinoblastoma tumor suppressor (RB) is crucial for the maintenance of the DNA damage checkpoint function because it elicits cell cycle arrest...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkg919

    authors: Bosco EE,Mayhew CN,Hennigan RF,Sage J,Jacks T,Knudsen ES

    更新日期:2004-01-02 00:00:00

  • Hypermethylated-capped selenoprotein mRNAs in mammals.

    abstract::Mammalian mRNAs are generated by complex and coordinated biogenesis pathways and acquire 5'-end m(7)G caps that play fundamental roles in processing and translation. Here we show that several selenoprotein mRNAs are not recognized efficiently by translation initiation factor eIF4E because they bear a hypermethylated c...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku580

    authors: Wurth L,Gribling-Burrer AS,Verheggen C,Leichter M,Takeuchi A,Baudrey S,Martin F,Krol A,Bertrand E,Allmang C

    更新日期:2014-07-01 00:00:00

  • NCBI's LocusLink and RefSeq.

    abstract::The NCBI has introduced two new web resources-LocusLink and RefSeq-that facilitate retrieval of gene-based information and provide reference sequence standards. These resources are designed to provide a non-redundant view of current knowledge about human genes, transcripts and proteins. Additional information about th...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/28.1.126

    authors: Maglott DR,Katz KS,Sicotte H,Pruitt KD

    更新日期:2000-01-01 00:00:00

  • Characteristic arrangement of nucleosomes is predictive of chromatin interactions at kilobase resolution.

    abstract::High-throughput chromosome conformation capture (3C) technologies, such as Hi-C, have made it possible to survey 3D genome structure. However, obtaining 3D profiles at kilobase resolution at low cost remains a major challenge. Therefore, we herein present an algorithm for precise identification of chromatin interactio...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx885

    authors: Zhang H,Li F,Jia Y,Xu B,Zhang Y,Li X,Zhang Z

    更新日期:2017-12-15 00:00:00

  • The 5' terminal capping of heterogeneous nuclear RNA at different embryonic stages of the sea urchin.

    abstract::5' Terminal cap structures of hnRNA have been characterized and the extent of capping determined as a function of embryonic development. Sea urchin embryo hnRNA contains only the type-1 cap, m7GpppNmpNp, with the type-2 cap, which has a 2'-0-methylated subpenultimate nucleotide, being associated only with stable small...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/6.6.2307

    authors: Nemer M,Surrey S,Ginzburg I,Echols MM

    更新日期:1979-01-01 00:00:00

  • Amplified inverted duplications within and adjacent to heterologous selectable DNA.

    abstract::Plasmids containing a dihydrofolate reductase (DHFR) expression unit were transfected into DHFR-deficient Chinese hamster ovary (CHO) cells. Methotrexate exposure was used to select cells with amplified DHFR sequences. Three cell lines were isolated containing amplified copies of transfected DNA that had integrated in...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/17.4.1697

    authors: Heartlein MW,Latt SA

    更新日期:1989-02-25 00:00:00

  • Improvements to solid phase phosphotriester synthesis of deoxyoligonucleotides.

    abstract::A solution of benzenesulphonic acid (3%, w/v) in a dimethylformamide and dichloromethane mixture (9:1, v/v) is shown to be a very effective reagent for the detritylation of deoxyoligonucleotides attached to a solid support. The levels of depurination with this reagent were lower than those observed with other reagents...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.18.5605

    authors: Patel TP,Millican TA,Bose CC,Titmas RC,Mock GA,Eaton MA

    更新日期:1982-09-25 00:00:00

  • iPcc: a novel feature extraction method for accurate disease class discovery and prediction.

    abstract::Gene expression profiling has gradually become a routine procedure for disease diagnosis and classification. In the past decade, many computational methods have been proposed, resulting in great improvements on various levels, including feature selection and algorithms for classification and clustering. In this study,...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt343

    authors: Ren X,Wang Y,Zhang XS,Jin Q

    更新日期:2013-08-01 00:00:00

  • The European Nucleotide Archive.

    abstract::The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena) is Europe's primary nucleotide-sequence repository. The ENA consists of three main databases: the Sequence Read Archive (SRA), the Trace Archive and EMBL-Bank. The objective of ENA is to support and promote the use of nucleotide sequencing as an experimen...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq967

    authors: Leinonen R,Akhtar R,Birney E,Bower L,Cerdeno-Tárraga A,Cheng Y,Cleland I,Faruque N,Goodgame N,Gibson R,Hoad G,Jang M,Pakseresht N,Plaister S,Radhakrishnan R,Reddy K,Sobhany S,Ten Hoopen P,Vaughan R,Zalunin V,Cochr

    更新日期:2011-01-01 00:00:00

  • Multiple functions for the N-terminal region of Msh6.

    abstract::The eukaryotic mismatch repair protein Msh6 shares five domains in common with other MutS members. However, it also contains several hundred additional residues at its N-terminus. A few of these residues bind to PCNA, but the functions of the other amino acids in the N-terminal region (NTR) are unknown. Here we demons...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkm409

    authors: Clark AB,Deterding L,Tomer KB,Kunkel TA

    更新日期:2007-01-01 00:00:00

  • The pseudodisaccharides: a novel class of group I intron splicing inhibitors.

    abstract::Lysinomicin, a naturally-occurring pseudodisaccharide, inhibits translation in prokaryotes. We report that lysinomicin (and three related compounds) are able to inhibit the self-splicing of group I introns, thus identifying pseudodisaccharides as a novel class of group I intron splicing inhibitors. Lysinomicin inhibit...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/22.23.4983

    authors: Rogers J,Davies J

    更新日期:1994-11-25 00:00:00

  • Structural basis for substrate binding and catalytic mechanism of a human RNA:m5C methyltransferase NSun6.

    abstract::5-methylcytosine (m5C) modifications of RNA are ubiquitous in nature and play important roles in many biological processes such as protein translational regulation, RNA processing and stress response. Aberrant expressions of RNA:m5C methyltransferases are closely associated with various human diseases including cancer...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx473

    authors: Liu RJ,Long T,Li J,Li H,Wang ED

    更新日期:2017-06-20 00:00:00

  • The Comparative Toxicogenomics Database: update 2019.

    abstract::The Comparative Toxicogenomics Database (CTD; http://ctdbase.org/) is a premier public resource for literature-based, manually curated associations between chemicals, gene products, phenotypes, diseases, and environmental exposures. In this biennial update, we present our new chemical-phenotype module that codes chemi...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky868

    authors: Davis AP,Grondin CJ,Johnson RJ,Sciaky D,McMorran R,Wiegers J,Wiegers TC,Mattingly CJ

    更新日期:2019-01-08 00:00:00

  • In silico characterization and prediction of global protein-mRNA interactions in yeast.

    abstract::Post-transcriptional gene regulation is mediated through complex networks of protein-RNA interactions. The targets of only a few RNA binding proteins (RBPs) are known, even in the well-characterized budding yeast. In silico prediction of protein-RNA interactions is therefore useful to guide experiments and to provide ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkr160

    authors: Pancaldi V,Bähler J

    更新日期:2011-08-01 00:00:00

  • Short antisense-locked nucleic acids (all-LNAs) correct alternative splicing abnormalities in myotonic dystrophy.

    abstract::Myotonic dystrophy type 1 (DM1) is an autosomal dominant multisystemic disorder caused by expansion of CTG triplet repeats in 3'-untranslated region of DMPK gene. The pathomechanism of DM1 is driven by accumulation of toxic transcripts containing expanded CUG repeats (CUG(exp)) in nuclear foci which sequester several ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv163

    authors: Wojtkowiak-Szlachcic A,Taylor K,Stepniak-Konieczna E,Sznajder LJ,Mykowska A,Sroka J,Thornton CA,Sobczak K

    更新日期:2015-03-31 00:00:00

  • Cis and trans regulatory elements required for regulation of the CHO1 gene of Saccharomyces cerevisiae.

    abstract::A 34 base-pair (bp) fragment spanning sequences -154 to -120 of the promoter of the CHO1 gene (structural gene for phosphatidylserine synthase) from the yeast Saccharomyces cerevisiae has been shown to place transcription of a promoter-less Escherichia coli lacZ gene under control of the phospholipid precursors inosit...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/20.6.1411

    authors: Bailis AM,Lopes JM,Kohlwein SD,Henry SA

    更新日期:1992-03-25 00:00:00

  • TopNet: a tool for comparing biological sub-networks, correlating protein properties with topological statistics.

    abstract::Biological networks are a topic of great current interest, particularly with the publication of a number of large genome-wide interaction datasets. They are globally characterized by a variety of graph-theoretic statistics, such as the degree distribution, clustering coefficient, characteristic path length and diamete...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkh164

    authors: Yu H,Zhu X,Greenbaum D,Karro J,Gerstein M

    更新日期:2004-01-14 00:00:00

  • Bovine Genome Database: integrated tools for genome annotation and discovery.

    abstract::The Bovine Genome Database (BGD; http://BovineGenome.org) strives to improve annotation of the bovine genome and to integrate the genome sequence with other genomics data. BGD includes GBrowse genome browsers, the Apollo Annotation Editor, a quantitative trait loci (QTL) viewer, BLAST databases and gene pages. Genome ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkq1235

    authors: Childers CP,Reese JT,Sundaram JP,Vile DC,Dickens CM,Childs KL,Salih H,Bennett AK,Hagen DE,Adelson DL,Elsik CG

    更新日期:2011-01-01 00:00:00

  • The SWI/SNF ATP-dependent nucleosome remodeler promotes resection initiation at a DNA double-strand break in yeast.

    abstract::DNA double-strand breaks (DSBs) are repaired by either the non-homologous end joining (NHEJ) or homologous recombination (HR) pathway. Pathway choice is determined by the generation of 3΄ single-strand DNA overhangs at the break that are initiated by the action of the Mre11-Rad50-Xrs2 (MRX) complex to direct repair to...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx221

    authors: Wiest NE,Houghtaling S,Sanchez JC,Tomkinson AE,Osley MA

    更新日期:2017-06-02 00:00:00

  • Structure and evolution of a mouse tRNA gene cluster encoding tRNAAsp, tRNAGly and tRNAGlu and an unlinked, solitary gene encoding tRNAAsp.

    abstract::We have sequenced mouse tRNA genes from two recombinant lambda phage. An 1800 bp sequence from one phage contains 3 tRNA genes, potentially encoding tRNAAsp, tRNAGly, and tRNAGlu, separated by spacer sequences of 587 bp and 436 bp, respectively. The mouse tRNA gene cluster is homologous to a rat sequence (Sekiya et al...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/11.24.8761

    authors: Looney JE,Harding JD

    更新日期:1983-12-20 00:00:00

  • Unique sequences are interspersed among tandemly repeated elements in the murine gamma 1 switch segment.

    abstract::Early in its differentiative pathway, a given B lymphocyte expresses immunoglobulin of the mu heavy chain class (IgM). Subsequent differentiative processes may involve rearrangement within the immunoglobulin heavy chain chromosomal locus to enable cells of the same lineage to synthesize immunoglobulins of other heavy ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.1.225

    authors: Mowatt M,Dery C,Dunnick W

    更新日期:1985-01-11 00:00:00

  • Conformation transitions of eukaryotic polyribosomes during multi-round translation.

    abstract::Using sedimentation and cryo electron tomography techniques, the conformations of eukaryotic polyribosomes formed in a long-term cell-free translation system were analyzed over all the active system lifetime (20-30 translation rounds during 6-8 h in wheat germ extract at 25°C). Three distinct types of the conformation...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku1270

    authors: Afonina ZA,Myasnikov AG,Shirokov VA,Klaholz BP,Spirin AS

    更新日期:2015-01-01 00:00:00

  • A method for generating subtractive cDNA libraries retaining clones containing repetitive elements.

    abstract::Here we describe a two-stepped photobiotin-based procedure to enrich a target (canine retinal) cDNA library for tissue specific clones without removing those containing repetitive ( SINE ) elements, despite the presence of these elements in the driver population. In a first hybridization excess SINE elements were hybr...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/25.21.4427

    authors: Lin CT,Sargan DR

    更新日期:1997-11-01 00:00:00

  • A comprehensive catalog of predicted functional upstream open reading frames in humans.

    abstract::Upstream open reading frames (uORFs) latent in mRNA transcripts are thought to modify translation of coding sequences by altering ribosome activity. Not all uORFs are thought to be active in such a process. To estimate the impact of uORFs on the regulation of translation in humans, we first circumscribed the universe ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky188

    authors: McGillivray P,Ault R,Pawashe M,Kitchen R,Balasubramanian S,Gerstein M

    更新日期:2018-04-20 00:00:00

  • Structural determinants of an internal ribosome entry site that direct translational reading frame selection.

    abstract::The dicistrovirus intergenic internal ribosome entry site (IGR IRES) directly recruits the ribosome and initiates translation using a non-AUG codon. A subset of IGR IRESs initiates translation in either of two overlapping open reading frames (ORFs), resulting in expression of the 0 frame viral structural polyprotein a...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gku622

    authors: Ren Q,Au HH,Wang QS,Lee S,Jan E

    更新日期:2014-08-01 00:00:00

  • AVPpred: collection and prediction of highly effective antiviral peptides.

    abstract::In the battle against viruses, antiviral peptides (AVPs) had demonstrated the immense potential. Presently, more than 15 peptide-based drugs are in various stages of clinical trials. Emerging and re-emerging viruses further emphasize the efforts to accelerate antiviral drug discovery efforts. Despite, huge importance ...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks450

    authors: Thakur N,Qureshi A,Kumar M

    更新日期:2012-07-01 00:00:00