The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Abstract:

:FASTQ has emerged as a common file format for sharing sequencing read data combining both the sequence and an associated per base quality score, despite lacking any formal definition to date, and existing in at least three incompatible variants. This article defines the FASTQ format, covering the original Sanger standard, the Solexa/Illumina variants and conversion between them, based on publicly available information such as the MAQ documentation and conventions recently agreed by the Open Bioinformatics Foundation projects Biopython, BioPerl, BioRuby, BioJava and EMBOSS. Being an open access publication, it is hoped that this description, with the example files provided as Supplementary Data, will serve in future as a reference for this important file format.

journal_name

Nucleic Acids Res

journal_title

Nucleic acids research

authors

Cock PJ,Fields CJ,Goto N,Heuer ML,Rice PM

doi

10.1093/nar/gkp1137

subject

Has Abstract

pub_date

2010-04-01 00:00:00

pages

1767-71

issue

6

eissn

0305-1048

issn

1362-4962

pii

gkp1137

journal_volume

38

pub_type

历史文章,杂志文章,评审
  • MRX protects fork integrity at protein-DNA barriers, and its absence causes checkpoint activation dependent on chromatin context.

    abstract::To address how eukaryotic replication forks respond to fork stalling caused by strong non-covalent protein-DNA barriers, we engineered the controllable Fob-block system in Saccharomyces cerevisiae. This system allows us to strongly induce and control replication fork barriers (RFB) at their natural location within the...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt051

    authors: Bentsen IB,Nielsen I,Lisby M,Nielsen HB,Gupta SS,Mundbjerg K,Andersen AH,Bjergbaek L

    更新日期:2013-03-01 00:00:00

  • Elongation of repetitive DNA by DNA polymerase from a hyperthermophilic bacterium Thermus thermophilus.

    abstract::Short repetitive DNA sequences are believed to be one of the primordial genetic elements that served as a source of complex large DNA found in the genome of modern organisms. However, the mechanism of its expansion (increase in repeat number) during the course of evolution is unclear. We demonstrate that the DNA polym...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/28.20.3999

    authors: Ogata N,Morino H

    更新日期:2000-10-15 00:00:00

  • Homologies between X and Y chromosomes detected by DNA probes: localisation and evolution.

    abstract::We have isolated and characterized DNA probes that detect homologies between the X and Y chromosomes. Clone St25 is derived from the q13-q22 region of the X chromosome and recognizes a 98% homologous sequence on the Y chromosome. Y specific fragments were present in DNAs from 5 Yq-individuals and from 4 out of 7 XX ma...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.15.5485

    authors: Koenig M,Moisan JP,Heilig R,Mandel JL

    更新日期:1985-08-12 00:00:00

  • Methylation of human eukaryotic elongation factor alpha (eEF1A) by a member of a novel protein lysine methyltransferase family modulates mRNA translation.

    abstract::Many cellular proteins are methylated on lysine residues and this has been most intensively studied for histone proteins. Lysine methylations on non-histone proteins are also frequent, but in most cases the functional significance of the methylation event, as well as the identity of the responsible lysine (K) specific...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx432

    authors: Jakobsson ME,Malecki J,Nilges BS,Moen A,Leidel SA,Falnes PØ

    更新日期:2017-08-21 00:00:00

  • Monitoring mis-acylated tRNA suppression efficiency in mammalian cells via EGFP fluorescence recovery.

    abstract::A reporter assay was developed to detect and quantify nonsense codon suppression by chemically aminoacylated tRNAs in mammalian cells. It is based on the cellular expression of the enhanced green fluorescent protein (EGFP) as a reporter for the site-specific amino acid incorporation in its sequence using an orthogonal...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gnf128

    authors: Ilegems E,Pick HM,Vogel H

    更新日期:2002-12-01 00:00:00

  • Specific hydrolysis of methionyl-tRNA Met f catalyzed by a purified peptide.

    abstract::A peptide initiation factor purified from rat liver and promoting the binding of initiator tRNA and model initiators to 40S and 80S ribosome at an acid pH liberates methionine and N-acetylmethionine from Trna Met f at neutral reaction. Phenylalanyl-tRNA, N-acetylphenylalanyl-tRNA and methionyl-tRNA Met m are not hydro...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/2.11.2119

    authors: Hradec J

    更新日期:1975-11-01 00:00:00

  • Assessment of clone identity and sequence fidelity for 1189 IMAGE cDNA clones.

    abstract::This report documents the error rate in a commercially distributed subset of the IMAGE Consortium mouse cDNA clone collection. After isolation of plasmid DNA from 1189 bacterial stock cultures, only 62. 2% were uncontaminated and contained cDNA inserts that had significant sequence identity to published data for the o...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/29.2.582

    authors: Halgren RG,Fielden MR,Fong CJ,Zacharewski TR

    更新日期:2001-01-15 00:00:00

  • Transcription in vivo directed by consensus sequences of E.coli promoters: their context heavily affects efficiencies and start sites.

    abstract::We studied in vivo transcription and gene expression directed by a series of synthetic sequences, bearing the consensus hexamer (CH) pair of E.coli promoters in various contexts. The results demonstrate that, for the contexts tested, the CH pair supports transcription activity and gene expression, whether the spacer l...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/18.5.1137

    authors: Jacquet MA,Reiss C

    更新日期:1990-03-11 00:00:00

  • A new efficient gene disruption cassette for repeated use in budding yeast.

    abstract::The dominant kanr marker gene plays an important role in gene disruption experiments in budding yeast, as this marker can be used in a variety of yeast strains lacking the conventional yeast markers. We have developed a loxP-kanMX-loxP gene disruption cassette, which combines the advantages of the heterologous kanr ma...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.13.2519

    authors: Güldener U,Heck S,Fielder T,Beinhauer J,Hegemann JH

    更新日期:1996-07-01 00:00:00

  • Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics.

    abstract::Extensive efforts have been directed at the discovery, investigation and clinical monitoring of targeted therapeutics. These efforts may be facilitated by the convenient access of the genetic, proteomic, interactive and other aspects of the therapeutic targets. Here, we describe an update of the Therapeutic target dat...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx1076

    authors: Li YH,Yu CY,Li XX,Zhang P,Tang J,Yang Q,Fu T,Zhang X,Cui X,Tu G,Zhang Y,Li S,Yang F,Sun Q,Qin C,Zeng X,Chen Z,Chen YZ,Zhu F

    更新日期:2018-01-04 00:00:00

  • PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences.

    abstract::The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkv457

    authors: Šulc M,Marín RM,Robins HS,Vaníček J

    更新日期:2015-07-01 00:00:00

  • Cpf1 protein induced bending of yeast centromere DNA element I.

    abstract::The centromere complex is a multicomponent structure essential for faithful chromosome transmission. Here we show that the S. cerevisiae centromere protein Cpf1 bends centromere DNA element I (CDEI) with the bend angle ranging from 66 degrees to 71 degrees. CDEI DNA sequences that carry point mutations which lead to r...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/21.20.4726

    authors: Niedenthal RK,Sen-Gupta M,Wilmen A,Hegemann JH

    更新日期:1993-10-11 00:00:00

  • eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses.

    abstract::eggNOG is a public database of orthology relationships, gene evolutionary histories and functional annotations. Here, we present version 5.0, featuring a major update of the underlying genome sets, which have been expanded to 4445 representative bacteria and 168 archaea derived from 25 038 genomes, as well as 477 euka...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gky1085

    authors: Huerta-Cepas J,Szklarczyk D,Heller D,Hernández-Plaza A,Forslund SK,Cook H,Mende DR,Letunic I,Rattei T,Jensen LJ,von Mering C,Bork P

    更新日期:2019-01-08 00:00:00

  • Characterization of human 5S rRNA genes.

    abstract::The human 5S rRNA genes are found in clusters of tandem repeated units. We have cloned and partially characterized six restriction fragments from two clusters of 2.3 kb and 1.6 kb repeats, respectively. Four fragments from the cluster of 2.3 kb repeats contain a 5S rRNA gene and one fragment contains a gene variant wi...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/19.15.4147

    authors: Sørensen PD,Frederiksen S

    更新日期:1991-08-11 00:00:00

  • Spectral clustering of protein sequences.

    abstract::An important problem in genomics is automatically clustering homologous proteins when only sequence information is available. Most methods for clustering proteins are local, and are based on simply thresholding a measure related to sequence distance. We first show how locality limits the performance of such methods by...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj515

    authors: Paccanaro A,Casbon JA,Saqi MA

    更新日期:2006-03-17 00:00:00

  • ST1710-DNA complex crystal structure reveals the DNA binding mechanism of the MarR family of regulators.

    abstract::ST1710, a member of the multiple antibiotic resistance regulator (MarR) family of regulatory proteins in bacteria and archaea, plays important roles in development of antibiotic resistance, a global health problem. Here, we present the crystal structure of ST1710 from Sulfolobus tokodaii strain 7 complexed with salicy...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp496

    authors: Kumarevel T,Tanaka T,Umehara T,Yokoyama S

    更新日期:2009-08-01 00:00:00

  • Nucleotide sequence of satellite DNA contained in the eliminated genome of Ascaris lumbricoides.

    abstract::Several restriction endonuclease fragments isolated from highly repetitive satellite DNA of the chromatin eliminating nematode Ascaris lumbricoides var. suum have been cloned. Each type of restriction fragment corresponds to a different variant of the same related ancestral sequence. These variants differ by small del...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/10.23.7493

    authors: Müller F,Walker P,Aeby P,Neuhaus H,Felder H,Back E,Tobler H

    更新日期:1982-12-11 00:00:00

  • Synthesis and coding properties of dinucleoside diphosphates containing alky pyrimidines which are formed by the action of carcinogens on nucleic acids.

    abstract::Dinucleoside diphosphates of the general type pGpN have been prepared enzymatically using ribonuclease N1. Alkylated uridines or cytidines, which are products of carcinogens acting on nucleic acids, were tested in dinucleoside diphosphates for their ability to stimulate the binding of Ala- or Val-tRNA to ribosomes. O2...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/6.4.1709

    authors: Singer B,Pergolizzi RG,Grunberger D

    更新日期:1979-04-01 00:00:00

  • An improved method for circular RNA purification using RNase R that efficiently removes linear RNAs containing G-quadruplexes or structured 3' ends.

    abstract::Thousands of eukaryotic protein-coding genes generate circular RNAs that have covalently linked ends and are resistant to degradation by exonucleases. To prove their circularity as well as biochemically enrich these transcripts, it has become standard in the field to use the 3'-5' exonuclease RNase R. Here, we demonst...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkz576

    authors: Xiao MS,Wilusz JE

    更新日期:2019-09-19 00:00:00

  • Gene for OTC: characterisation and linkage to Duchenne muscular dystrophy.

    abstract::Cloned coding sequences for rat and human ornithine transcarbamylase (OTC) were obtained by screening a rat and a human cDNA library respectively with a synthetic oligonucleotide corresponding to 27 bases of the rat sequence. These clones, 1100 bp long for the rat clone and 1300 bp for the human, contain approximately...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/13.1.155

    authors: Davies KE,Briand P,Ionasescu V,Ionasescu G,Williamson R,Brown C,Cavard C,Cathelineau L

    更新日期:1985-01-11 00:00:00

  • NMR structural analysis of DNA recognition by a novel Myb1 DNA-binding domain in the protozoan parasite Trichomonas vaginalis.

    abstract::The transcription regulator, tvMyb1, is the first Myb family protein identified in Trichomonas vaginalis. Using an electrophoretic mobility shift assay, we defined the amino-acid sequence from Lys(35) to Ser(141) (tvMyb1(35-141)) as the minimal DNA-binding domain, encompassing two Myb-like DNA-binding motifs (designat...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp097

    authors: Lou YC,Wei SY,Rajasekaran M,Chou CC,Hsu HM,Tai JH,Chen C

    更新日期:2009-04-01 00:00:00

  • CasHRA (Cas9-facilitated Homologous Recombination Assembly) method of constructing megabase-sized DNA.

    abstract::Current DNA assembly methods for preparing highly purified linear subassemblies require complex and time-consuming in vitro manipulations that hinder their ability to construct megabase-sized DNAs (e.g. synthetic genomes). We have developed a new method designated 'CasHRA (Cas9-facilitated Homologous Recombination Ass...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw475

    authors: Zhou J,Wu R,Xue X,Qin Z

    更新日期:2016-08-19 00:00:00

  • Structural mechanisms of the degenerate sequence recognition by Bse634I restriction endonuclease.

    abstract::Restriction endonuclease Bse634I recognizes and cleaves the degenerate DNA sequence 5'-R/CCGGY-3' (R stands for A or G; Y for T or C, '/' indicates a cleavage position). Here, we report the crystal structures of the Bse634I R226A mutant complexed with cognate oligoduplexes containing ACCGGT and GCCGGC sites, respectiv...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gks300

    authors: Manakova E,Grazulis S,Zaremba M,Tamulaitiene G,Golovenko D,Siksnys V

    更新日期:2012-08-01 00:00:00

  • Specifically alkylated DNA fragments. Synthesis and physical characterization of d[CGC(O6Me)GCG] and d[CGT(O6Me)GCG].

    abstract::Two hexamer DNA fragments containing a carcinogenic modified base, O6-methyl guanine, have been synthesized by a solid-phase phosphotriester method, in which the unmodified guanine residues present were O6 protected with the 4-nitrophenylethyl group. These two alkylated oligonucleotides were found to have similar Tm's...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/11.10.3393

    authors: Kuzmich S,Marky LA,Jones RA

    更新日期:1983-05-25 00:00:00

  • Tetrahymena Genome Database (TGD): a new genomic resource for Tetrahymena thermophila research.

    abstract::We have developed a web-based resource (available at www.ciliate.org) for researchers studying the model ciliate organism Tetrahymena thermophila. Employing the underlying database structure and programming of the Saccharomyces Genome Database, the Tetrahymena Genome Database (TGD) integrates the wealth of knowledge g...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkj054

    authors: Stover NA,Krieger CJ,Binkley G,Dong Q,Fisk DG,Nash R,Sethuraman A,Weng S,Cherry JM

    更新日期:2006-01-01 00:00:00

  • Structure and mechanism of the 2',3' phosphatase component of the bacterial Pnkp-Hen1 RNA repair system.

    abstract::Pnkp is the end-healing and end-sealing component of an RNA repair system present in diverse bacteria from many phyla. Pnkp is composed of three catalytic modules: an N-terminal polynucleotide 5' kinase, a central 2',3' phosphatase and a C-terminal ligase. The phosphatase module is a Mn(2+)-dependent phosphodiesterase...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkt221

    authors: Wang LK,Smith P,Shuman S

    更新日期:2013-06-01 00:00:00

  • A directed evolution design of a GCG-specific DNA hemimethylase.

    abstract::DNA cytosine-5 methyltransferases (C5-MTases) are valuable models to study sequence-specific modification of DNA and are becoming increasingly important tools for biotechnology. Here we describe a structure-guided rational protein design combined with random mutagenesis and selection to change the specificity of the H...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkp772

    authors: Gerasimaite R,Vilkaitis G,Klimasauskas S

    更新日期:2009-11-01 00:00:00

  • Investigation of some properties of oligodeoxynucleotides containing 4'-thio-2'-deoxynucleotides: duplex hybridization and nuclease sensitivity.

    abstract::The thermal stabilities of the duplexes formed between 4'-thio-modified oligodeoxynucleotides and their DNA and RNA complementary strands were determined and compared with those of the corresponding unmodified oligodeoxynucleotides. A 16mer oligodeoxynucleotide containing 10 contiguous 4'-thiothymidylate modifications...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/24.21.4117

    authors: Jones GD,Lesnik EA,Owens SR,Risen LM,Walker RT

    更新日期:1996-11-01 00:00:00

  • PBrowse: a web-based platform for real-time collaborative exploration of genomic data.

    abstract::Genome browsers are widely used for individually exploring various types of genomic data. A handful of genome browsers offer limited tools for collaboration among multiple users. Here, we describe PBrowse, an integrated real-time collaborative genome browser that enables multiple users to simultaneously view and acces...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkw1358

    authors: Szot PS,Yang A,Wang X,Parsania C,Röhm U,Wong KH,Ho JWK

    更新日期:2017-05-19 00:00:00

  • Structural rearrangements in mRNA upon its binding to human 80S ribosomes revealed by EPR spectroscopy.

    abstract::The model mRNA (MR), 11-mer RNA containing two nitroxide spin labels at the 5'- and 3'-terminal nucleotides and prone to form a stable homodimer (MR)2, was used for Electron Paramagnetic Resonance study of structural rearrangements in mRNA occurring upon its binding to human 80S ribosomes. The formation of two differe...

    journal_title:Nucleic acids research

    pub_type: 杂志文章

    doi:10.1093/nar/gkx1136

    authors: Malygin AA,Graifer DM,Meschaninova MI,Venyaminova AG,Timofeev IO,Kuzhelev AA,Krumkacheva OA,Fedin MV,Karpova GG,Bagryanskaya EG

    更新日期:2018-01-25 00:00:00