Short-insert libraries as a method of problem solving in genome sequencing.

Abstract:

:As the Human Genome Project moves into its sequencing phase, a serious problem has arisen. The same problem has been increasingly vexing in the closing phase of the Caenorhabditis elegans project. The difficulty lies in sequencing efficiently through certain regions in which the templates (DNA substrates for the sequencing process) form complex folded secondary structures that are inaccessible to the enzymes. The solution, however, is simply to break them up. Specifically, the offending fragments are sonicated heavily and recloned, as much smaller fragments, into pUC vector. The sequences obtained from the resulting library can subsequently be assembled, free from the effects of secondary structure, to produce high-quality, complete sequence. Because of the success and simplicity of this procedure, we have begun to use it for the sequencing of all regions in which standard primer walking has been at all difficult.

journal_name

Genome Res

journal_title

Genome research

authors

McMurray AA,Sulston JE,Quail MA

doi

10.1101/gr.8.5.562

subject

Has Abstract

pub_date

1998-05-01 00:00:00

pages

562-6

issue

5

eissn

1088-9051

issn

1549-5469

journal_volume

8

pub_type

杂志文章
  • Random mutagenesis of proximal mouse chromosome 5 uncovers predominantly embryonic lethal mutations.

    abstract::A region-specific ENU mutagenesis screen was conducted to elucidate the functional content of proximal mouse Chr 5. We used the visibly marked, recessive, lethal inversion Rump White (Rw) as a balancer in a three-generation breeding scheme to identify recessive mutations within the approximately 50 megabases spanned b...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3826505

    authors: Wilson L,Ching YH,Farias M,Hartford SA,Howell G,Shao H,Bucan M,Schimenti JC

    更新日期:2005-08-01 00:00:00

  • The portability of tagSNPs across populations: a worldwide survey.

    abstract::In the search for common genetic variants that contribute to prevalent human diseases, patterns of linkage disequilibrium (LD) among linked markers should be considered when selecting SNPs. Genotyping efficiency can be increased by choosing tagging SNPs (tagSNPs) in LD with other SNPs. However, it remains to be seen w...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4138406

    authors: González-Neira A,Ke X,Lao O,Calafell F,Navarro A,Comas D,Cann H,Bumpstead S,Ghori J,Hunt S,Deloukas P,Dunham I,Cardon LR,Bertranpetit J

    更新日期:2006-03-01 00:00:00

  • Detecting copy number variation with mated short reads.

    abstract::The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106344.110

    authors: Medvedev P,Fiume M,Dzamba M,Smith T,Brudno M

    更新日期:2010-11-01 00:00:00

  • Most parsimonious reconciliation in the presence of gene duplication, loss, and deep coalescence using labeled coalescent trees.

    abstract::Accurate gene tree-species tree reconciliation is fundamental to inferring the evolutionary history of a gene family. However, although it has long been appreciated that population-related effects such as incomplete lineage sorting (ILS) can dramatically affect the gene tree, many of the most popular reconciliation me...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.161968.113

    authors: Wu YC,Rasmussen MD,Bansal MS,Kellis M

    更新日期:2014-03-01 00:00:00

  • Retroelement distributions in the human genome: variations associated with age and proximity to genes.

    abstract::Remnants of more than 3 million transposable elements, primarily retroelements, comprise nearly half of the human genome and have generated much speculation concerning their evolutionary significance. We have exploited the draft human genome sequence to examine the distributions of retroelements on a genome-wide scale...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.388902

    authors: Medstrand P,van de Lagemaat LN,Mager DL

    更新日期:2002-10-01 00:00:00

  • Systematic functional characterization of cis-regulatory motifs in human core promoters.

    abstract::A large number of cis-regulatory motifs involved in transcriptional control have been identified, but the regulatory context and biological processes in which many of them function are unknown. Here, we computationally identify the sets of human core promoters targeted by motifs, and systematically characterize their ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6828808

    authors: Sinha S,Adler AS,Field Y,Chang HY,Segal E

    更新日期:2008-03-01 00:00:00

  • An extraordinary retrotransposon family encoding dual endonucleases.

    abstract::Retrotransposons commonly encode a reverse transcriptase (RT), but other functional domains are variable. The acquisition of new domains is the dominant evolutionary force that brings structural variety to retrotransposons. Non-long-terminal-repeat (non-LTR) retrotransposons are classified into two groups by their str...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3271405

    authors: Kojima KK,Fujiwara H

    更新日期:2005-08-01 00:00:00

  • Mouse population-guided resequencing reveals that variants in CD44 contribute to acetaminophen-induced liver injury in humans.

    abstract::Interindividual variability in response to chemicals and drugs is a common regulatory concern. It is assumed that xenobiotic-induced adverse reactions have a strong genetic basis, but many mechanism-based investigations have not been successful in identifying susceptible individuals. While recent advances in pharmacog...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.090241.108

    authors: Harrill AH,Watkins PB,Su S,Ross PK,Harbourt DE,Stylianou IM,Boorman GA,Russo MW,Sackler RS,Harris SC,Smith PC,Tennant R,Bogue M,Paigen K,Harris C,Contractor T,Wiltshire T,Rusyn I,Threadgill DW

    更新日期:2009-09-01 00:00:00

  • Two contrasting classes of nucleolus-associated domains in mouse fibroblast heterochromatin.

    abstract::In interphase eukaryotic cells, almost all heterochromatin is located adjacent to the nucleolus or to the nuclear lamina, thus defining nucleolus-associated domains (NADs) and lamina-associated domains (LADs), respectively. Here, we determined the first genome-scale map of murine NADs in mouse embryonic fibroblasts (M...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.247072.118

    authors: Vertii A,Ou J,Yu J,Yan A,Pagès H,Liu H,Zhu LJ,Kaufman PD

    更新日期:2019-08-01 00:00:00

  • A systematic model to predict transcriptional regulatory mechanisms based on overrepresentation of transcription factor binding profiles.

    abstract::An important aspect of understanding a biological pathway is to delineate the transcriptional regulatory mechanisms of the genes involved. Two important tasks are often encountered when studying transcription regulation, i.e., (1) the identification of common transcriptional regulators of a set of coexpressed genes; (...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4303406

    authors: Chang LW,Nagarajan R,Magee JA,Milbrandt J,Stormo GD

    更新日期:2006-03-01 00:00:00

  • Properties of overlapping genes are conserved across microbial genomes.

    abstract::There are numerous examples from the genomes of viruses, mitochondria, and chromosomes that adjacent genes can overlap, sharing at least one nucleotide. Overlaps have been hypothesized to be involved in genome size minimization and as a regulatory mechanism of gene expression. Here we show that overlapping genes are a...

    journal_title:Genome research

    pub_type: 信件

    doi:10.1101/gr.2433104

    authors: Johnson ZI,Chisholm SW

    更新日期:2004-11-01 00:00:00

  • Cell-type, allelic, and genetic signatures in the human pancreatic beta cell transcriptome.

    abstract::Elucidating the pathophysiology and molecular attributes of common disorders as well as developing targeted and effective treatments hinges on the study of the relevant cell type and tissues. Pancreatic beta cells within the islets of Langerhans are centrally involved in the pathogenesis of both type 1 and type 2 diab...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.150706.112

    authors: Nica AC,Ongen H,Irminger JC,Bosco D,Berney T,Antonarakis SE,Halban PA,Dermitzakis ET

    更新日期:2013-09-01 00:00:00

  • A large database of chicken bursal ESTs as a resource for the analysis of vertebrate gene function.

    abstract::Chicken B cells create their immunoglobulin repertoire within the Bursa of Fabricius by gene conversion. The high homologous recombination activity is shared by the bursal B-cell-derived DT40 cell line, which integrates transfected DNA constructs at high rates into its endogenous loci. Targeted integration in DT40 is ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.10.12.2062

    authors: Abdrakhmanov I,Lodygin D,Geroth P,Arakawa H,Law A,Plachy J,Korn B,Buerstedde JM

    更新日期:2000-12-01 00:00:00

  • DIG-seq: a genome-wide CRISPR off-target profiling method using chromatin DNA.

    abstract::To investigate whether and how CRISPR-Cas9 on-target and off-target activities are affected by chromatin in eukaryotic cells, we first identified a series of identical endogenous DNA sequences present in both open and closed chromatin regions and then measured mutation frequencies at these sites in human cells using C...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.236620.118

    authors: Kim D,Kim JS

    更新日期:2018-12-01 00:00:00

  • A genome-wide study of dual coding regions in human alternatively spliced genes.

    abstract::Alternative splicing is a major mechanism for gene product regulation in many multicellular organisms. By using different exon combinations, some coding regions can encode amino acids in multiple reading frames in different transcripts. Here we performed a systematic search through a set of high-quality human transcri...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4246506

    authors: Liang H,Landweber LF

    更新日期:2006-02-01 00:00:00

  • Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci.

    abstract::The most widely appreciated role of DNA is to encode protein, yet the exact portion of the human genome that is translated remains to be ascertained. We previously developed PhyloCSF, a widely used tool to identify evolutionary signatures of protein-coding regions using multispecies genome alignments. Here, we present...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.246462.118

    authors: Mudge JM,Jungreis I,Hunt T,Gonzalez JM,Wright JC,Kay M,Davidson C,Fitzgerald S,Seal R,Tweedie S,He L,Waterhouse RM,Li Y,Bruford E,Choudhary JS,Frankish A,Kellis M

    更新日期:2019-12-01 00:00:00

  • Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee.

    abstract::To accelerate the molecular analysis of behavior in the honey bee (Apis mellifera), we created expressed sequence tag (EST) and cDNA microarray resources for the bee brain. Over 20,000 cDNA clones were partially sequenced from a normalized (and subsequently subtracted) library generated from adult A. mellifera brains....

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5302

    authors: Whitfield CW,Band MR,Bonaldo MF,Kumar CG,Liu L,Pardinas JR,Robertson HM,Soares MB,Robinson GE

    更新日期:2002-04-01 00:00:00

  • Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

    abstract::Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data al...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.213405.116

    authors: Zimin AV,Puiu D,Luo MC,Zhu T,Koren S,Marçais G,Yorke JA,Dvořák J,Salzberg SL

    更新日期:2017-05-01 00:00:00

  • A recombination hotspot leads to sequence variability within a novel gene (AK005651) and contributes to type 1 diabetes susceptibility.

    abstract::More than 25 loci have been linked to type 1 diabetes (T1D) in the nonobese diabetic (NOD) mouse, but identification of the underlying genes remains challenging. We describe here the positional cloning of a T1D susceptibility locus, Idd11, located on mouse chromosome 4. Sequence analysis of a series of congenic NOD mo...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.101881.109

    authors: Tan IK,Mackin L,Wang N,Papenfuss AT,Elso CM,Ashton MP,Quirk F,Phipson B,Bahlo M,Speed TP,Smyth GK,Morahan G,Brodnicki TC

    更新日期:2010-12-01 00:00:00

  • Integrated mapping, chromosomal sequencing and sequence analysis of Cryptosporidium parvum.

    abstract::The apicomplexan Cryptosporidium parvum is one of the most prevalent protozoan parasites of humans. We report the physical mapping of the genome of the Iowa isolate, sequencing and analysis of chromosome 6, and approximately 0.9 Mbp of sequence sampled from the remainder of the genome. To construct a robust physical m...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1555203

    authors: Bankier AT,Spriggs HF,Fartmann B,Konfortov BA,Madera M,Vogel C,Teichmann SA,Ivens A,Dear PH

    更新日期:2003-08-01 00:00:00

  • Efficient identification of Y chromosome sequences in the human and Drosophila genomes.

    abstract::Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unma...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.156034.113

    authors: Carvalho AB,Clark AG

    更新日期:2013-11-01 00:00:00

  • Genome-scale identification of cellular pathways required for cell surface recognition.

    abstract::Interactions mediated by cell surface receptors initiate important instructive signaling cues but can be difficult to detect in biochemical assays because they are often highly transient and membrane-embedded receptors are difficult to solubilize in their native conformation. Here, we address these biochemical challen...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.231183.117

    authors: Sharma S,Bartholdson SJ,Couch ACM,Yusa K,Wright GJ

    更新日期:2018-09-01 00:00:00

  • A transposon-based strategy for sequencing repetitive DNA in eukaryotic genomes.

    abstract::Repetitive DNA is a significant component of eukaryotic genomes. We have developed a strategy to efficiently and accurately sequence repetitive DNA in the nematode Caenorhabditis elegans using integrated artificial transposons and automated fluorescent sequencing. Mapping and assembly tools represent important compone...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.7.5.551

    authors: Devine SE,Chissoe SL,Eby Y,Wilson RK,Boeke JD

    更新日期:1997-05-01 00:00:00

  • A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae.

    abstract::Recent computational and experimental work suggests that functional modules underlie much of cellular physiology and are a useful unit of cellular organization from the perspective of systems biology. Because interactions among modules can give rise to higher-level properties that are essential to cellular function, a...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3847105

    authors: Petti AA,Church GM

    更新日期:2005-09-01 00:00:00

  • Parente2: a fast and accurate method for detecting identity by descent.

    abstract::Identity-by-descent (IBD) inference is the problem of establishing a genetic connection between two individuals through a genomic segment that is inherited by both individuals from a recent common ancestor. IBD inference is an important preceding step in a variety of population genomic studies, ranging from demographi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.173641.114

    authors: Rodriguez JM,Bercovici S,Huang L,Frostig R,Batzoglou S

    更新日期:2015-02-01 00:00:00

  • Improved discovery of genetic interactions using CRISPRiSeq across multiple environments.

    abstract::Large-scale genetic interaction (GI) screens in yeast have been invaluable for our understanding of molecular systems biology and for characterizing novel gene function. Owing in part to the high costs and long experiment times required, a preponderance of GI data has been generated in a single environmental condition...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.246603.118

    authors: Jaffe M,Dziulko A,Smith JD,St Onge RP,Levy SF,Sherlock G

    更新日期:2019-04-01 00:00:00

  • A fine scale phenotype-genotype virulence map of a bacterial pathogen.

    abstract::A large fraction of the genes from sequenced organisms are of unknown function. This limits biological insight, and for pathogenic microorganisms hampers the development of new approaches to battle infections. There is thus a great need for novel strategies that link genotypes to phenotypes for microorganisms. We desc...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.137430.112

    authors: van Opijnen T,Camilli A

    更新日期:2012-12-01 00:00:00

  • Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors.

    abstract::Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We ident...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.139105.112

    authors: Wang J,Zhuang J,Iyer S,Lin X,Whitfield TW,Greven MC,Pierce BG,Dong X,Kundaje A,Cheng Y,Rando OJ,Birney E,Myers RM,Noble WS,Snyder M,Weng Z

    更新日期:2012-09-01 00:00:00

  • The evolution of sex-biased gene expression in the Drosophila brain.

    abstract::Genes with sex-biased expression in Drosophila are thought to underlie sexually dimorphic phenotypes and have been shown to possess unique evolutionary properties. However, the forces and constraints governing the evolution of sex-biased genes in the somatic tissues of Drosophila are largely unknown. By using populati...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.259069.119

    authors: Khodursky S,Svetec N,Durkin SM,Zhao L

    更新日期:2020-06-01 00:00:00

  • Adenoviral vectors expressing siRNAs for discovery and validation of gene function.

    abstract::RNA interference is a powerful tool for studying gene function and for drug target discovery in diverse organisms and cell types. In mammalian systems, small interfering RNAs (siRNAs), or DNA plasmids expressing these siRNAs, have been used to down-modulate gene expression. However, inefficient transfection protocols,...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1332603

    authors: Arts GJ,Langemeijer E,Tissingh R,Ma L,Pavliska H,Dokic K,Dooijes R,Mesić E,Clasen R,Michiels F,van der Schueren J,Lambrecht M,Herman S,Brys R,Thys K,Hoffmann M,Tomme P,van Es H

    更新日期:2003-10-01 00:00:00