Abstract:
:Short tandem repeats (STRs) have a wide range of applications, including medical genetics, forensics, and genetic genealogy. High-throughput sequencing (HTS) has the potential to profile hundreds of thousands of STR loci. However, mainstream bioinformatics pipelines are inadequate for the task. These pipelines treat STR mapping as gapped alignment, which results in cumbersome processing times and a biased sampling of STR alleles. Here, we present lobSTR, a novel method for profiling STRs in personal genomes. lobSTR harnesses concepts from signal processing and statistical learning to avoid gapped alignment and to address the specific noise patterns in STR calling. The speed and reliability of lobSTR exceed the performance of current mainstream algorithms for STR profiling. We validated lobSTR's accuracy by measuring its consistency in calling STRs from whole-genome sequencing of two biological replicates from the same individual, by tracing Mendelian inheritance patterns in STR alleles in whole-genome sequencing of a HapMap trio, and by comparing lobSTR results to traditional molecular techniques. Encouraged by the speed and accuracy of lobSTR, we used the algorithm to conduct a comprehensive survey of STR variations in a deeply sequenced personal genome. We traced the mutation dynamics of close to 100,000 STR loci and observed more than 50,000 STR variations in a single genome. lobSTR's implementation is an end-to-end solution. The package accepts raw sequencing reads and provides the user with the genotyping results. It is written in C/C++, includes multi-threading capabilities, and is compatible with the BAM format.
journal_name
Genome Resjournal_title
Genome researchauthors
Gymrek M,Golan D,Rosset S,Erlich Ydoi
10.1101/gr.135780.111subject
Has Abstractpub_date
2012-06-01 00:00:00pages
1154-62issue
6eissn
1088-9051issn
1549-5469pii
gr.135780.111journal_volume
22pub_type
杂志文章相关文献
GENOME RESEARCH文献大全abstract::With the genomic sequencing of Arabidopsis nearing completion and rice sequencing very much in its infancy, a key question is whether we can exploit the Arabidopsis sequence to identify candidate genes for traits in cereal crops using a map-based approach. This requires the existence of colinearity between the Arabido...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.9.9.825
更新日期:1999-09-01 00:00:00
abstract::Previously, we have described novel families of genes, warthog (wrt) and groundhog (grd), in Caenorhabditis elegans. They are related to Hedgehog (Hh) through the carboxy-terminal autoprocessing domain (called Hog or Hint). A comprehensive survey revealed 10 genes with Hog/Hint modules in C. elegans. Five of these are...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.9.10.909
更新日期:1999-10-01 00:00:00
abstract::Analysis procedures are needed to extract useful information from the large amount of gene expression data that is becoming available. This work describes a set of analytical tools and their application to yeast cell cycle data. The components of our approach are (1) a similarity measure that reduces the number of fal...
journal_title:Genome research
pub_type: 杂志文章,评审
doi:10.1101/gr.9.11.1106
更新日期:1999-11-01 00:00:00
abstract::The Atlantic herring is a model species for exploring the genetic basis for ecological adaptation, due to its huge population size and extremely low genetic differentiation at selectively neutral loci. However, such studies have so far been hampered because of a highly fragmented genome assembly. Here, we deliver a ch...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.253435.119
更新日期:2019-11-01 00:00:00
abstract::Primate pericentromeric regions recently have been shown to exhibit extraordinary evolutionary plasticity. In this paper we report an additional peculiar feature of these regions that we discovered while analyzing, by FISH, the evolutionary conservation of primate phylogenetic chromosome IX. If the position of the cen...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.9.12.1184
更新日期:1999-12-01 00:00:00
abstract::Gene amplification occurs in most solid tumors and is associated with poor prognosis. Amplification of 20q13.2 is common to several tumor types including breast cancer. The 1 Mb of sequence spanning the 20q13.2 breast cancer amplicon is one of the most exhaustively studied segments of the human genome. These studies h...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.gr1743r
更新日期:2001-06-01 00:00:00
abstract::Bidirectional promoters have received considerable attention because of their ability to regulate two downstream genes (divergent genes). They are also highly abundant, directing the transcription of approximately 11% of genes in the human genome. We categorized the presence of DNA sequence motifs, binding of transcri...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.5623407
更新日期:2007-06-01 00:00:00
abstract::Understanding the consequences of regulatory variation in the human genome remains a major challenge, with important implications for understanding gene regulation and interpreting the many disease-risk variants that fall outside of protein-coding regions. Here, we provide a direct window into the regulatory consequen...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.155192.113
更新日期:2014-01-01 00:00:00
abstract::Next-generation sequencing technologies have made it possible to sequence targeted regions of the human genome in hundreds of individuals. Deep sequencing represents a powerful approach for the discovery of the complete spectrum of DNA sequence variants in functionally important genomic intervals. Current methods for ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.100040.109
更新日期:2010-04-01 00:00:00
abstract::Translocations are known to affect the expression of genes at the breakpoints and, in the case of unbalanced translocations, alter the gene copy number. However, a comprehensive understanding of the functional impact of this class of variation is lacking. Here, we have studied the effect of balanced chromosomal rearra...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.103622.109
更新日期:2010-05-01 00:00:00
abstract::Genome-wide association studies (GWAS) are identifying genetic predisposition to various diseases. The 17q24.3 locus harbors the single nucleotide polymorphism (SNP) rs1859962 that is statistically associated with prostate cancer (PCa). It defines a 130-kb linkage disequilibrium (LD) block that lies in an ∼2-Mb gene d...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.135665.111
更新日期:2012-08-01 00:00:00
abstract::We used a combination of cDNA selection, exon amplification, and computational prediction from genomic sequence to isolate transcribed sequences from genomic DNA surrounding the familial Mediterranean fever (FMF) locus. Eighty-seven kb of genomic DNA around D16S3370, a marker showing a high degree of linkage disequili...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.8.11.1172
更新日期:1998-11-01 00:00:00
abstract::Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data al...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.213405.116
更新日期:2017-05-01 00:00:00
abstract::Double anal fin (Da) is a medaka with an autosomal semidominant mutation that causes mirror image duplication of the ventral region concentrating on the caudal region. The chromosomal location of the Da gene and its sequence have remained unknown. We constructed a medaka linkage map as a first step to approach positio...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.9.12.1277
更新日期:1999-12-01 00:00:00
abstract::A benchmark problem is described for the reconstruction and analysis of biochemical networks given sampled experimental data. The growth of the organisms is described in a bioreactor in which one substrate is fed into the reactor with a given feed rate and feed concentration. Measurements for some intracellular compon...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.1226004
更新日期:2004-09-01 00:00:00
abstract::The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd fami...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.8.5.449
更新日期:1998-05-01 00:00:00
abstract::The incorporation and creation of modified nucleobases in DNA have profound effects on genome function. We describe methods for mapping positions and local content of modified DNA nucleobases in genomic DNA. We combined in vitro nucleobase excision with massively parallel DNA sequencing (Excision-seq) to determine the...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.174052.114
更新日期:2014-09-01 00:00:00
abstract::The densities of transposable elements (TEs) in the human genome display substantial variation both within individual chromosomes and among chromosome types (autosomes and the two sex chromosomes). Finding an explanation for this variability has been challenging, especially in light of genome landscapes unique to the ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.099044.109
更新日期:2010-05-01 00:00:00
abstract::Somatic transposon expression in neural tissue is commonly considered as a measure of mobilization and has therefore been linked to neuropathology and organismal individuality. We combined genome sequencing data with single-cell mRNA sequencing of the same inbred fly strain to map transposon expression in the Drosophi...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.259200.119
更新日期:2020-11-01 00:00:00
abstract::The gastrointestinal microbiome undergoes shifts in species and strain abundances, yet dynamics involving closely related microorganisms remain largely unknown because most methods cannot resolve them. We developed new metagenomic methods and utilized them to track species and strain level variations in microbial comm...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.142315.112
更新日期:2013-01-01 00:00:00
abstract::We report on the development of a methylation analysis workflow for optical detection of fluorescent methylation profiles along chromosomal DNA molecules. In combination with Bionano Genomics genome mapping technology, these profiles provide a hybrid genetic/epigenetic genome-wide map composed of DNA molecules spannin...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.240739.118
更新日期:2019-04-01 00:00:00
abstract::Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencin...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.176586.114
更新日期:2014-12-01 00:00:00
abstract::Mammalian genomes are partitioned into domains that replicate in a defined temporal order. These domains can replicate at similar times in all cell types (constitutive) or at cell type-specific times (developmental). Genome-wide chromatin conformation capture (Hi-C) has revealed sub-megabase topologically associating ...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.183699.114
更新日期:2015-08-01 00:00:00
abstract::Homeotic (Hox) genes are usually clustered and arranged in the same order as they are expressed along the anteroposterior body axis of metazoans. The mechanistic explanation for this colinearity has been elusive, and it may well be that a single and universal cause does not exist. The Hox-gene complex (HOM-C) has been...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.3468605
更新日期:2005-05-01 00:00:00
abstract::The main objectives of the study reported here were to construct a molecular map of wild emmer wheat, Triticum dicoccoides, to characterize the marker-related anatomy of the genome, and to evaluate segregation and recombination patterns upon crossing T. dicoccoides with its domesticated descendant Triticum durum (cult...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.150300
更新日期:2000-10-01 00:00:00
abstract::Dual channel imaging and warping of two-dimensional (2D) protein gels were used to visualize global changes of the gene expression patterns in growing Bacillus subtilis cells during entry into the stationary phase as triggered by glucose exhaustion. The 2D gels only depict single moments during the cells' growth cycle...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.905003
更新日期:2003-02-01 00:00:00
abstract::Changes in gene order between the genomes of two related yeast species, Saccharomyces cerevisiae and Saccharomyces bayanus var. uvarum were studied. From the dataset of a previous low coverage sequencing of the S. bayanus var. uvarum genome, 35 different synteny breakpoints between neighboring genes and two cases of l...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.212701
更新日期:2001-12-01 00:00:00
abstract::In mammals, genome-wide chromatin maps and immunofluorescence studies show that broad domains of repressive histone modifications are present on pericentromeric and telomeric repeats and on the inactive X chromosome. However, only a few autosomal loci such as silent Hox gene clusters have been shown to lie in broad do...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.080861.108
更新日期:2009-02-01 00:00:00
abstract::Using AP-PCR-based DNA profiling we examined some structural features of B chromosomes from yellow-necked mice Apodemus flavicollis. Mice harboring one, two, or three or lacking B chromosomes were examined. Chromosomal structure was scanned for variant bands by using a series of arbitrary primers and from these, infor...
journal_title:Genome research
pub_type: 杂志文章
doi:
更新日期:2000-01-01 00:00:00
abstract::Natural killer (NK) cells are innate lymphocytes important for early host defense against infectious pathogens and surveillance against malignant transformation. Resting murine NK cells regulate the translation of effector molecule mRNAs (e.g., granzyme B, GzmB) through unclear molecular mechanisms. MicroRNAs (miRNAs)...
journal_title:Genome research
pub_type: 杂志文章
doi:10.1101/gr.107995.110
更新日期:2010-11-01 00:00:00