A matter of life or death: how microsatellites emerge in and vanish from the human genome.

Abstract:

:Microsatellites--tandem repeats of short DNA motifs--are abundant in the human genome and have high mutation rates. While microsatellite instability is implicated in numerous genetic diseases, the molecular processes involved in their emergence and disappearance are still not well understood. Microsatellites are hypothesized to follow a life cycle, wherein they are born and expand into adulthood, until their degradation and death. Here we identified microsatellite births/deaths in human, chimpanzee, and orangutan genomes, using macaque and marmoset as outgroups. We inferred mutations causing births/deaths based on parsimony, and investigated local genomic environments affecting them. We also studied birth/death patterns within transposable elements (Alus and L1s), coding regions, and disease-associated loci. We observed that substitutions were the predominant cause for births of short microsatellites, while insertions and deletions were important for births of longer microsatellites. Substitutions were the cause for deaths of microsatellites of virtually all lengths. AT-rich L1 sequences exhibited elevated frequency of births/deaths over their entire length, while GC-rich Alus only in their 3' poly(A) tails and middle A-stretches, with differences depending on transposable element integration timing. Births/deaths were strongly selected against in coding regions. Births/deaths occurred in genomic regions with high substitution rates, protomicrosatellite content, and L1 density, but low GC content and Alu density. The majority of the 17 disease-associated microsatellites examined are evolutionarily ancient (were acquired by the common ancestor of simians). Our genome-wide investigation of microsatellite life cycle has fundamental applications for predicting the susceptibility of birth/death of microsatellites, including many disease-causing loci.

journal_name

Genome Res

journal_title

Genome research

authors

Kelkar YD,Eckert KA,Chiaromonte F,Makova KD

doi

10.1101/gr.122937.111

subject

Has Abstract

pub_date

2011-12-01 00:00:00

pages

2038-48

issue

12

eissn

1088-9051

issn

1549-5469

pii

gr.122937.111

journal_volume

21

pub_type

杂志文章
  • Molecular cloning and RARE cleavage mapping of human 2p, 6q, 8q, 12q, and 18q telomeres.

    abstract::Large terminal fragments of human chromosomes 2p, 6p, 8q, 12q, and 18q were cloned using yeast artificial chromosomes (YACs). RecA-assisted restriction endonuclease (RARE) cleavage analysis of genomic DNA samples from II unrelated individuals using YAC-derived probes confirmed the telomeric localizations of the half-Y...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5.3.225

    authors: Macina RA,Morii K,Hu XL,Negorev DG,Spais C,Ruthig LA,Riethman HC

    更新日期:1995-10-01 00:00:00

  • Strategies for mutational analysis of the large multiexon ATM gene using high-density oligonucleotide arrays.

    abstract::Mutational analysis of large genes with complex genomic structures plays an important role in medical genetics. Technical limitations associated with current mutation screening protocols have placed increased emphasis on the development of new technologies to simplify these procedures. High-density arrays of >90,000-o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.12.1245

    authors: Hacia JG,Sun B,Hunt N,Edgemon K,Mosbrook D,Robbins C,Fodor SP,Tagle DA,Collins FS

    更新日期:1998-12-01 00:00:00

  • Signatures of domain shuffling in the human genome.

    abstract::To elucidate the role of exon shuffling in shaping the complexity of the human genome/proteome, we have systematically analyzed intron phase distributions in the coding sequence of human protein domains. We found that introns at the boundaries of domains show high excess of symmetrical phase combinations (i.e., 0-0, 1...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.520702

    authors: Kaessmann H,Zöllner S,Nekrutenko A,Li WH

    更新日期:2002-11-01 00:00:00

  • Global analysis of Drosophila Cys₂-His₂ zinc finger proteins reveals a multitude of novel recognition motifs and binding determinants.

    abstract::Cys2-His2 zinc finger proteins (ZFPs) are the largest group of transcription factors in higher metazoans. A complete characterization of these ZFPs and their associated target sequences is pivotal to fully annotate transcriptional regulatory networks in metazoan genomes. As a first step in this process, we have charac...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.151472.112

    authors: Enuameh MS,Asriyan Y,Richards A,Christensen RG,Hall VL,Kazemian M,Zhu C,Pham H,Cheng Q,Blatti C,Brasefield JA,Basciotta MD,Ou J,McNulty JC,Zhu LJ,Celniker SE,Sinha S,Stormo GD,Brodsky MH,Wolfe SA

    更新日期:2013-06-01 00:00:00

  • Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee.

    abstract::To accelerate the molecular analysis of behavior in the honey bee (Apis mellifera), we created expressed sequence tag (EST) and cDNA microarray resources for the bee brain. Over 20,000 cDNA clones were partially sequenced from a normalized (and subsequently subtracted) library generated from adult A. mellifera brains....

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5302

    authors: Whitfield CW,Band MR,Bonaldo MF,Kumar CG,Liu L,Pardinas JR,Robertson HM,Soares MB,Robinson GE

    更新日期:2002-04-01 00:00:00

  • Inferring tumor progression from genomic heterogeneity.

    abstract::Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.099622.109

    authors: Navin N,Krasnitz A,Rodgers L,Cook K,Meth J,Kendall J,Riggs M,Eberling Y,Troge J,Grubor V,Levy D,Lundin P,Månér S,Zetterberg A,Hicks J,Wigler M

    更新日期:2010-01-01 00:00:00

  • Properties of overlapping genes are conserved across microbial genomes.

    abstract::There are numerous examples from the genomes of viruses, mitochondria, and chromosomes that adjacent genes can overlap, sharing at least one nucleotide. Overlaps have been hypothesized to be involved in genome size minimization and as a regulatory mechanism of gene expression. Here we show that overlapping genes are a...

    journal_title:Genome research

    pub_type: 信件

    doi:10.1101/gr.2433104

    authors: Johnson ZI,Chisholm SW

    更新日期:2004-11-01 00:00:00

  • CENPT bridges adjacent CENPA nucleosomes on young human α-satellite dimers.

    abstract::Nucleosomes containing the CenH3 (CENPA or CENP-A) histone variant replace H3 nucleosomes at centromeres to provide a foundation for kinetochore assembly. CENPA nucleosomes are part of the constitutive centromere associated network (CCAN) that forms the inner kinetochore on which outer kinetochore proteins assemble. T...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.204784.116

    authors: Thakur J,Henikoff S

    更新日期:2016-09-01 00:00:00

  • A comprehensive survey of 3' animal miRNA modification events and a possible role for 3' adenylation in modulating miRNA targeting effectiveness.

    abstract::Animal microRNA sequences are subject to 3' nucleotide addition. Through detailed analysis of deep-sequenced short RNA data sets, we show adenylation and uridylation of miRNA is globally present and conserved across Drosophila and vertebrates. To better understand 3' adenylation function, we deep-sequenced RNA after k...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106054.110

    authors: Burroughs AM,Ando Y,de Hoon MJ,Tomaru Y,Nishibu T,Ukekawa R,Funakoshi T,Kurokawa T,Suzuki H,Hayashizaki Y,Daub CO

    更新日期:2010-10-01 00:00:00

  • Uncovering cis-regulatory sequence requirements for context-specific transcription factor binding.

    abstract::The regulation of gene expression is mediated at the transcriptional level by enhancer regions that are bound by sequence-specific transcription factors (TFs). Recent studies have shown that the in vivo binding sites of single TFs differ between developmental or cellular contexts. How this context-specific binding is ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.132811.111

    authors: Yáñez-Cuna JO,Dinh HQ,Kvon EZ,Shlyueva D,Stark A

    更新日期:2012-10-01 00:00:00

  • Preference of DNA methyltransferases for CpG islands in mouse embryonic stem cells.

    abstract::Many CpG islands have tissue-dependent and differentially methylated regions (T-DMRs) in normal cells and tissues. To elucidate how DNA methyltransferases (Dnmts) participate in methylation of the genomic components, we investigated the genome-wide DNA methylation pattern of the T-DMRs with Dnmt1-, Dnmt3a-, and/or Dnm...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2431504

    authors: Hattori N,Abe T,Hattori N,Suzuki M,Matsuyama T,Yoshida S,Li E,Shiota K

    更新日期:2004-09-01 00:00:00

  • Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

    abstract::Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of inter...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.136739.111

    authors: Schadt EE,Banerjee O,Fang G,Feng Z,Wong WH,Zhang X,Kislyuk A,Clark TA,Luong K,Keren-Paz A,Chess A,Kumar V,Chen-Plotkin A,Sondheimer N,Korlach J,Kasarskis A

    更新日期:2013-01-01 00:00:00

  • Copy number and targeted mutational analysis reveals novel somatic events in metastatic prostate tumors.

    abstract::Advanced prostate cancer can progress to systemic metastatic tumors, which are generally androgen insensitive and ultimately lethal. Here, we report a comprehensive genomic survey for somatic events in systemic metastatic prostate tumors using both high-resolution copy number analysis and targeted mutational survey of...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.107961.110

    authors: Robbins CM,Tembe WA,Baker A,Sinari S,Moses TY,Beckstrom-Sternberg S,Beckstrom-Sternberg J,Barrett M,Long J,Chinnaiyan A,Lowey J,Suh E,Pearson JV,Craig DW,Agus DB,Pienta KJ,Carpten JD

    更新日期:2011-01-01 00:00:00

  • Biological data sciences in genome research.

    abstract::The last 20 years have been a remarkable era for biology and medicine. One of the most significant achievements has been the sequencing of the first human genomes, which has laid the foundation for profound insights into human genetics, the intricacies of regulation and development, and the forces of evolution. Incred...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.191684.115

    authors: Schatz MC

    更新日期:2015-10-01 00:00:00

  • The evolution of sex-biased gene expression in the Drosophila brain.

    abstract::Genes with sex-biased expression in Drosophila are thought to underlie sexually dimorphic phenotypes and have been shown to possess unique evolutionary properties. However, the forces and constraints governing the evolution of sex-biased genes in the somatic tissues of Drosophila are largely unknown. By using populati...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.259069.119

    authors: Khodursky S,Svetec N,Durkin SM,Zhao L

    更新日期:2020-06-01 00:00:00

  • Genome-scale identification of cellular pathways required for cell surface recognition.

    abstract::Interactions mediated by cell surface receptors initiate important instructive signaling cues but can be difficult to detect in biochemical assays because they are often highly transient and membrane-embedded receptors are difficult to solubilize in their native conformation. Here, we address these biochemical challen...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.231183.117

    authors: Sharma S,Bartholdson SJ,Couch ACM,Yusa K,Wright GJ

    更新日期:2018-09-01 00:00:00

  • The Release 6 reference sequence of the Drosophila melanogaster genome.

    abstract::Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and co...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.185579.114

    authors: Hoskins RA,Carlson JW,Wan KH,Park S,Mendez I,Galle SE,Booth BW,Pfeiffer BD,George RA,Svirskas R,Krzywinski M,Schein J,Accardo MC,Damia E,Messina G,Méndez-Lago M,de Pablos B,Demakova OV,Andreyeva EN,Boldyreva LV,Ma

    更新日期:2015-03-01 00:00:00

  • The identification and functional annotation of RNA structures conserved in vertebrates.

    abstract::Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than seq...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.208652.116

    authors: Seemann SE,Mirza AH,Hansen C,Bang-Berthelsen CH,Garde C,Christensen-Dalsgaard M,Torarinsson E,Yao Z,Workman CT,Pociot F,Nielsen H,Tommerup N,Ruzzo WL,Gorodkin J

    更新日期:2017-08-01 00:00:00

  • Copy number variation at the 7q11.23 segmental duplications is a susceptibility factor for the Williams-Beuren syndrome deletion.

    abstract::Large copy number variants (CNVs) have been recently found as structural polymorphisms of the human genome of still unknown biological significance. CNVs are significantly enriched in regions with segmental duplications or low-copy repeats (LCRs). Williams-Beuren syndrome (WBS) is a neurodevelopmental disorder caused ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.073197.107

    authors: Cuscó I,Corominas R,Bayés M,Flores R,Rivera-Brugués N,Campuzano V,Pérez-Jurado LA

    更新日期:2008-05-01 00:00:00

  • Modeling of epigenome dynamics identifies transcription factors that mediate Polycomb targeting.

    abstract::Although changes in chromatin are integral to transcriptional reprogramming during cellular differentiation, it is currently unclear how chromatin modifications are targeted to specific loci. To systematically identify transcription factors (TFs) that can direct chromatin changes during cell fate decisions, we model t...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.142661.112

    authors: Arnold P,Schöler A,Pachkov M,Balwierz PJ,Jørgensen H,Stadler MB,van Nimwegen E,Schübeler D

    更新日期:2013-01-01 00:00:00

  • Detecting genetic variation in microarray expression data.

    abstract::The use of high-density oligonucleotide arrays to measure the expression levels of thousands of genes in parallel has become commonplace. To take further advantage of the growing body of data, we developed a method, termed "GeSNP," to mine the detailed hybridization patterns in oligonucleotide array expression data fo...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6307307

    authors: Greenhall JA,Zapala MA,Cáceres M,Libiger O,Barlow C,Schork NJ,Lockhart DJ

    更新日期:2007-08-01 00:00:00

  • lobSTR: A short tandem repeat profiler for personal genomes.

    abstract::Short tandem repeats (STRs) have a wide range of applications, including medical genetics, forensics, and genetic genealogy. High-throughput sequencing (HTS) has the potential to profile hundreds of thousands of STR loci. However, mainstream bioinformatics pipelines are inadequate for the task. These pipelines treat S...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.135780.111

    authors: Gymrek M,Golan D,Rosset S,Erlich Y

    更新日期:2012-06-01 00:00:00

  • Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

    abstract::Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data al...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.213405.116

    authors: Zimin AV,Puiu D,Luo MC,Zhu T,Koren S,Marçais G,Yorke JA,Dvořák J,Salzberg SL

    更新日期:2017-05-01 00:00:00

  • Gene loss and movement in the maize genome.

    abstract::Maize (Zea mays L. ssp. mays), one of the most important agricultural crops in the world, originated by hybridization of two closely related progenitors. To investigate the fate of its genes after tetraploidization, we analyzed the sequence of five duplicated regions from different chromosomal locations. We also compa...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.2701104

    authors: Lai J,Ma J,Swigonová Z,Ramakrishna W,Linton E,Llaca V,Tanyolac B,Park YJ,Jeong OY,Bennetzen JL,Messing J

    更新日期:2004-10-01 00:00:00

  • Sequence features and chromatin structure around the genomic regions bound by 119 human transcription factors.

    abstract::Chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) has become the dominant technique for mapping transcription factor (TF) binding regions genome-wide. We performed an integrative analysis centered around 457 ChIP-seq data sets on 119 human TFs generated by the ENCODE Consortium. We ident...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.139105.112

    authors: Wang J,Zhuang J,Iyer S,Lin X,Whitfield TW,Greven MC,Pierce BG,Dong X,Kundaje A,Cheng Y,Rando OJ,Birney E,Myers RM,Noble WS,Snyder M,Weng Z

    更新日期:2012-09-01 00:00:00

  • A fine scale phenotype-genotype virulence map of a bacterial pathogen.

    abstract::A large fraction of the genes from sequenced organisms are of unknown function. This limits biological insight, and for pathogenic microorganisms hampers the development of new approaches to battle infections. There is thus a great need for novel strategies that link genotypes to phenotypes for microorganisms. We desc...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.137430.112

    authors: van Opijnen T,Camilli A

    更新日期:2012-12-01 00:00:00

  • Spatiotemporal clustering of the epigenome reveals rules of dynamic gene regulation.

    abstract::Spatial organization of different epigenomic marks was used to infer functions of the epigenome. It remains unclear what can be learned from the temporal changes of the epigenome. Here, we developed a probabilistic model to cluster genomic sequences based on the similarity of temporal changes of multiple epigenomic ma...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.144949.112

    authors: Yu P,Xiao S,Xin X,Song CX,Huang W,McDee D,Tanaka T,Wang T,He C,Zhong S

    更新日期:2013-02-01 00:00:00

  • Chromosomal instability mediated by non-B DNA: cruciform conformation and not DNA sequence is responsible for recurrent translocation in humans.

    abstract::Chromosomal aberrations have been thought to be random events. However, recent findings introduce a new paradigm in which certain DNA segments have the potential to adopt unusual conformations that lead to genomic instability and nonrandom chromosomal rearrangement. One of the best-studied examples is the palindromic ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.079244.108

    authors: Inagaki H,Ohye T,Kogo H,Kato T,Bolor H,Taniguchi M,Shaikh TH,Emanuel BS,Kurahashi H

    更新日期:2009-02-01 00:00:00

  • Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    abstract::Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.185892.114

    authors: Fungtammasan A,Ananda G,Hile SE,Su MS,Sun C,Harris R,Medvedev P,Eckert K,Makova KD

    更新日期:2015-05-01 00:00:00

  • A scalable high-throughput chemical synthesizer.

    abstract::A machine that employs a novel reagent delivery technique for biomolecular synthesis has been developed. This machine separates the addressing of individual synthesis sites from the actual process of reagent delivery by using masks placed over the sites. Because of this separation, this machine is both cost-effective ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.359002

    authors: Livesay EA,Liu YH,Luebke KJ,Irick J,Belosludtsev Y,Rayner S,Balog R,Johnston SA

    更新日期:2002-12-01 00:00:00