Abundance and length of simple repeats in vertebrate genomes are determined by their structural properties.

Abstract:

:Microsatellites are abundant in vertebrate genomes, but their sequence representation and length distributions vary greatly within each family of repeats (e.g., tetranucleotides). Biophysical studies of 82 synthetic single-stranded oligonucleotides comprising all tetra- and trinucleotide repeats revealed an inverse correlation between the stability of folded-back hairpin and quadruplex structures and the sequence representation for repeats > or =30 bp in length in nine vertebrate genomes. Alternatively, the predicted energies of base-stacking interactions correlated directly with the longest length distributions in vertebrate genomes. Genome-wide analyses indicated that unstable sequences, such as CAG:CTG and CCG:CGG, were over-represented in coding regions and that micro/minisatellites were recruited in genes involved in transcription and signaling pathways, particularly in the nervous system. Microsatellite instability (MSI) is a hallmark of cancer, and length polymorphism within genes can confer susceptibility to inherited disease. Sequences that manifest the highest MSI values also displayed the strongest base-stacking interactions; analyses of 62 tri- and tetranucleotide repeat-containing genes associated with human genetic disease revealed enrichments similar to those noted for micro/minisatellite-containing genes. We conclude that DNA structure and base-stacking determined the number and length distributions of microsatellite repeats in vertebrate genomes over evolutionary time and that micro/minisatellites have been recruited to participate in both gene and protein function.

journal_name

Genome Res

journal_title

Genome research

authors

Bacolla A,Larson JE,Collins JR,Li J,Milosavljevic A,Stenson PD,Cooper DN,Wells RD

doi

10.1101/gr.078303.108

subject

Has Abstract

pub_date

2008-10-01 00:00:00

pages

1545-53

issue

10

eissn

1088-9051

issn

1549-5469

pii

gr.078303.108

journal_volume

18

pub_type

杂志文章
  • The amphioxus genome illuminates vertebrate origins and cephalochordate biology.

    abstract::Cephalochordates, urochordates, and vertebrates evolved from a common ancestor over 520 million years ago. To improve our understanding of chordate evolution and the origin of vertebrates, we intensively searched for particular genes, gene families, and conserved noncoding elements in the sequenced genome of the cepha...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.073676.107

    authors: Holland LZ,Albalat R,Azumi K,Benito-Gutiérrez E,Blow MJ,Bronner-Fraser M,Brunet F,Butts T,Candiani S,Dishaw LJ,Ferrier DE,Garcia-Fernàndez J,Gibson-Brown JJ,Gissi C,Godzik A,Hallböök F,Hirose D,Hosomichi K,Ikuta T,I

    更新日期:2008-07-01 00:00:00

  • An analysis of the gene complement of a marsupial, Monodelphis domestica: evolution of lineage-specific genes and giant chromosomes.

    abstract::The newly sequenced genome of Monodelphis domestica not only provides the out-group necessary to better understand our own eutherian lineage, but it enables insights into the innovative biology of metatherians. Here, we compare Monodelphis with Homo sequences from alignments of single nucleotides, genes, and whole chr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6093907

    authors: Goodstadt L,Heger A,Webber C,Ponting CP

    更新日期:2007-07-01 00:00:00

  • Predicting deleterious amino acid substitutions.

    abstract::Many missense substitutions are identified in single nucleotide polymorphism (SNP) data and large-scale random mutagenesis projects. Each amino acid substitution potentially affects protein function. We have constructed a tool that uses sequence homology to predict whether a substitution affects protein function. SIFT...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.176601

    authors: Ng PC,Henikoff S

    更新日期:2001-05-01 00:00:00

  • Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data.

    abstract::A rigorous analysis of the Merck-sponsored EST data with respect to known gene sequences increases the utility of the data set and helps refine methods for building a gene index. A highly curated human transcript data base was used as a reference data set of known genes. A detailed analysis of EST sequences derived fr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.6.9.829

    authors: Aaronson JS,Eckman B,Blevins RA,Borkowski JA,Myerson J,Imran S,Elliston KO

    更新日期:1996-09-01 00:00:00

  • Schizosaccharomyces pombe essential genes: a pilot study.

    abstract::After completion of the Schizosaccharomyces pombe genome sequence, we have carried out a pilot gene deletion project to assess the feasibility of a genome-wide deletion project and to estimate the percentage of essential genes. Using a PCR-based gene deletion procedure, we investigated 100 genes within a 253-kb region...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.636103

    authors: Decottignies A,Sanchez-Perez I,Nurse P

    更新日期:2003-03-01 00:00:00

  • A pooling-based approach to mapping genetic variants associated with DNA methylation.

    abstract::DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.183749.114

    authors: Kaplow IM,MacIsaac JL,Mah SM,McEwen LM,Kobor MS,Fraser HB

    更新日期:2015-06-01 00:00:00

  • Mouse population-guided resequencing reveals that variants in CD44 contribute to acetaminophen-induced liver injury in humans.

    abstract::Interindividual variability in response to chemicals and drugs is a common regulatory concern. It is assumed that xenobiotic-induced adverse reactions have a strong genetic basis, but many mechanism-based investigations have not been successful in identifying susceptible individuals. While recent advances in pharmacog...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.090241.108

    authors: Harrill AH,Watkins PB,Su S,Ross PK,Harbourt DE,Stylianou IM,Boorman GA,Russo MW,Sackler RS,Harris SC,Smith PC,Tennant R,Bogue M,Paigen K,Harris C,Contractor T,Wiltshire T,Rusyn I,Threadgill DW

    更新日期:2009-09-01 00:00:00

  • Analysis of 5' junctions of human LINE-1 and Alu retrotransposons suggests an alternative model for 5'-end attachment requiring microhomology-mediated end-joining.

    abstract::Insertion of the human non-LTR retrotransposon LINE-1 (L1) into chromosomal DNA is thought to be initiated by a mechanism called target-primed reverse transcription (TPRT). This mechanism readily accounts for the attachment of the 3'-end of an L1 copy to the genomic target, but the subsequent integration steps leading...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3421505

    authors: Zingler N,Willhoeft U,Brose HP,Schoder V,Jahns T,Hanschmann KM,Morrish TA,Löwer J,Schumann GG

    更新日期:2005-06-01 00:00:00

  • Two large families of chemoreceptor genes in the nematodes Caenorhabditis elegans and Caenorhabditis briggsae reveal extensive gene duplication, diversification, movement, and intron loss.

    abstract::The str family of genes encoding seven-transmembrane G-protein-coupled or serpentine receptors related to the ODR-10 diacetyl chemoreceptor is very large, with at least 197 members in the Caenorhabditis elegans genome. The closely related stl family has 43 genes, and both families are distantly related to the srd fami...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.5.449

    authors: Robertson HM

    更新日期:1998-05-01 00:00:00

  • Closing the gaps on human chromosome 19 revealed genes with a high density of repetitive tandemly arrayed elements.

    abstract::The reported human genome sequence includes about 400 gaps of unknown sequence that were not found in the bacterial artificial chromosome (BAC) and cosmid libraries used for sequencing of the genome. These missing sequences correspond to approximately 1% of euchromatic regions of the human genome. Gap filling is a lab...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1929904

    authors: Leem SH,Kouprina N,Grimwood J,Kim JH,Mullokandov M,Yoon YH,Chae JY,Morgan J,Lucas S,Richardson P,Detter C,Glavina T,Rubin E,Barrett JC,Larionov V

    更新日期:2004-02-01 00:00:00

  • A scalable high-throughput chemical synthesizer.

    abstract::A machine that employs a novel reagent delivery technique for biomolecular synthesis has been developed. This machine separates the addressing of individual synthesis sites from the actual process of reagent delivery by using masks placed over the sites. Because of this separation, this machine is both cost-effective ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.359002

    authors: Livesay EA,Liu YH,Luebke KJ,Irick J,Belosludtsev Y,Rayner S,Balog R,Johnston SA

    更新日期:2002-12-01 00:00:00

  • The portability of tagSNPs across populations: a worldwide survey.

    abstract::In the search for common genetic variants that contribute to prevalent human diseases, patterns of linkage disequilibrium (LD) among linked markers should be considered when selecting SNPs. Genotyping efficiency can be increased by choosing tagging SNPs (tagSNPs) in LD with other SNPs. However, it remains to be seen w...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.4138406

    authors: González-Neira A,Ke X,Lao O,Calafell F,Navarro A,Comas D,Cann H,Bumpstead S,Ghori J,Hunt S,Deloukas P,Dunham I,Cardon LR,Bertranpetit J

    更新日期:2006-03-01 00:00:00

  • Detecting copy number variation with mated short reads.

    abstract::The development of high-throughput sequencing (HTS) technologies has opened the door to novel methods for detecting copy number variants (CNVs) in the human genome. While in the past CNVs have been detected based on array CGH data, recent studies have shown that depth-of-coverage information from HTS technologies can ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.106344.110

    authors: Medvedev P,Fiume M,Dzamba M,Smith T,Brudno M

    更新日期:2010-11-01 00:00:00

  • Gene expression profiling in human fetal liver and identification of tissue- and developmental-stage-specific genes through compiled expression profiles and efficient cloning of full-length cDNAs.

    abstract::Fetal liver intriguingly consists of hepatic parenchymal cells and hematopoietic stem/progenitor cells. Human fetal liver aged 22 wk of gestation (HFL22w) corresponds to the turning point between immigration and emigration of the hematopoietic system. To gain further molecular insight into its developmental and functi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.175501

    authors: Yu Y,Zhang C,Zhou G,Wu S,Qu X,Wei H,Xing G,Dong C,Zhai Y,Wan J,Ouyang S,Li L,Zhang S,Zhou K,Zhang Y,Wu C,He F

    更新日期:2001-08-01 00:00:00

  • Global survey of escape from X inactivation by RNA-sequencing in mouse.

    abstract::X inactivation equalizes the dosage of gene expression between the sexes, but some genes escape silencing and are thus expressed from both alleles in females. To survey X inactivation and escape in mouse, we performed RNA sequencing in Mus musculus x Mus spretus cells with complete skewing of X inactivation, relying o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.103200.109

    authors: Yang F,Babak T,Shendure J,Disteche CM

    更新日期:2010-05-01 00:00:00

  • Recent segmental duplications in the working draft assembly of the brown Norway rat.

    abstract::We assessed the content, structure, and distribution of segmental duplications (> or =90% sequence identity, > or =5 kb length) within the published version of the Rattus norvegicus genome assembly (v.3.1). The overall fraction of duplicated sequence within the rat assembly (2.92%) is greater than that of the mouse (1...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.1907504

    authors: Tuzun E,Bailey JA,Eichler EE

    更新日期:2004-04-01 00:00:00

  • Strategies for mutational analysis of the large multiexon ATM gene using high-density oligonucleotide arrays.

    abstract::Mutational analysis of large genes with complex genomic structures plays an important role in medical genetics. Technical limitations associated with current mutation screening protocols have placed increased emphasis on the development of new technologies to simplify these procedures. High-density arrays of >90,000-o...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.8.12.1245

    authors: Hacia JG,Sun B,Hunt N,Edgemon K,Mosbrook D,Robbins C,Fodor SP,Tagle DA,Collins FS

    更新日期:1998-12-01 00:00:00

  • Mutation scanning by meltMADGE: validations using BRCA1 and LDLR, and demonstration of the potential to identify severe, moderate, silent, rare, and paucimorphic mutations in the general population.

    abstract::We have developed a mutation-scanning approach suitable for whole population screening for unknown mutations. The method, meltMADGE, combines thermal ramp electrophoresis with MADGE to achieve suitable cost efficiency and throughput. The sensitivity was tested in blind trials using 54 amplicons representing the BRCA1 ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.3313405

    authors: Alharbi KK,Aldahmesh MA,Spanakis E,Haddad L,Whittall RA,Chen XH,Rassoulian H,Smith MJ,Sillibourne J,Ball NJ,Graham NJ,Briggs PJ,Simpson IA,Phillips DI,Lawlor DA,Ye S,Humphries SE,Cooper C,Smith GD,Ebrahim S,Eccles

    更新日期:2005-07-01 00:00:00

  • Polycomb preferentially targets stalled promoters of coding and noncoding transcripts.

    abstract::The Polycomb group (PcG) and Trithorax group (TrxG) of proteins are required for stable and heritable maintenance of repressed and active gene expression states. Their antagonistic function on gene control, repression for PcG and activity for TrxG, is mediated by binding to chromatin and subsequent epigenetic modifica...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.114348.110

    authors: Enderle D,Beisel C,Stadler MB,Gerstung M,Athri P,Paro R

    更新日期:2011-02-01 00:00:00

  • Coevolution within a transcriptional network by compensatory trans and cis mutations.

    abstract::Transcriptional networks have been shown to evolve very rapidly, prompting questions as to how such changes arise and are tolerated. Recent comparisons of transcriptional networks across species have implicated variations in the cis-acting DNA sequences near genes as the main cause of divergence. What is less clear is...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.111765.110

    authors: Kuo D,Licon K,Bandyopadhyay S,Chuang R,Luo C,Catalana J,Ravasi T,Tan K,Ideker T

    更新日期:2010-12-01 00:00:00

  • CG dinucleotides enhance promoter activity independent of DNA methylation.

    abstract::Most mammalian RNA polymerase II initiation events occur at CpG islands, which are rich in CpGs and devoid of DNA methylation. Despite their relevance for gene regulation, it is unknown to what extent the CpG dinucleotide itself actually contributes to promoter activity. To address this question, we determined the tra...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.241653.118

    authors: Hartl D,Krebs AR,Grand RS,Baubec T,Isbel L,Wirbelauer C,Burger L,Schübeler D

    更新日期:2019-04-01 00:00:00

  • Identification of protein features encoded by alternative exons using Exon Ontology.

    abstract::Transcriptomic genome-wide analyses demonstrate massive variation of alternative splicing in many physiological and pathological situations. One major challenge is now to establish the biological contribution of alternative splicing variation in physiological- or pathological-associated cellular phenotypes. Toward thi...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.212696.116

    authors: Tranchevent LC,Aubé F,Dulaurier L,Benoit-Pilven C,Rey A,Poret A,Chautard E,Mortada H,Desmet FO,Chakrama FZ,Moreno-Garcia MA,Goillot E,Janczarski S,Mortreux F,Bourgeois CF,Auboeuf D

    更新日期:2017-06-01 00:00:00

  • Mapping the pericentric heterochromatin by comparative genomic hybridization analysis and chromosome deletions in Drosophila melanogaster.

    abstract::Heterochromatin represents a significant portion of eukaryotic genomes and has essential structural and regulatory functions. Its molecular organization is largely unknown due to difficulties in sequencing through and assembling repetitive sequences enriched in the heterochromatin. Here we developed a novel strategy u...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.137406.112

    authors: He B,Caudy A,Parsons L,Rosebrock A,Pane A,Raj S,Wieschaus E

    更新日期:2012-12-01 00:00:00

  • Whole population, genome-wide mapping of hidden relatedness.

    abstract::We present GERMLINE, a robust algorithm for identifying segmental sharing indicative of recent common ancestry between pairs of individuals. Unlike methods with comparable objectives, GERMLINE scales linearly with the number of samples, enabling analysis of whole-genome data in large cohorts. Our approach is based on ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.081398.108

    authors: Gusev A,Lowe JK,Stoffel M,Daly MJ,Altshuler D,Breslow JL,Friedman JM,Pe'er I

    更新日期:2009-02-01 00:00:00

  • GrapeTree: visualization of core genomic relationships among 100,000 bacterial pathogens.

    abstract::Current methods struggle to reconstruct and visualize the genomic relationships of large numbers of bacterial genomes. GrapeTree facilitates the analyses of large numbers of allelic profiles by a static "GrapeTree Layout" algorithm that supports interactive visualizations of large trees within a web browser window. Gr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.232397.117

    authors: Zhou Z,Alikhan NF,Sergeant MJ,Luhmann N,Vaz C,Francisco AP,Carriço JA,Achtman M

    更新日期:2018-09-01 00:00:00

  • Domain regulation of imprinting cluster in Kip2/Lit1 subdomain on mouse chromosome 7F4/F5: large-scale DNA methylation analysis reveals that DMR-Lit1 is a putative imprinting control region.

    abstract::Mouse chromosome 7F4/F5, where the imprinting domain is located, is syntenic to human 11p15.5, the locus for Beckwith-Wiedemann syndrome. The domain is thought to consist of the two subdomains Kip2 (p57(kip2))/Lit1 and Igf2/H19. Because DNA methylation is believed to be a key factor in genomic imprinting, we performed...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.110702

    authors: Yatsuki H,Joh K,Higashimoto K,Soejima H,Arai Y,Wang Y,Hatada I,Obata Y,Morisaki H,Zhang Z,Nakagawachi T,Satoh Y,Mukai T

    更新日期:2002-12-01 00:00:00

  • SNP-based quantitative deconvolution of biological mixtures: application to the detection of cows with subclinical mastitis by whole-genome sequencing of tank milk.

    abstract::Biological products of importance in food (e.g., milk) and medical (e.g., donor blood-derived products) sciences often correspond to mixtures of samples contributed by multiple individuals. Identifying which individuals contributed to the mixture and in what proportions may be of interest in several circumstances. We ...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.256172.119

    authors: Coppieters W,Karim L,Georges M

    更新日期:2020-08-01 00:00:00

  • Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions.

    abstract::Transcription factors canonically bind nucleosome-free DNA, making the positioning of nucleosomes within regulatory regions crucial to the regulation of gene expression. Using the assay of transposase accessible chromatin (ATAC-seq), we observe a highly structured pattern of DNA fragment lengths and positions around n...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.192294.115

    authors: Schep AN,Buenrostro JD,Denny SK,Schwartz K,Sherlock G,Greenleaf WJ

    更新日期:2015-11-01 00:00:00

  • Sequence diversity and genomic organization of vomeronasal receptor genes in the mouse.

    abstract::The vomeronasal system of mice is thought to be specialized in the detection of pheromones. Two multigene families have been identified that encode proteins with seven putative transmembrane domains and that are expressed selectively in subsets of neurons of the vomeronasal organ. The products of these vomeronasal rec...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.10.12.1958

    authors: Del Punta K,Rothman A,Rodriguez I,Mombaerts P

    更新日期:2000-12-01 00:00:00

  • Genomic organization of the sex-determining and adjacent regions of the sex chromosomes of medaka.

    abstract::Sequencing of the human Y chromosome has uncovered the peculiarities of the genomic organization of a heterogametic sex chromosome of old evolutionary age, and has led to many insights into the evolutionary changes that occurred during its long history. We have studied the genomic organization of the medaka fish Y chr...

    journal_title:Genome research

    pub_type: 杂志文章

    doi:10.1101/gr.5016106

    authors: Kondo M,Hornung U,Nanda I,Imai S,Sasaki T,Shimizu A,Asakawa S,Hori H,Schmid M,Shimizu N,Schartl M

    更新日期:2006-07-01 00:00:00