Computational and experimental identification of mirtrons in Drosophila melanogaster and Caenorhabditis elegans.

Abstract:

Mirtrons are intronic hairpin substrates of the dicing machinery that generate functional microRNAs. In this study, we describe experimental assays that defined the essential requirements for entry of introns into the mirtron pathway. These data informed a bioinformatic screen that effectively identified functional mirtrons from the Drosophila melanogaster transcriptome. These included 17 known and six confident novel mirtrons among the top 51 candidates, and additional candidates had limited read evidence in available small RNA data. Our computational model also proved effective on Caenorhabditis elegans, for which the identification of 14 cloned mirtrons among the top 22 candidates more than tripled the number of validated mirtrons in this species. A few low-scoring introns generated mirtron-like read patterns from atypical RNA structures, but their paucity suggests that relatively few such loci were not captured by our model. Unexpectedly, we uncovered examples of clustered mirtrons in both fly and worm genomes, including a <8-kb region in C. elegans harboring eight distinct mirtrons. Altogether, we demonstrate that discovery of functional mirtrons, unlike canonical miRNAs, is amenable to computational methods independent of evolutionary constraint.

journal_name

Genome Res

journal_title

Genome research

authors

Chung WJ,Agius P,Westholm JO,Chen M,Okamura K,Robine N,Leslie CS,Lai EC

doi

10.1101/gr.113050.110

subject

Has Abstract

pub_date

2011-02-01 00:00:00

pages

286-300

issue

2

eissn

1549-5469

issn

1088-9051

pii

gr.113050.110

journal_volume

21

pub_type

Journal Article
  • Gene loss and movement in the maize genome.

    abstract::Maize (Zea mays L. ssp. mays), one of the most important agricultural crops in the world, originated by hybridization of two closely related progenitors. To investigate the fate of its genes after tetraploidization, we analyzed the sequence of five duplicated regions from different chromosomal locations. We also compa...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2701104

    authors: Lai J,Ma J,Swigonová Z,Ramakrishna W,Linton E,Llaca V,Tanyolac B,Park YJ,Jeong OY,Bennetzen JL,Messing J

    update_date:2004-10-01 00:00:00

  • Sensitive mapping of recombination hotspots using sequencing-based detection of ssDNA.

    abstract::Meiotic DNA double-stranded breaks (DSBs) initiate genetic recombination in discrete areas of the genome called recombination hotspots. DSBs can be directly mapped using chromatin immunoprecipitation followed by sequencing (ChIP-seq). Nevertheless, the genome-wide mapping of recombination hotspots in mammals is still ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.130583.111

    authors: Khil PP,Smagulova F,Brick KM,Camerini-Otero RD,Petukhova GV

    update_date:2012-05-01 00:00:00

  • A network of transcriptionally coordinated functional modules in Saccharomyces cerevisiae.

    abstract::Recent computational and experimental work suggests that functional modules underlie much of cellular physiology and are a useful unit of cellular organization from the perspective of systems biology. Because interactions among modules can give rise to higher-level properties that are essential to cellular function, a...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3847105

    authors: Petti AA,Church GM

    update_date:2005-09-01 00:00:00

  • Why do human diversity levels vary at a megabase scale?

    abstract::Levels of diversity vary across the human genome. This variation is caused by two forces: differences in mutation rates and the differential impact of natural selection. Pertinent to the question of the relative importance of these two forces is the observation that both diversity within species and interspecies diver...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3461105

    authors: Hellmann I,Prüfer K,Ji H,Zody MC,Pääbo S,Ptak SE

    update_date:2005-09-01 00:00:00

  • Evolutionary constraints in conserved nongenic sequences of mammals.

    abstract::Mammalian genomes contain many highly conserved nongenic sequences (CNGs) whose functional significance is poorly understood. Sets of CNGs have previously been identified by selecting the most conserved elements from a chromosome or genome, but in these highly selected samples, conservation may be unrelated to purifyi...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3942005

    authors: Keightley PD,Kryukov GV,Sunyaev S,Halligan DL,Gaffney DJ

    update_date:2005-10-01 00:00:00

  • A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion.

    abstract::Organisms with large genomes contain vast amounts of repetitive DNA sequences, much of which is composed of retrotransposons. Amplification of retrotransposons has been postulated to be a major mechanism increasing genome size and leading to "genomic obesity." To gain insights into the relation between retrotransposon...

    journal_title:Genome research

    pub_type: Comment,Journal Article

    doi:10.1101/gr.10.7.908

    authors: Shirasu K,Schulman AH,Lahaye T,Schulze-Lefert P

    update_date:2000-07-01 00:00:00

  • Computational identification of operons in microbial genomes.

    abstract::By applying graph representations to biochemical pathways, a new computational pipeline is proposed to find potential operons in microbial genomes. The algorithm relies on the fact that enzyme genes in operons tend to catalyze successive reactions in metabolic pathways. We applied this algorithm to 42 microbial genome...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.200602

    authors: Zheng Y,Szustakowski JD,Fortnow L,Roberts RJ,Kasif S

    update_date:2002-08-01 00:00:00

  • Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms.

    abstract::Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous pr...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4759706

    authors: Zhang W,Qi W,Albert TJ,Motiwala AS,Alland D,Hyytia-Trees EK,Ribot EM,Fields PI,Whittam TS,Swaminathan B

    update_date:2006-06-01 00:00:00

  • The amphioxus genome illuminates vertebrate origins and cephalochordate biology.

    abstract::Cephalochordates, urochordates, and vertebrates evolved from a common ancestor over 520 million years ago. To improve our understanding of chordate evolution and the origin of vertebrates, we intensively searched for particular genes, gene families, and conserved noncoding elements in the sequenced genome of the cepha...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.073676.107

    authors: Holland LZ,Albalat R,Azumi K,Benito-Gutiérrez E,Blow MJ,Bronner-Fraser M,Brunet F,Butts T,Candiani S,Dishaw LJ,Ferrier DE,Garcia-Fernàndez J,Gibson-Brown JJ,Gissi C,Godzik A,Hallböök F,Hirose D,Hosomichi K,Ikuta T,I

    update_date:2008-07-01 00:00:00

  • Molecular genetic maps in wild emmer wheat, Triticum dicoccoides: genome-wide coverage, massive negative interference, and putative quasi-linkage.

    abstract::The main objectives of the study reported here were to construct a molecular map of wild emmer wheat, Triticum dicoccoides, to characterize the marker-related anatomy of the genome, and to evaluate segregation and recombination patterns upon crossing T. dicoccoides with its domesticated descendant Triticum durum (cult...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.150300

    authors: Peng J,Korol AB,Fahima T,Röder MS,Ronin YI,Li YC,Nevo E

    update_date:2000-10-01 00:00:00

  • The Ensembl automatic gene annotation system.

    abstract::As more genomes are sequenced, there is an increasing need for automated first-pass annotation which allows timely access to important genomic information. The Ensembl gene-building system enables fast automated annotation of eukaryotic genomes. It annotates genes based on evidence derived from known protein, cDNA, an...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.1858004

    authors: Curwen V,Eyras E,Andrews TD,Clarke L,Mongin E,Searle SM,Clamp M

    update_date:2004-05-01 00:00:00

  • Comparing genomes within the species Mycobacterium tuberculosis.

    abstract::The study of genetic variability within natural populations of pathogens may provide insight into their evolution and pathogenesis. We used a Mycobacterium tuberculosis high-density oligonucleotide microarray to detect small-scale genomic deletions among 19 clinically and epidemiologically well-characterized isolates ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.166401

    authors: Kato-Maeda M,Rhee JT,Gingeras TR,Salamon H,Drenkow J,Smittipat N,Small PM

    update_date:2001-04-01 00:00:00

  • Comparative sequence analyses reveal rapid and divergent evolutionary changes of the WFDC locus in the primate lineage.

    abstract::The initial comparison of the human and chimpanzee genome sequences revealed 16 genomic regions with an unusually high density of rapidly evolving genes. One such region is the whey acidic protein (WAP) four-disulfide core domain locus (or WFDC locus), which contains 14 WFDC genes organized in two subloci on human chr...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.6004607

    authors: Hurle B,Swanson W,NISC Comparative Sequencing Program.,Green ED

    update_date:2007-03-01 00:00:00

  • Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

    abstract::Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data al...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.213405.116

    authors: Zimin AV,Puiu D,Luo MC,Zhu T,Koren S,Marçais G,Yorke JA,Dvořák J,Salzberg SL

    update_date:2017-05-01 00:00:00

  • De novo rates and selection of large copy number variation.

    abstract::While copy number variation (CNV) is an active area of research, de novo mutation rates within human populations are not well characterized. By focusing on large (>100 kbp) events, we estimate the rate of de novo CNV formation in humans by analyzing 4394 transmissions from human pedigrees with and without neurocogniti...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.107680.110

    authors: Itsara A,Wu H,Smith JD,Nickerson DA,Romieu I,London SJ,Eichler EE

    update_date:2010-11-01 00:00:00

  • Closing the gaps on human chromosome 19 revealed genes with a high density of repetitive tandemly arrayed elements.

    abstract::The reported human genome sequence includes about 400 gaps of unknown sequence that were not found in the bacterial artificial chromosome (BAC) and cosmid libraries used for sequencing of the genome. These missing sequences correspond to approximately 1% of euchromatic regions of the human genome. Gap filling is a lab...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.1929904

    authors: Leem SH,Kouprina N,Grimwood J,Kim JH,Mullokandov M,Yoon YH,Chae JY,Morgan J,Lucas S,Richardson P,Detter C,Glavina T,Rubin E,Barrett JC,Larionov V

    update_date:2004-02-01 00:00:00

  • A first-generation whole genome-radiation hybrid map spanning the mouse genome.

    abstract::We have assembled a first-generation anchor map of the mouse genome using a panel of 94 whole-genome-radiation hybrids (WG-RHs) and 271 sequence-tagged sites (STSs). This is the first genome-wide RH anchor map of a model organism. All of the STSs have been previously localized on the genetic map and are located 8.8 Mb...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.7.12.1153

    authors: McCarthy LC,Terrett J,Davis ME,Knights CJ,Smith AL,Critcher R,Schmitt K,Hudson J,Spurr NK,Goodfellow PN

    update_date:1997-12-01 00:00:00

  • Genetically indistinguishable SNPs and their influence on inferring the location of disease-associated variants.

    abstract::As part of a recent high-density linkage disequilibrium (LD) study of chromosome 20, we obtained genotypes for approximately 30,000 SNPs at a density of 1 SNP/2 kb on four different population samples (47 CEPH founders; 91 UK unrelateds [unrelated white individuals of western European ancestry]; 97 African Americans; ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.4217605

    authors: Lawrence R,Evans DM,Morris AP,Ke X,Hunt S,Paolucci M,Ragoussis J,Deloukas P,Bentley D,Cardon LR

    update_date:2005-11-01 00:00:00

  • Two breakpoint clusters at fragile site FRA3B form phased nucleosomes.

    abstract::Fragile sites are gaps and breaks in metaphase chromosomes generated by specific culture conditions. Fragile site FRA3B is the most unstable site and is directly involved in the breakpoints of deletion and translocation in a wide spectrum of cancers. To learn about the general characteristics of common fragile sites, ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.2304404

    authors: Mulvihill DJ,Wang YH

    update_date:2004-07-01 00:00:00

  • Gene expression profiling in human fetal liver and identification of tissue- and developmental-stage-specific genes through compiled expression profiles and efficient cloning of full-length cDNAs.

    abstract::Fetal liver intriguingly consists of hepatic parenchymal cells and hematopoietic stem/progenitor cells. Human fetal liver aged 22 wk of gestation (HFL22w) corresponds to the turning point between immigration and emigration of the hematopoietic system. To gain further molecular insight into its developmental and functi...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.175501

    authors: Yu Y,Zhang C,Zhou G,Wu S,Qu X,Wei H,Xing G,Dong C,Zhai Y,Wan J,Ouyang S,Li L,Zhang S,Zhou K,Zhang Y,Wu C,He F

    update_date:2001-08-01 00:00:00

  • Comparative analysis of human chromosome 7q21 and mouse proximal chromosome 6 reveals a placental-specific imprinted gene, TFPI2/Tfpi2, which requires EHMT2 and EED for allelic-silencing.

    abstract::Genomic imprinting is a developmentally important mechanism that involves both differential DNA methylation and allelic histone modifications. Through detailed comparative characterization, a large imprinted domain mapping to chromosome 7q21 in humans and proximal chromosome 6 in mice was redefined. This domain is org...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.077115.108

    authors: Monk D,Wagschal A,Arnaud P,Müller PS,Parker-Katiraee L,Bourc'his D,Scherer SW,Feil R,Stanier P,Moore GE

    update_date:2008-08-01 00:00:00

  • Reprogramming of the human intestinal epigenome by surgical tissue transposition.

    abstract::Extracellular cues play critical roles in the establishment of the epigenome during development and may also contribute to epigenetic perturbations found in disease states. The direct role of the local tissue environment on the post-development human epigenome, however, remains unclear due to limitations in studies of...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.166439.113

    authors: Lay FD,Triche TJ Jr,Tsai YC,Su SF,Martin SE,Daneshmand S,Skinner EC,Liang G,Chihara Y,Jones PA

    update_date:2014-04-01 00:00:00

  • Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications.

    abstract::The rate of transcription elongation plays an important role in the timing of expression of full-length transcripts as well as in the regulation of alternative splicing. In this study, we coupled Bru-seq technology with 5,6-dichlorobenzimidazole 1-β-D-ribofuranoside (DRB) to estimate the elongation rates of over 2000 ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.171405.113

    authors: Veloso A,Kirkconnell KS,Magnuson B,Biewen B,Paulsen MT,Wilson TE,Ljungman M

    update_date:2014-06-01 00:00:00

  • The origins and evolution of chromosomes, dosage compensation, and mechanisms underlying venom regulation in snakes.

    abstract::Here we use a chromosome-level genome assembly of a prairie rattlesnake (Crotalus viridis), together with Hi-C, RNA-seq, and whole-genome resequencing data, to study key features of genome biology and evolution in reptiles. We identify the rattlesnake Z Chromosome, including the recombining pseudoautosomal region, and...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.240952.118

    authors: Schield DR,Card DC,Hales NR,Perry BW,Pasquesi GM,Blackmon H,Adams RH,Corbin AB,Smith CF,Ramesh B,Demuth JP,Betrán E,Tollis M,Meik JM,Mackessy SP,Castoe TA

    update_date:2019-04-01 00:00:00

  • Multiple major disease-associated clones of Legionella pneumophila have emerged recently and independently.

    abstract::Legionella pneumophila is an environmental bacterium and the leading cause of Legionnaires' disease. Just five sequence types (ST), from more than 2000 currently described, cause nearly half of disease cases in northwest Europe. Here, we report the sequence and analyses of 364 L. pneumophila genomes, including 337 fro...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.209536.116

    authors: David S,Rusniok C,Mentasti M,Gomez-Valero L,Harris SR,Lechat P,Lees J,Ginevra C,Glaser P,Ma L,Bouchier C,Underwood A,Jarraud S,Harrison TG,Parkhill J,Buchrieser C

    update_date:2016-11-01 00:00:00

  • A simplified procedure for developing multiplex PCRs.

    abstract::We have developed a simplified method for multiplex PCR based on the use of chimeric primers. Each primer contains a 3' region complementary to sequence-specific recognition sites and a 5' region made up of an unrelated 20-nucleotide sequence. Identical reaction conditions, cycling times, and annealing temperatures ha...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.5.5.488

    authors: Shuber AP,Grondin VJ,Klinger KW

    update_date:1995-12-01 00:00:00

  • Analysis of the floral transcriptome uncovers new regulators of organ determination and gene families related to flower organ differentiation in Gerbera hybrida (Asteraceae).

    abstract::Development of composite inflorescences in the plant family Asteraceae has features that cannot be studied in the traditional model plants for flower development. In Gerbera hybrida, inflorescences are composed of morphologically different types of flowers tightly packed into a flower head (capitulum). Individual flor...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.3043705

    authors: Laitinen RA,Immanen J,Auvinen P,Rudd S,Alatalo E,Paulin L,Ainasoja M,Kotilainen M,Koskela S,Teeri TH,Elomaa P

    update_date:2005-04-01 00:00:00

  • GeneID in Drosophila.

    abstract::GeneID is a program to predict genes in anonymous genomic sequences designed with a hierarchical structure. In the first step, splice sites, and start and stop codons are predicted and scored along the sequence using position weight matrices (PWMs). In the second step, exons are built from the sites. Exons are scored ...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.10.4.511

    authors: Parra G,Blanco E,Guigó R

    update_date:2000-04-01 00:00:00

  • The extensive and condition-dependent nature of epistasis among whole-genome duplicates in yeast.

    abstract::Since complete redundancy between extant duplicates (paralogs) is evolutionarily unfavorable, some degree of functional congruency is eventually lost. However, in budding yeast, experimental evidence collected for duplicated metabolic enzymes and in global physical interaction surveys had suggested widespread function...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.076174.108

    authors: Musso G,Costanzo M,Huangfu M,Smith AM,Paw J,San Luis BJ,Boone C,Giaever G,Nislow C,Emili A,Zhang Z

    update_date:2008-07-01 00:00:00

  • Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter.

    abstract::An open question in bacterial genomics is the role that adaptive evolution of the core genome plays in diversification and adaptation of bacterial species, and how this might differ between groups of bacteria occupying different environmental circumstances. The genus Campylobacter encompasses several important human a...

    journal_title:Genome research

    pub_type: Journal Article

    doi:10.1101/gr.089250.108

    authors: Lefébure T,Stanhope MJ

    update_date:2009-07-01 00:00:00