In silico miRNA prediction in metazoan genomes: balancing between sensitivity and specificity.

Abstract:

BACKGROUND: MicroRNAs (miRNAs), short RNA molecules of approximately 21 nucleotides, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal or tissue-specific expression remain undiscovered. Various strategies for in silico miRNA identification have been proposed to facilitate miRNA discovery. Notably, support vector machine (SVM) methods have recently gained popularity. However, a drawback of these methods is that they do not provide insight into the biological properties of miRNA sequences.

RESULTS: We propose a new strategy for miRNA hairpin prediction in which the likelihood that a genomic hairpin is a true miRNA hairpin is evaluated based on statistical distributions of the observed biological variation in properties (descriptors) of known miRNA hairpins. These distributions are transformed into a single, continuous outcome classifier called the L score. Using a dataset of known miRNA hairpins from the miRBase database and an exhaustive set of genomic hairpins identified in the genome of Caenorhabditis elegans, a subset of the 18 most informative descriptors was selected after detailed analysis of the correlation among, and the discriminative power of, individual descriptors. We show that the majority of previously identified miRNA hairpins have high L scores, that the method outperforms miRNA prediction by threshold filtering, and that it is more transparent than SVM classifiers.

CONCLUSION: The L score is applicable as a prediction classifier with high sensitivity for novel miRNA hairpins. The L-score approach can be used to rank and select interesting miRNA hairpin candidates for downstream experimental analysis when coupled to a genome-wide set of in silico-identified hairpins, or to facilitate the analysis of large sets of putative miRNA hairpin loci obtained in deep-sequencing efforts of small RNAs. Moreover, the in-depth analyses of miRNA hairpin descriptors preceding and determining the L score outcome could be used as an extension to miRBase entries to help increase the reliability and biological relevance of the miRNA registry.
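
The general idea of scoring a candidate hairpin against descriptor distributions of known miRNA hairpins can be illustrated with a minimal sketch. This is not the authors' L score formula, which the abstract does not give. It assumes, purely for illustration, that each descriptor is normally distributed and that descriptors are independent, and it sums per-descriptor log densities into one continuous score. The descriptor names (`hairpin_length`, `gc_fraction`), the function names, and the toy values are all hypothetical.

```python
import math
from statistics import mean, stdev


def fit_descriptors(known_hairpins):
    """Estimate (mean, standard deviation) per descriptor from known hairpins."""
    params = {}
    for name in known_hairpins[0]:
        values = [hairpin[name] for hairpin in known_hairpins]
        params[name] = (mean(values), stdev(values))
    return params


def log_normal_density(x, mu, sigma):
    """Log of the normal probability density (an illustrative assumption)."""
    return -0.5 * math.log(2.0 * math.pi * sigma ** 2) - (x - mu) ** 2 / (2.0 * sigma ** 2)


def likelihood_score(candidate, params):
    """Sum per-descriptor log densities, treating descriptors as independent."""
    return sum(
        log_normal_density(candidate[name], mu, sigma)
        for name, (mu, sigma) in params.items()
    )


# Toy values for illustration only; they are not taken from the paper.
known = [
    {"hairpin_length": 95.0, "gc_fraction": 0.42},
    {"hairpin_length": 100.0, "gc_fraction": 0.45},
    {"hairpin_length": 105.0, "gc_fraction": 0.48},
]
params = fit_descriptors(known)

typical = {"hairpin_length": 100.0, "gc_fraction": 0.45}
atypical = {"hairpin_length": 160.0, "gc_fraction": 0.70}

# A candidate resembling known hairpins receives the higher score.
print(likelihood_score(typical, params) > likelihood_score(atypical, params))
```

Because the score is continuous, candidates can be ranked rather than only accepted or rejected, and the per-descriptor terms show which properties drive a given score.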

journal_name

BMC Genomics

authors

van der Burgt A,Fiers MW,Nap JP,van Ham RC

doi

10.1186/1471-2164-10-204

pub_date

2009-04-30 00:00:00

pages

204

issn

1471-2164

pii

1471-2164-10-204

journal_volume

10

pub_type

Journal Article
  • Blood-based epigenetic estimators of chronological age in human adults using DNA methylation data from the Illumina MethylationEPIC array.

    abstract:BACKGROUND:Epigenetic clocks have been recognized for their precise prediction of chronological age, age-related diseases, and all-cause mortality. Existing epigenetic clocks are based on CpGs from the Illumina HumanMethylation450 BeadChip (450 K) which has now been replaced by the latest platform, Illumina Methylation...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-07168-8

    authors: Lee Y,Haftorn KL,Denault WRP,Nustad HE,Page CM,Lyle R,Lee-Ødegård S,Moen GH,Prasad RB,Groop LC,Sletner L,Sommer C,Magnus MC,Gjessing HK,Harris JR,Magnus P,Håberg SE,Jugessur A,Bohlin J

    update_date:2020-10-27 00:00:00

  • Discovering monotonic stemness marker genes from time-series stem cell microarray data.

    abstract:BACKGROUND:Identification of genes with ascending or descending monotonic expression patterns over time or stages of stem cells is an important issue in time-series microarray data analysis. We propose a method named Monotonic Feature Selector (MFSelector) based on a concept of total discriminating error (DEtotal) to i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-16-S2-S2

    authors: Wang HW,Sun HJ,Chang TY,Lo HH,Cheng WC,Tseng GC,Lin CT,Chang SJ,Pal N,Chung IF

    update_date:2015-01-01 00:00:00

  • The prediction of protein-protein interaction networks in rice blast fungus.

    abstract:BACKGROUND:Protein-protein interaction (PPI) maps are useful tools for investigating the cellular functions of genes. Thus far, large-scale PPI mapping projects have not been implemented for the rice blast fungus Magnaporthe grisea, which is responsible for the most severe rice disease. Inspired by recent advances in P...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-519

    authors: He F,Zhang Y,Chen H,Zhang Z,Peng YL

    update_date:2008-11-02 00:00:00

  • Next generation sequencing analysis reveals a relationship between rDNA unit diversity and locus number in Nicotiana diploids.

    abstract:BACKGROUND:Tandemly arranged nuclear ribosomal DNA (rDNA), encoding 18S, 5.8S and 26S ribosomal RNA (rRNA), exhibit concerted evolution, a pattern thought to result from the homogenisation of rDNA arrays. However rDNA homogeneity at the single nucleotide polymorphism (SNP) level has not been detailed in organisms with ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-722

    authors: Matyášek R,Renny-Byfield S,Fulneček J,Macas J,Grandbastien MA,Nichols R,Leitch A,Kovařík A

    update_date:2012-12-23 00:00:00

  • Host specialization of the blast fungus Magnaporthe oryzae is associated with dynamic gain and loss of genes linked to transposable elements.

    abstract:BACKGROUND:Magnaporthe oryzae (anamorph Pyricularia oryzae) is the causal agent of blast disease of Poaceae crops and their wild relatives. To understand the genetic mechanisms that drive host specialization of M. oryzae, we carried out whole genome resequencing of four M. oryzae isolates from rice (Oryza sativa), one ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2690-6

    authors: Yoshida K,Saunders DG,Mitsuoka C,Natsume S,Kosugi S,Saitoh H,Inoue Y,Chuma I,Tosa Y,Cano LM,Kamoun S,Terauchi R

    update_date:2016-05-18 00:00:00

  • Analysis of 4,664 high-quality sequence-finished poplar full-length cDNA clones and their utility for the discovery of genes responding to insect feeding.

    abstract:BACKGROUND:The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar h...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-57

    authors: Ralph SG,Chun HJ,Cooper D,Kirkpatrick R,Kolosova N,Gunter L,Tuskan GA,Douglas CJ,Holt RA,Jones SJ,Marra MA,Bohlmann J

    update_date:2008-01-29 00:00:00

  • The metabolome as a link in the genotype-phenotype map for peroxide resistance in the fruit fly, Drosophila melanogaster.

    abstract:BACKGROUND:Genetic association studies that seek to explain the inheritance of complex traits typically fail to explain a majority of the heritability of the trait under study. Thus, we are left with a gap in the map from genotype to phenotype. Several approaches have been used to fill this gap, including those that at...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6739-1

    authors: Harrison BR,Wang L,Gajda E,Hoffman EV,Chung BY,Pletcher SD,Raftery D,Promislow DEL

    update_date:2020-05-04 00:00:00

  • XenDB: full length cDNA prediction and cross species mapping in Xenopus laevis.

    abstract:BACKGROUND:Research using the model system Xenopus laevis has provided critical insights into the mechanisms of early vertebrate development and cell biology. Large scale sequencing efforts have provided an increasingly important resource for researchers. To provide full advantage of the available sequence, we have ana...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-6-123

    authors: Sczyrba A,Beckstette M,Brivanlou AH,Giegerich R,Altmann CR

    update_date:2005-09-14 00:00:00

  • Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection.

    abstract:BACKGROUND:Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21) is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB) disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessa...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-173

    authors: Cho WK,Yu J,Lee KM,Son M,Min K,Lee YW,Kim KH

    update_date:2012-05-06 00:00:00

  • Origin and fate of pseudogenes in Hemiascomycetes: a comparative analysis.

    abstract:BACKGROUND:Pseudogenes are ubiquitous genetic elements that derive from functional genes after mutational inactivation. Characterization of pseudogenes is important to understand genome dynamics and evolution, and its significance increases when several genomes of related organisms can be compared. Among yeasts, only t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-260

    authors: Lafontaine I,Dujon B

    update_date:2010-04-22 00:00:00

  • Cluster analysis of replicated alternative polyadenylation data using canonical correlation analysis.

    abstract:BACKGROUND:Alternative polyadenylation (APA) has emerged as a pervasive mechanism that contributes to the transcriptome complexity and dynamics of gene regulation. The current tsunami of whole genome poly(A) site data from various conditions generated by 3' end sequencing provides a valuable data source for the study o...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5433-7

    authors: Ye W,Long Y,Ji G,Su Y,Ye P,Fu H,Wu X

    update_date:2019-01-22 00:00:00

  • Comparative mitochondrial proteomic, physiological, biochemical and ultrastructural profiling reveal factors underpinning salt tolerance in tetraploid black locust (Robinia pseudoacacia L.).

    abstract:BACKGROUND:Polyploidy is an important phenomenon in plants because of its roles in agricultural and forestry production as well as in plant tolerance to environmental stresses. Tetraploid black locust (Robinia pseudoacacia L.) is a polyploid plant and a pioneer tree species due to its wide ranging adaptability to adver...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-4038-2

    authors: Luo Q,Peng M,Zhang X,Lei P,Ji X,Chow W,Meng F,Sun G

    update_date:2017-08-22 00:00:00

  • Comparative profiling of the transcriptional response to infection in two species of Drosophila by short-read cDNA sequencing.

    abstract:BACKGROUND:Homology-based comparisons of the genes involved in innate immunity across many insect taxa with fully sequenced genomes has revealed a striking pattern of gene gain and loss, particularly among genes that encode proteins involved in clearing pathogens (effectors). However, limited functional annotation in n...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-259

    authors: Sackton TB,Clark AG

    update_date:2009-06-07 00:00:00

  • Genomic and systems evolution in Vibrionaceae species.

    abstract:BACKGROUND:The steadily increasing number of prokaryotic genomes has accelerated the study of genome evolution; in particular, the availability of sets of genomes from closely related bacteria has facilitated the exploration of the mechanisms underlying genome plasticity. The family Vibrionaceae is found in the Gammapr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-S1-S11

    authors: Gu J,Neary J,Cai H,Moshfeghian A,Rodriguez SA,Lilburn TG,Wang Y

    update_date:2009-07-07 00:00:00

  • QTL detection for Aeromonas salmonicida resistance related traits in turbot (Scophthalmus maximus).

    abstract:BACKGROUND:Interactions between fish and pathogens, that may be harmless under natural conditions, often result in serious diseases in aquaculture systems. This is especially important due to the fact that the strains used in aquaculture are derived from wild strains that may not have had enough time to adapt to new di...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-541

    authors: Rodríguez-Ramilo ST,Toro MA,Bouza C,Hermida M,Pardo BG,Cabaleiro S,Martínez P,Fernández J

    update_date:2011-11-02 00:00:00

  • Whole genome scanning and association mapping identified a significant association between growth and a SNP in the IFABP-a gene of the Asian seabass.

    abstract:BACKGROUND:Aquaculture is the quickest growing sector in agriculture. However, QTL for important traits have been only identified in a few aquaculture species. We conducted QTL mapping for growth traits in an Asian seabass F(2) family with 359 individuals using 123 microsatellites and 22 SNPs, and performed association...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-295

    authors: Xia JH,Lin G,He X,Liu P,Liu F,Sun F,Tu R,Yue GH

    update_date:2013-05-01 00:00:00

  • Evidence for niche adaptation in the genome of the bovine pathogen Streptococcus uberis.

    abstract:BACKGROUND:Streptococcus uberis, a Gram positive bacterial pathogen responsible for a significant proportion of bovine mastitis in commercial dairy herds, colonises multiple body sites of the cow including the gut, genital tract and mammary gland. Comparative analysis of the complete genome sequence of S. uberis strain...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-54

    authors: Ward PN,Holden MT,Leigh JA,Lennard N,Bignell A,Barron A,Clark L,Quail MA,Woodward J,Barrell BG,Egan SA,Field TR,Maskell D,Kehoe M,Dowson CG,Chanter N,Whatmore AM,Bentley SD,Parkhill J

    update_date:2009-01-28 00:00:00

  • The genome of the emerging barley pathogen Ramularia collo-cygni.

    abstract:BACKGROUND:Ramularia collo-cygni is a newly important, foliar fungal pathogen of barley that causes the disease Ramularia leaf spot. The fungus exhibits a prolonged endophytic growth stage before switching life habit to become an aggressive, necrotrophic pathogen that causes significant losses to green leaf area and he...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2928-3

    authors: McGrann GR,Andongabo A,Sjökvist E,Trivedi U,Dussart F,Kaczmarek M,Mackenzie A,Fountaine JM,Taylor JM,Paterson LJ,Gorniak K,Burnett F,Kanyuka K,Hammond-Kosack KE,Rudd JJ,Blaxter M,Havis ND

    update_date:2016-08-09 00:00:00

  • Diversity and structure of PIF/Harbinger-like elements in the genome of Medicago truncatula.

    abstract:BACKGROUND:Transposable elements constitute a significant fraction of plant genomes. The PIF/Harbinger superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, requi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-409

    authors: Grzebelus D,Lasota S,Gambin T,Kucherov G,Gambin A

    update_date:2007-11-09 00:00:00

  • A genomics-based systems approach towards drug repositioning for rheumatoid arthritis.

    abstract:BACKGROUND:Rheumatoid arthritis (RA) is a chronic autoimmune disease characterized by inflammation and destruction of synovial joints. RA affects up to 1 % of the population worldwide. Currently, there are no drugs that can cure RA or achieve sustained remission. The unknown cause of the disease represents a significan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2910-0

    authors: Xu R,Wang Q

    update_date:2016-08-22 00:00:00

  • Sequences conserved by selection across mouse and human malaria species.

    abstract:UNLABELLED:Little is known, either experimentally or computationally, about the genomic sequence features that regulate malaria genes. A sequence conservation analysis of the malaria species P. falciparum, P. berghei, P. yoelii, and P. chabaudi could significantly advance knowledge of malaria gene regulation. RESULTS:...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-372

    authors: Imamura H,Persampieri JH,Chuang JH

    update_date:2007-10-15 00:00:00

  • Genome-wide profiling of G protein-coupled receptors in cerebellar granule neurons using high-throughput, real-time PCR.

    abstract:BACKGROUND:G protein-coupled receptors (GPCRs) are major players in cell communication, regulate a whole range of physiological functions during development and throughout adult life, are affected in numerous pathological situations, and constitute so far the largest class of drugable targets for human diseases. The co...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-241

    authors: Maurel B,Le Digarcher A,Dantec C,Journot L

    update_date:2011-05-16 00:00:00

  • Characterization of the Shewanella oneidensis Fur gene: roles in iron and acid tolerance response.

    abstract:BACKGROUND:Iron homeostasis is a key metabolism for most organisms. In many bacterial species, coordinate regulation of iron homeostasis depends on the protein product of a Fur gene. Fur also plays roles in virulence, acid tolerance, redox-stress responses, flagella chemotaxis and metabolic pathways. RESULTS:We conduc...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-S1-S11

    authors: Yang Y,Harris DP,Luo F,Wu L,Parsons AB,Palumbo AV,Zhou J

    update_date:2008-01-01 00:00:00

  • Development of simple sequence repeat (SSR) markers from a genome survey of Chinese bayberry (Myrica rubra).

    abstract:BACKGROUND:Chinese bayberry (Myrica rubra Sieb. and Zucc.) is a subtropical evergreen tree originating in China. It has been cultivated in southern China for several thousand years, and annual production has reached 1.1 million tons. The taste and high level of health promoting characters identified in the fruit in rec...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-201

    authors: Jiao Y,Jia HM,Li XW,Chai ML,Jia HJ,Chen Z,Wang GY,Chai CY,van de Weg E,Gao ZS

    update_date:2012-05-23 00:00:00

  • An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome.

    abstract:BACKGROUND:Second generation sequencing has permitted detailed sequence characterisation at the whole genome level of a growing number of non-model organisms, but the data produced have short read-lengths and biased genome coverage leading to fragmented genome assemblies. The PacBio RS long-read sequencing platform off...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-670

    authors: Ferrarini M,Moretto M,Ward JA,Šurbanovski N,Stevanović V,Giongo L,Viola R,Cavalieri D,Velasco R,Cestaro A,Sargent DJ

    update_date:2013-10-01 00:00:00

  • A crustacean annotated transcriptome (CAT) database.

    abstract:BACKGROUND:Decapods are an order of crustaceans which includes shrimps, crabs, lobsters and crayfish. They occur worldwide and are of great scientific interest as well as being of ecological and economic importance in fisheries and aquaculture. However, our knowledge of their biology mainly comes from the group which i...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6433-3

    authors: Nong W,Chai ZYH,Jiang X,Qin J,Ma KY,Chan KM,Chan TF,Chow BKC,Kwan HS,Wong CKC,Qiu JW,Hui JHL,Chu KH

    update_date:2020-01-09 00:00:00

  • Transcriptional profiling of putative human epithelial stem cells.

    abstract:BACKGROUND:Human interfollicular epidermis is sustained by the proliferation of stem cells and their progeny, transient amplifying cells. Molecular characterization of these two cell populations is essential for better understanding of self renewal, differentiation and mechanisms of skin pathogenesis. The purpose of th...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-359

    authors: Koçer SS,Djurić PM,Bugallo MF,Simon SR,Matic M

    update_date:2008-07-30 00:00:00

  • H2A.Z marks antisense promoters and has positive effects on antisense transcript levels in budding yeast.

    abstract:BACKGROUND:The histone variant H2A.Z, which has been reported to have both activating and repressive effects on gene expression, is known to occupy nucleosomes at the 5' ends of protein-coding genes. RESULTS:We now find that H2A.Z is also significantly enriched in gene coding regions and at the 3' ends of genes in bud...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1247-4

    authors: Gu M,Naiyachit Y,Wood TJ,Millar CB

    update_date:2015-02-19 00:00:00

  • Identification of microRNA-mRNA modules using microarray data.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of mRNA expression and are involved in numerous cellular processes. Consequently, miRNAs are an important component of gene regulatory networks and an improved understanding of miRNAs will further our knowledge of these networks. There is a many-to-many ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-138

    authors: Jayaswal V,Lutherborrow M,Ma DD,Yang YH

    update_date:2011-03-06 00:00:00

  • Identification of functional regulatory elements in the human genome using pooled CRISPR screens.

    abstract:BACKGROUND:Genome-scale pooled CRISPR screens are powerful tools for identifying genetic dependencies across varied cellular processes. The vast majority of CRISPR screens reported to date have focused exclusively on the perturbation of protein-coding gene function. However, protein-coding genes comprise < 2% of the se...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6497-0

    authors: Borys SM,Younger ST

    update_date:2020-01-31 00:00:00