Abstract:
BACKGROUND: MicroRNAs (miRNAs), short RNA molecules of approximately 21 nucleotides, play an important role in post-transcriptional regulation of gene expression. The number of known miRNA hairpins registered in the miRBase database is rapidly increasing, but recent reports suggest that many miRNAs with restricted temporal or tissue-specific expression remain undiscovered. Various strategies for in silico miRNA identification have been proposed to facilitate miRNA discovery. Notably, support vector machine (SVM) methods have recently gained popularity. However, a drawback of these methods is that they do not provide insight into the biological properties of miRNA sequences.

RESULTS: Here we propose a new strategy for miRNA hairpin prediction in which the likelihood that a genomic hairpin is a true miRNA hairpin is evaluated based on statistical distributions of the observed biological variation in properties (descriptors) of known miRNA hairpins. These distributions are transformed into a single, continuous outcome classifier called the L score. Using a dataset of known miRNA hairpins from the miRBase database and an exhaustive set of genomic hairpins identified in the genome of Caenorhabditis elegans, a subset of the 18 most informative descriptors was selected after detailed analysis of the correlation among, and the discriminative power of, individual descriptors. We show that the majority of previously identified miRNA hairpins have high L scores, that the method outperforms miRNA prediction by threshold filtering, and that it is more transparent than SVM classifiers.

CONCLUSION: The L score is applicable as a prediction classifier with high sensitivity for novel miRNA hairpins. The L-score approach can be used to rank and select interesting miRNA hairpin candidates for downstream experimental analysis when coupled to a genome-wide set of in silico-identified hairpins, or to facilitate the analysis of large sets of putative miRNA hairpin loci obtained in deep-sequencing efforts of small RNAs. Moreover, the in-depth analyses of miRNA hairpin descriptors that precede and determine the L score outcome could be used as an extension to miRBase entries to help increase the reliability and biological relevance of the miRNA registry.
journal_name: BMC Genomics
journal_title: BMC genomics
authors: van der Burgt A, Fiers MW, Nap JP, van Ham RC
doi: 10.1186/1471-2164-10-204
subject: Has Abstract
pub_date: 2009-04-30 00:00:00
pages: 204
issn: 1471-2164
pii: 1471-2164-10-204
journal_volume: 10
pub_type: journal article