Predicting long non-coding RNAs through feature ensemble learning.

Abstract:

BACKGROUND: Advances in sequencing technologies have generated large numbers of transcripts, and lncRNA is an important type of transcript. Predicting lncRNAs from transcripts is a challenging and important task. Traditional experimental lncRNA prediction methods are time-consuming and labor-intensive, so efficient computational methods for lncRNA prediction are in demand.

RESULTS: In this paper, we propose two lncRNA prediction methods based on feature ensemble learning strategies, named LncPred-IEL and LncPred-ANEL. Specifically, we encode sequences into six different types of features, including transcript-specified features and general sequence-derived features. We then consider two feature ensemble strategies to utilize and integrate the information in the different feature types: iterative ensemble learning (IEL) and attention network ensemble learning (ANEL). IEL uses a supervised iterative procedure to ensemble base predictors built on the six feature types. ANEL introduces an attention mechanism-based deep learning model that ensembles features by adaptively learning the weight of each feature type. Experiments demonstrate that both LncPred-IEL and LncPred-ANEL can effectively separate lncRNAs from other transcripts in feature space. Moreover, comparison experiments demonstrate that LncPred-IEL and LncPred-ANEL outperform several state-of-the-art methods when evaluated by 5-fold cross-validation. Both methods also perform well in cross-species lncRNA prediction.

CONCLUSIONS: LncPred-IEL and LncPred-ANEL are promising lncRNA prediction tools that can effectively utilize and integrate the information in different types of features.
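The idea of weighting feature types adaptively can be illustrated with a minimal sketch. It is not the authors' implementation: the k-mer frequency encoding stands in for one possible sequence-derived feature type, and the function names (`kmer_frequencies`, `softmax`, `attention_ensemble`) and the fixed attention logits are hypothetical. In ANEL the weights are learned by a deep model; here they are simply given.

```python
import math
from itertools import product


def kmer_frequencies(seq, k=2):
    """Normalized k-mer frequency vector over ACGT (an assumed example feature type)."""
    seq = seq.upper().replace("U", "T")
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing ambiguous bases such as N
            counts[kmer] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]


def softmax(logits):
    """Numerically stable softmax; turns raw attention logits into weights."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]


def attention_ensemble(scores, logits):
    """Weighted sum of per-feature-type scores using softmax-normalized weights."""
    weights = softmax(logits)
    return sum(w * s for w, s in zip(weights, scores))


# Hypothetical scores from predictors built on three feature types,
# combined with hypothetical (not learned) attention logits.
combined = attention_ensemble([0.9, 0.6, 0.3], [2.0, 1.0, 0.0])
print(round(combined, 3))
```

With equal logits the combination reduces to a plain average of the per-feature-type scores; larger logits shift the result toward the corresponding feature type.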

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Xu Y,Zhao X,Liu S,Zhang W

doi

10.1186/s12864-020-07237-y

subject

Has Abstract

pub_date

2020-12-17 00:00:00

pages

865

issue

Suppl 13

issn

1471-2164

pii

10.1186/s12864-020-07237-y

journal_volume

21

pub_type

Journal Article
  • Exposure to maternal obesity alters gene expression in the preimplantation ovine conceptus.

    abstract:BACKGROUND:Embryonic and fetal exposure to maternal obesity causes several maladaptive morphological and epigenetic changes in exposed offspring. The timing of these events is unclear, but changes can be observed even after a short exposure to maternal obesity around the time of conception. The hypothesis of this work ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5120-0

    authors: McCoski SR,Vailes MT,Owens CE,Cockrum RR,Ealy AD

    update_date:2018-10-11 00:00:00

  • Omics profiles used to evaluate the gene expression of Exiguobacterium antarcticum B7 during cold adaptation.

    abstract:BACKGROUND:Exiguobacterium antarcticum strain B7 is a Gram-positive psychrotrophic bacterial species isolated in Antarctica. Although this bacterium has been poorly studied, its genome has already been sequenced. Therefore, it is an appropriate model for the study of thermal adaptation. In the present study, we analyzed...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-986

    authors: Dall'Agnol HP,Baraúna RA,de Sá PH,Ramos RT,Nóbrega F,Nunes CI,das Graças DA,Carneiro AR,Santos DM,Pimenta AM,Carepo MS,Azevedo V,Pellizari VH,Schneider MP,Silva A

    update_date:2014-11-18 00:00:00

  • Evidence for a non-canonical JAK/STAT signaling pathway in the synthesis of the brain's major ion channels and neurotransmitter receptors.

    abstract:BACKGROUND:Brain-derived neurotrophic factor (BDNF) is a major signaling molecule that the brain uses to control a vast network of intracellular cascades fundamental to properties of learning and memory, and cognition. While much is known about BDNF signaling in the healthy nervous system where it controls the mitogen ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-6033-2

    authors: Hixson KM,Cogswell M,Brooks-Kayal AR,Russek SJ

    update_date:2019-08-28 00:00:00

  • A fast-linear mixed model for genome-wide haplotype association analysis: application to agronomic traits in maize.

    abstract:BACKGROUND:Haplotypes combine the effects of several single nucleotide polymorphisms (SNPs) with high linkage disequilibrium, which benefit the genome-wide association analysis (GWAS). In the haplotype association analysis, both haplotype alleles and blocks are tested. Haplotype alleles can be inferred with the same st...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6552-x

    authors: Chen H,Hao Z,Zhao Y,Yang R

    update_date:2020-02-11 00:00:00

  • Novel β-catenin target genes identified in thalamic neurons encode modulators of neuronal excitability.

    abstract:BACKGROUND:LEF1/TCF transcription factors and their activator β-catenin are effectors of the canonical Wnt pathway. Although Wnt/β-catenin signaling has been implicated in neurodegenerative and psychiatric disorders, its possible role in the adult brain remains enigmatic. To address this issue, we sought to identify th...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-635

    authors: Wisniewska MB,Nagalski A,Dabrowski M,Misztal K,Kuznicki J

    update_date:2012-11-17 00:00:00

  • Comparison between two amplicon-based sequencing panels of different scales in the detection of somatic mutations associated with gastric cancer.

    abstract:BACKGROUND:Sequencing data from The Cancer Genome Atlas (TCGA), the International Cancer Genome Consortium and other research institutes have revealed the presence of genetic alterations in several tumor types, including gastric cancer. These data have been combined into a catalog of significantly mutated genes for eac...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3166-4

    authors: Hirotsu Y,Kojima Y,Okimoto K,Amemiya K,Mochizuki H,Omata M

    update_date:2016-10-26 00:00:00

  • Clostridium sticklandii, a specialist in amino acid degradation: revisiting its metabolism through its genome sequence.

    abstract:BACKGROUND:Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabol...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-555

    authors: Fonknechten N,Chaussonnerie S,Tricot S,Lajus A,Andreesen JR,Perchat N,Pelletier E,Gouyvenoux M,Barbe V,Salanoubat M,Le Paslier D,Weissenbach J,Cohen GN,Kreimeyer A

    update_date:2010-10-11 00:00:00

  • Genome sequence of an aflatoxigenic pathogen of Argentinian peanut, Aspergillus arachidicola.

    abstract:BACKGROUND:Aspergillus arachidicola is an aflatoxigenic fungal species, first isolated from the leaves of a wild peanut species native to Argentina. It has since been reported in maize, Brazil nut and human sputum samples. This aflatoxigenic species is capable of secreting both B and G aflatoxins, similar to A. parasit...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4576-2

    authors: Moore GG,Mack BM,Beltz SB,Puel O

    update_date:2018-03-09 00:00:00

  • Study of spontaneous mutations in the transmission of poplar chloroplast genomes from mother to offspring.

    abstract:BACKGROUND:Chloroplasts have their own genomes, independent from nuclear genomes, that play vital roles in growth, which is a major targeted trait for genetic improvement in Populus. Angiosperm chloroplast genomes are maternally inherited, but the chloroplast variation pattern of poplar at the single-base level during...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-4813-8

    authors: Zhu S,Xu M,Wang H,Pan H,Wang G,Huang M

    update_date:2018-05-29 00:00:00

  • A transcript profiling approach reveals the zinc finger transcription factor ZNF191 is a pleiotropic factor.

    abstract:BACKGROUND:The human zinc finger protein 191 (ZNF191) is a member of the SCAN domain family of Krüppel-like zinc finger transcription factors. ZNF191 shows 94% identity to its mouse homologue zinc finger protein 191 (Zfp191), which is the most highly conserved among the human-mouse SCAN family member orthologue pairs. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-241

    authors: Li J,Chen X,Gong X,Liu Y,Feng H,Qiu L,Hu Z,Zhang J

    update_date:2009-05-22 00:00:00

  • Exclusivity offers a sound yet practical species criterion for bacteria despite abundant gene flow.

    abstract:BACKGROUND:The question of whether bacterial species objectively exist has long divided microbiologists. A major source of contention stems from the fact that bacteria regularly engage in horizontal gene transfer (HGT), making it difficult to ascertain relatedness and draw boundaries between taxa. A natural way to defi...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5099-6

    authors: Wright ES,Baum DA

    update_date:2018-10-03 00:00:00

  • Identification of a radiosensitivity signature using integrative metaanalysis of published microarray data for NCI-60 cancer cells.

    abstract:BACKGROUND:In the postgenome era, a prediction of response to treatment could lead to better dose selection for patients in radiotherapy. To identify a radiosensitive gene signature and elucidate related signaling pathways, four different microarray experiments were reanalyzed before radiotherapy. RESULTS:Radiosensiti...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-348

    authors: Kim HS,Kim SC,Kim SJ,Park CH,Jeung HC,Kim YB,Ahn JB,Chung HC,Rha SY

    update_date:2012-07-30 00:00:00

  • Lung transcriptomic clock predicts premature aging in cigarette smoke-exposed mice.

    abstract:BACKGROUND:Lung aging is characterized by a number of structural alterations including fibrosis, chronic inflammation and the alteration of inflammatory cell composition. Chronic exposure to cigarette smoke (CS) is known to induce similar alterations and may contribute to premature lung aging. Additionally, aging and C...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6712-z

    authors: Choukrallah MA,Hoeng J,Peitsch MC,Martin F

    update_date:2020-04-09 00:00:00

  • Multi-omics analysis reveals regulators of the response to nitrogen limitation in Yarrowia lipolytica.

    abstract:BACKGROUND:Yarrowia lipolytica is an oleaginous ascomycete yeast that stores lipids in response to limitation of nitrogen. While the enzymatic pathways responsible for neutral lipid accumulation in Y. lipolytica are well characterized, regulation of these pathways has received little attention. We therefore sought to c...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-2471-2

    authors: Pomraning KR,Kim YM,Nicora CD,Chu RK,Bredeweg EL,Purvine SO,Hu D,Metz TO,Baker SE

    update_date:2016-02-25 00:00:00

  • Next-generation sequencing identifies equine cartilage and subchondral bone miRNAs and suggests their involvement in osteochondrosis physiopathology.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are an abundant class of small single-stranded non-coding RNA molecules ranging from 18 to 24 nucleotides. They negatively regulate gene expression at the post-transcriptional level and play key roles in many biological processes, including skeletal development and cartilage maturation. In...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-798

    authors: Desjardin C,Vaiman A,Mata X,Legendre R,Laubier J,Kennedy SP,Laloe D,Barrey E,Jacques C,Cribiu EP,Schibler L

    update_date:2014-09-17 00:00:00

  • Comprehensive analysis of CCCH-type zinc finger family genes facilitates functional gene discovery and reflects recent allopolyploidization event in tetraploid switchgrass.

    abstract:BACKGROUND:In recent years, dozens of Arabidopsis and rice CCCH-type zinc finger genes have been functionally studied, many of which confer important traits, such as abiotic and biotic stress tolerance, delayed leaf senescence and improved plant architecture. Switchgrass (Panicum virgatum) is an important bioenergy cro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1328-4

    authors: Yuan S,Xu B,Zhang J,Xie Z,Cheng Q,Yang Z,Cai Q,Huang B

    update_date:2015-02-25 00:00:00

  • Comparative transcriptomics uncovers alternative splicing changes and signatures of selection from maize improvement.

    abstract:BACKGROUND:Alternative splicing (AS) is an important regulatory mechanism that greatly contributes to eukaryotic transcriptome diversity. A substantial amount of evidence has demonstrated that AS complexity is relevant to eukaryotic evolution, development, adaptation, and complexity. In this study, six teosinte and ten...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1582-5

    authors: Huang J,Gao Y,Jia H,Liu L,Zhang D,Zhang Z

    update_date:2015-05-08 00:00:00

  • Consistent levels of A-to-I RNA editing across individuals in coding sequences and non-conserved Alu repeats.

    abstract:BACKGROUND:Adenosine to inosine (A-to-I) RNA-editing is an essential post-transcriptional mechanism that occurs in numerous sites in the human transcriptome, mainly within Alu repeats. It has been shown to have consistent levels of editing across individuals in a few targets in the human brain and altered in several hu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-608

    authors: Greenberger S,Levanon EY,Paz-Yaacov N,Barzilai A,Safran M,Osenberg S,Amariglio N,Rechavi G,Eisenberg E

    update_date:2010-10-28 00:00:00

  • Methods for high-throughput MethylCap-Seq data analysis.

    abstract:BACKGROUND:Advances in whole genome profiling have revolutionized the cancer research field, but at the same time have raised new bioinformatics challenges. For next generation sequencing (NGS), these include data storage, computational costs, sequence processing and alignment, delineating appropriate statistical measu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-S6-S14

    authors: Rodriguez BA,Frankhouser D,Murphy M,Trimarchi M,Tam HH,Curfman J,Huang R,Chan MW,Lai HC,Parikh D,Ball B,Schwind S,Blum W,Marcucci G,Yan P,Bundschuh R

    update_date:2012-01-01 00:00:00

  • Large-scale analysis of post-translational modifications in E. coli under glucose-limiting conditions.

    abstract:BACKGROUND:Post-translational modification (PTM) of proteins is central to many cellular processes across all domains of life, but despite decades of study and a wealth of genomic and proteomic data the biological function of many PTMs remains unknown. This is especially true for prokaryotic PTM systems, many of which ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3676-8

    authors: Brown CW,Sridhara V,Boutz DR,Person MD,Marcotte EM,Barrick JE,Wilke CO

    update_date:2017-04-17 00:00:00

  • Microarray-based ultra-high resolution discovery of genomic deletion mutations.

    abstract:BACKGROUND:Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representat...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-224

    authors: Belfield EJ,Brown C,Gan X,Jiang C,Baban D,Mithani A,Mott R,Ragoussis J,Harberd NP

    update_date:2014-03-22 00:00:00

  • Genome-wide expression profiling shows transcriptional reprogramming in Fusarium graminearum by Fusarium graminearum virus 1-DK21 infection.

    abstract:BACKGROUND:Fusarium graminearum virus 1 strain-DK21 (FgV1-DK21) is a mycovirus that confers hypovirulence to F. graminearum, which is the primary phytopathogenic fungus that causes Fusarium head blight (FHB) disease in many cereals. Understanding the interaction between mycoviruses and plant pathogenic fungi is necessa...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-173

    authors: Cho WK,Yu J,Lee KM,Son M,Min K,Lee YW,Kim KH

    update_date:2012-05-06 00:00:00

  • The initial deficiency of protein processing and flavonoids biosynthesis were the main mechanisms for the male sterility induced by SX-1 in Brassica napus.

    abstract:BACKGROUND:Rapeseed (Brassica napus) is an important oil seed crop in the Brassicaceae family. Chemical induced male sterility (CIMS) is one of the widely used methods to produce hybrids in B. napus. Identification of the key genes and pathways involved in CIMS is important to understand the underlying molecu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5203-y

    authors: Ning L,Lin Z,Gu J,Gan L,Li Y,Wang H,Miao L,Zhang L,Wang B,Li M

    update_date:2018-11-07 00:00:00

  • Comparative analysis of the kinomes of three pathogenic trypanosomatids: Leishmania major, Trypanosoma brucei and Trypanosoma cruzi.

    abstract:BACKGROUND:The trypanosomatids Leishmania major, Trypanosoma brucei and Trypanosoma cruzi cause some of the most debilitating diseases of humankind: cutaneous leishmaniasis, African sleeping sickness, and Chagas disease. These protozoa possess complex life cycles that involve development in mammalian and insect hosts, ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-6-127

    authors: Parsons M,Worthey EA,Ward PN,Mottram JC

    update_date:2005-09-15 00:00:00

  • Mitochondrial lineage M1 traces an early human backflow to Africa.

    abstract:BACKGROUND:The out of Africa hypothesis has gained generalized consensus. However, many specific questions remain unsettled. To know whether the two M and N macrohaplogroups that colonized Eurasia were already present in Africa before the exit is puzzling. It has been proposed that the east African clade M1 supports a ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-223

    authors: González AM,Larruga JM,Abu-Amero KK,Shi Y,Pestano J,Cabrera VM

    update_date:2007-07-09 00:00:00

  • Genome-scale identification of Caenorhabditis elegans regulatory elements by tiling-array mapping of DNase I hypersensitive sites.

    abstract:BACKGROUND:A major goal of post-genomics research is the integrated analysis of genes, regulatory elements and the chromatin architecture on a genome-wide scale. Mapping DNase I hypersensitive sites within the nuclear chromatin is a powerful and well-established method of identifying regulatory element candidates. RES...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-92

    authors: Shi B,Guo X,Wu T,Sheng S,Wang J,Skogerbø G,Zhu X,Chen R

    update_date:2009-02-25 00:00:00

  • Comparative transcriptomics of drought responses in Populus: a meta-analysis of genome-wide expression profiling in mature leaves and root apices across two genotypes.

    abstract:BACKGROUND:Comparative genomics has emerged as a promising means of unravelling the molecular networks underlying complex traits such as drought tolerance. Here we assess the genotype-dependent component of the drought-induced transcriptome response in two poplar genotypes differing in drought tolerance. Drought-induce...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-630

    authors: Cohen D,Bogeat-Triboulot MB,Tisserant E,Balzergue S,Martin-Magniette ML,Lelandais G,Ningre N,Renou JP,Tamby JP,Le Thiec D,Hummel I

    update_date:2010-11-12 00:00:00

  • Protein acetylation in mitochondria plays critical functions in the pathogenesis of fatty liver disease.

    abstract:BACKGROUND:Fatty liver is a high-incidence perinatal disease in dairy cows caused by negative energy balance, which seriously threatens postpartum health and milk production. It has been reported that lysine acetylation plays an important role in substance and energy metabolism. Predictably, most metabolic proce...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-06837-y

    authors: Le-Tian Z,Cheng-Zhang H,Xuan Z,Zhang Q,Zhen-Gui Y,Qing-Qing W,Sheng-Xuan W,Zhong-Jin X,Ran-Ran L,Ting-Jun L,Zhong-Qu S,Zhong-Hua W,Ke-Rong S

    update_date:2020-06-26 00:00:00

  • Sequence differences in the seed dormancy gene Qsd1 among various wheat genomes.

    abstract:BACKGROUND:Pre-harvest sprouting frequently occurs in Triticum aestivum (wheat) and Hordeum vulgare (barley) at the end of the maturity period due to high rainfall, particularly in Asian monsoon areas. Seed dormancy is a major mechanism preventing pre-harvest sprouting in these crops. RESULTS:We identified orthologous...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3880-6

    authors: Onishi K,Yamane M,Yamaji N,Tokui M,Kanamori H,Wu J,Komatsuda T,Sato K

    update_date:2017-06-29 00:00:00

  • Identification of sensory hair-cell transcripts by thiouracil-tagging in zebrafish.

    abstract:BACKGROUND:Sensory hair cells are exquisitely sensitive to mechanical stimuli and as such, are prone to damage and apoptosis during dissections or in vitro manipulations. Thiouracil (TU)-tagging is a noninvasive method to label cell type-specific transcripts in an intact organism, thereby meeting the challenge of how t...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2072-5

    authors: Erickson T,Nicolson T

    update_date:2015-10-23 00:00:00