ACEP: improving antimicrobial peptides recognition through automatic feature fusion and amino acid embedding.

Abstract:

BACKGROUND: Antimicrobial resistance is one of our most serious health threats. Antimicrobial peptides (AMPs), effector molecules of the innate immune system, can defend host organisms against microbes, and most have shown a lower likelihood of inducing bacterial resistance than many conventional drugs. Thus, AMPs are gaining popularity as a better substitute for antibiotics. To aid researchers in the discovery of novel AMPs, we design computational approaches to screen promising candidates. RESULTS: In this work, we design a deep learning model that can learn amino acid embedding patterns, automatically extract sequence features, and fuse heterogeneous information. Results show that the proposed model outperforms state-of-the-art methods on recognition of AMPs. By visualizing data in some layers of the model, we overcome the black-box nature of deep learning, explain the working mechanism of the model, and find some important motifs in sequences. CONCLUSIONS: The ACEP model can capture similarity between amino acids, calculate attention scores for different parts of a peptide sequence in order to spot the parts that contribute significantly to final predictions, and automatically fuse a variety of heterogeneous information or features. For high-throughput AMP recognition, open source software and datasets are made freely available at https://github.com/Fuhaoyi/ACEP .

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Fu H,Cao Z,Li M,Wang S

doi

10.1186/s12864-020-06978-0

subject

Has Abstract

pub_date

2020-08-28 00:00:00

pages

597

issue

1

issn

1471-2164

pii

10.1186/s12864-020-06978-0

journal_volume

21

pub_type

Journal Article
  • Comparative genome analysis of jujube witches'-broom Phytoplasma, an obligate pathogen that causes jujube witches'-broom disease.

    abstract:BACKGROUND:JWB phytoplasma is a kind of insect-transmitted and uncultivable bacterial plant pathogen causing a destructive Jujube disease. To date, no genome information about JWB phytoplasma has been published, which has hindered its characterization at the genomic level. To understand its pathogenicity and ecology, the geno...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5075-1

    authors: Wang J,Song L,Jiao Q,Yang S,Gao R,Lu X,Zhou G

    update_date:2018-09-19 00:00:00

  • The genomes of three stocks comprising the most widely utilized live sporozoite Theileria parva vaccine exhibit very different degrees and patterns of sequence divergence.

    abstract:BACKGROUND:There are no commercially available vaccines against human protozoan parasitic diseases, despite the success of vaccination-induced long-term protection against infectious diseases. East Coast fever, caused by the protist Theileria parva, kills one million cattle each year in sub-Saharan Africa, and contribu...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1910-9

    authors: Norling M,Bishop RP,Pelle R,Qi W,Henson S,Drábek EF,Tretina K,Odongo D,Mwaura S,Njoroge T,Bongcam-Rudloff E,Daubenberger CA,Silva JC

    update_date:2015-09-24 00:00:00

  • Linkage disequilibrium and genome-wide association analysis for anthocyanin pigmentation and fruit color in eggplant.

    abstract:BACKGROUND:The genome-wide association (GWA) approach represents an alternative to biparental linkage mapping for determining the genetic basis of trait variation. Both approaches rely on recombination to re-arrange the genome, and seek to establish correlations between phenotype and genotype. The major advantages of G...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-896

    authors: Cericola F,Portis E,Lanteri S,Toppino L,Barchi L,Acciarri N,Pulcini L,Sala T,Rotino GL

    update_date:2014-10-14 00:00:00

  • Strand-specific transcriptomes of Enterohemorrhagic Escherichia coli in response to interactions with ground beef microbiota: interactions between microorganisms in raw meat.

    abstract:BACKGROUND:Enterohemorrhagic Escherichia coli (EHEC) are zoonotic agents associated with outbreaks worldwide. Growth of EHEC strains in ground beef could be inhibited by background microbiota that is present initially at levels greater than that of the pathogen E. coli. However, how the microbiota outcompetes the patho...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-017-3957-2

    authors: Galia W,Leriche F,Cruveiller S,Garnier C,Navratil V,Dubost A,Blanquet-Diot S,Thevenot-Sergentet D

    update_date:2017-08-03 00:00:00

  • Analysis of gene expression in soybean (Glycine max) roots in response to the root knot nematode Meloidogyne incognita using microarrays and KEGG pathways.

    abstract:BACKGROUND:Root-knot nematodes are sedentary endoparasites that can infect more than 3000 plant species. Root-knot nematodes cause an estimated $100 billion annual loss worldwide. For successful establishment of the root-knot nematode in its host plant, it causes dramatic morphological and physiological changes in plan...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-220

    authors: Ibrahim HM,Hosseini P,Alkharouf NW,Hussein EH,Gamal El-Din Ael K,Aly MA,Matthews BF

    update_date:2011-05-10 00:00:00

  • Distinct gene loci control the host response to influenza H1N1 virus infection in a time-dependent manner.

    abstract:BACKGROUND:There is strong but mostly circumstantial evidence that genetic factors modulate the severity of influenza infection in humans. Using genetically diverse but fully inbred strains of mice it has been shown that host sequence variants have a strong influence on the severity of influenza A disease progression. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-411

    authors: Nedelko T,Kollmus H,Klawonn F,Spijker S,Lu L,Heßman M,Alberts R,Williams RW,Schughart K

    update_date:2012-08-20 00:00:00

  • Divergence in function and expression of the NOD26-like intrinsic proteins in plants.

    abstract:BACKGROUND:NOD26-like intrinsic proteins (NIPs) that belong to the aquaporin superfamily are plant-specific and exhibit a similar three-dimensional structure. Experimental evidence, however, revealed that functional divergence should have extensively occurred among NIP genes. It is therefore intriguing to further invest...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-313

    authors: Liu Q,Wang H,Zhang Z,Wu J,Feng Y,Zhu Z

    update_date:2009-07-15 00:00:00

  • The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

    abstract:BACKGROUND:The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplif...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-016-3336-4

    authors: Molin WT,Wright AA,Lawton-Rauh A,Saski CA

    update_date:2017-01-17 00:00:00

  • Unique aspects of fiber degradation by the ruminal ethanologen Ruminococcus albus 7 revealed by physiological and transcriptomic analysis.

    abstract:BACKGROUND:Bacteria in the genus Ruminococcus are ubiquitous members of the mammalian gastrointestinal tract. In particular, they are important in ruminants where they digest a wide range of plant cell wall polysaccharides. For example, Ruminococcus albus 7 is a primary cellulose degrader that produces acetate usable b...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1066

    authors: Christopherson MR,Dawson JA,Stevenson DM,Cunningham AC,Bramhacharya S,Weimer PJ,Kendziorski C,Suen G

    update_date:2014-12-04 00:00:00

  • Unravelling the complex trait of harvest index in rapeseed (Brassica napus L.) with association mapping.

    abstract:BACKGROUND:Harvest index (HI), the ratio of grain yield to total biomass, is considered as a measure of biological success in partitioning assimilated photosynthate to the harvestable product. While crop production can be dramatically improved by increasing HI, the underlying molecular genetic mechanism of HI in rapese...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1607-0

    authors: Luo X,Ma C,Yue Y,Hu K,Li Y,Duan Z,Wu M,Tu J,Shen J,Yi B,Fu T

    update_date:2015-05-12 00:00:00

  • Microarray analysis of Foxa2 mutant mouse embryos reveals novel gene expression and inductive roles for the gastrula organizer and its derivatives.

    abstract:BACKGROUND:The Spemann/Mangold organizer is a transient tissue critical for patterning the gastrula stage vertebrate embryo and formation of the three germ layers. Despite its important role during development, there are still relatively few genes with specific expression in the organizer and its derivatives. Foxa2 is ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-511

    authors: Tamplin OJ,Kinzel D,Cox BJ,Bell CE,Rossant J,Lickert H

    update_date:2008-10-30 00:00:00

  • Genome-wide analysis of consistently RNA edited sites in human blood reveals interactions with mRNA processing genes and suggests correlations with cell types and biological variables.

    abstract:BACKGROUND:A-to-I RNA editing is a co-/post-transcriptional modification catalyzed by ADAR enzymes, that deaminates Adenosines (A) into Inosines (I). Most known editing events are located within inverted ALU repeats, but they also occur in coding sequences and may alter the function of encoded proteins. RNA editing ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-018-5364-8

    authors: Giacopuzzi E,Gennarelli M,Sacco C,Filippini A,Mingardi J,Magri C,Barbon A

    update_date:2018-12-27 00:00:00

  • Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project.

    abstract:BACKGROUND:Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of sm...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-4-40

    authors: Fernández P,Paniego N,Lew S,Hopp HE,Heinz RA

    update_date:2003-09-30 00:00:00

  • Molecular dynamics study of the archaeal aquaporin AqpM.

    abstract:BACKGROUND:Aquaporins are a large family of transmembrane channel proteins that are present throughout all domains of life and are implicated in human disorders. These channels allow the passive but selective movement of water and other small neutral solutes across cell membranes. Aquaporins have been classified into ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-S4-S8

    authors: Araya-Secchi R,Garate JA,Holmes DS,Perez-Acle T

    update_date:2011-12-22 00:00:00

  • Analysis of intra-genomic GC content homogeneity within prokaryotes.

    abstract:BACKGROUND:Bacterial genomes possess varying GC content (total guanines (Gs) and cytosines (Cs) per total of the four bases within the genome) but within a given genome, GC content can vary locally along the chromosome, with some regions significantly more or less GC rich than on average. We have examined how the GC co...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-464

    authors: Bohlin J,Snipen L,Hardy SP,Kristoffersen AB,Lagesen K,Dønsvik T,Skjerve E,Ussery DW

    update_date:2010-08-06 00:00:00

  • The complete mitochondrial genomes for three Toxocara species of human and animal health significance.

    abstract:BACKGROUND:Studying mitochondrial (mt) genomics has important implications for various fundamental areas, including mt biochemistry, physiology and molecular biology. In addition, mt genome sequences have provided useful markers for investigating population genetic structures, systematics and phylogenetics of organisms...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-224

    authors: Li MW,Lin RQ,Song HQ,Wu XY,Zhu XQ

    update_date:2008-05-16 00:00:00

  • A gene sets approach for identifying prognostic gene signatures for outcome prediction.

    abstract:BACKGROUND:Gene expression profiling is a promising approach to better estimate patient prognosis; however, there are still unresolved problems, including little overlap among similarly developed gene sets and poor performance of a developed gene set in other datasets. RESULTS:We applied a gene sets approach to develo...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-9-177

    authors: Kim SY,Kim YS

    update_date:2008-04-16 00:00:00

  • Integrated proteomic and metabolomic analysis to study the effects of spaceflight on Candida albicans.

    abstract:BACKGROUND:Candida albicans is an opportunistic pathogenic yeast, which could become pathogenic in various stressful environmental factors including the spaceflight environment. In this study, we aim to explore the phenotypic changes and possible mechanisms of C. albicans after exposure to spaceflight conditions. RESU...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6476-5

    authors: Wang J,Liu Y,Zhao G,Gao J,Liu J,Wu X,Xu C,Li Y

    update_date:2020-01-17 00:00:00

  • Gene regulation of Sclerotinia sclerotiorum during infection of Glycine max: on the road to pathogenesis.

    abstract:BACKGROUND:Sclerotinia sclerotiorum is a broad-host range necrotrophic pathogen which is the causative agent of Sclerotinia stem rot (SSR), and a major disease of soybean (Glycine max). A time course transcriptomic analysis was performed in both compatible and incompatible soybean lines to identify pathogenicity and de...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-019-5517-4

    authors: Westrick NM,Ranjan A,Jain S,Grau CR,Smith DL,Kabbage M

    update_date:2019-02-26 00:00:00

  • Integrated "omics" profiling indicates that miRNAs are modulators of the ontogenetic venom composition shift in the Central American rattlesnake, Crotalus simus simus.

    abstract:BACKGROUND:Understanding the processes that drive the evolution of snake venom is a topic of great research interest in molecular and evolutionary toxinology. Recent studies suggest that ontogenetic changes in venom composition are genetically controlled rather than environmentally induced. However, the molecular mecha...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-14-234

    authors: Durban J,Pérez A,Sanz L,Gómez A,Bonilla F,Rodríguez S,Chacón D,Sasa M,Angulo Y,Gutiérrez JM,Calvete JJ

    update_date:2013-04-10 00:00:00

  • Identification of candidate genes for human pituitary development by EST analysis.

    abstract:BACKGROUND:The pituitary is a critical neuroendocrine gland that is comprised of five hormone-secreting cell types, which develop in tandem during the embryonic stage. Some essential genes have been identified in the early stage of adenohypophysial development, such as PITX1, FGF8, BMP4 and SF-1. However, it is likely...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-10-109

    authors: Ma Y,Qi X,Du J,Song S,Feng D,Qi J,Zhu Z,Zhang X,Xiao H,Han Z,Hao X

    update_date:2009-03-15 00:00:00

  • In silico and biological survey of transcription-associated proteins implicated in the transcriptional machinery during the erythrocytic development of Plasmodium falciparum.

    abstract:BACKGROUND:Malaria is the most important parasitic disease in the world with approximately two million people dying every year, mostly due to Plasmodium falciparum infection. During its complex life cycle in the Anopheles vector and human host, the parasite requires the coordinated and modulated expression of diverse s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-11-34

    authors: Bischoff E,Vaquero C

    update_date:2010-01-15 00:00:00

  • Lung transcriptomic clock predicts premature aging in cigarette smoke-exposed mice.

    abstract:BACKGROUND:Lung aging is characterized by a number of structural alterations including fibrosis, chronic inflammation and the alteration of inflammatory cell composition. Chronic exposure to cigarette smoke (CS) is known to induce similar alterations and may contribute to premature lung aging. Additionally, aging and C...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-6712-z

    authors: Choukrallah MA,Hoeng J,Peitsch MC,Martin F

    update_date:2020-04-09 00:00:00

  • RNA-seq analysis reveals genetic response and tolerance mechanisms to ozone exposure in soybean.

    abstract:BACKGROUND:Oxidative stress caused by ground level ozone is a contributor to yield loss in a number of important crop plants. Soybean (Glycine max) is considered to be ozone sensitive, and current research into its response to oxidative stress is limited. To better understand the genetic response in soybean to oxidativ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-1637-7

    authors: Whaley A,Sheridan J,Safari S,Burton A,Burkey K,Schlueter J

    update_date:2015-06-04 00:00:00

  • Identification of microRNA-mRNA modules using microarray data.

    abstract:BACKGROUND:MicroRNAs (miRNAs) are post-transcriptional regulators of mRNA expression and are involved in numerous cellular processes. Consequently, miRNAs are an important component of gene regulatory networks and an improved understanding of miRNAs will further our knowledge of these networks. There is a many-to-many ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-12-138

    authors: Jayaswal V,Lutherborrow M,Ma DD,Yang YH

    update_date:2011-03-06 00:00:00

  • Divergence of the SigB regulon and pathogenesis of the Bacillus cereus sensu lato group.

    abstract:BACKGROUND:The Bacillus cereus sensu lato group currently includes seven species (B. cereus, B. anthracis, B. mycoides, B. pseudomycoides, B. thuringiensis, B. weihenstephanensis and B. cytotoxicus) that recent phylogenetic and phylogenomic analyses suggest are likely a single species, despite their varied phenotypes. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-13-564

    authors: Scott E 2nd,Dyer DW

    update_date:2012-10-22 00:00:00

  • Optimization of cDNA microarrays procedures using criteria that do not rely on external standards.

    abstract:BACKGROUND:The measurement of gene expression using microarray technology is a complicated process in which a large number of factors can be varied. Due to the lack of standard calibration samples such as are used in traditional chemical analysis it may be a problem to evaluate whether changes done to the microarray pr...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-8-377

    authors: Bruland T,Anderssen E,Doseth B,Bergum H,Beisvag V,Laegreid A

    update_date:2007-10-18 00:00:00

  • A comprehensive survey of integron-associated genes present in metagenomes.

    abstract:BACKGROUND:Integrons are genomic elements that mediate horizontal gene transfer by inserting and removing genetic material using site-specific recombination. Integrons are commonly found in bacterial genomes, where they maintain a large and diverse set of genes that plays an important role in adaptation and evolution. ...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-020-06830-5

    authors: Buongermino Pereira M,Österlund T,Eriksson KM,Backhaus T,Axelson-Fisk M,Kristiansson E

    update_date:2020-07-20 00:00:00

  • TREC-IN: gene knock-in genetic tool for genomes cloned in yeast.

    abstract:BACKGROUND:With the development of several new technologies using synthetic biology, it is possible to engineer genetically intractable organisms including Mycoplasma mycoides subspecies capri (Mmc), by cloning the intact bacterial genome in yeast, using the host yeast's genetic tools to modify the cloned genome, and s...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/1471-2164-15-1180

    authors: Chandran S,Noskov VN,Segall-Shapiro TH,Ma L,Whiteis C,Lartigue C,Jores J,Vashee S,Chuang RY

    update_date:2014-12-24 00:00:00

  • Identification of Nicotiana benthamiana microRNAs and their targets using high throughput sequencing and degradome analysis.

    abstract:BACKGROUND:Nicotiana benthamiana is a widely used model plant species for research on plant-pathogen interactions as well as other areas of plant science. It can be easily transformed or agroinfiltrated, therefore it is commonly used in studies requiring protein localization, interaction, or plant-based systems for pro...

    journal_title:BMC genomics

    pub_type: Journal Article

    doi:10.1186/s12864-015-2209-6

    authors: Baksa I,Nagy T,Barta E,Havelda Z,Várallyay É,Silhavy D,Burgyán J,Szittya G

    update_date:2015-12-01 00:00:00