Comparison of feature selection and classification for MALDI-MS data.

Abstract:

INTRODUCTION:In the classification of Mass Spectrometry (MS) proteomics data, peak detection, feature selection, and learning classifiers are critical to classification accuracy. To better understand which methods are more accurate when classifying data, some publicly available peak detection algorithms for Matrix assisted Laser Desorption Ionization Mass Spectrometry (MALDI-MS) data were recently compared; however, the issue of different feature selection methods and different classification models as they relate to classification performance has not been addressed. With the application of intelligent computing, much progress has been made in the development of feature selection methods and learning classifiers for the analysis of high-throughput biological data. The main objective of this paper is to compare the methods of feature selection and different learning classifiers when applied to MALDI-MS data and to provide a subsequent reference for the analysis of MS proteomics data. RESULTS:We compared a well-known method of feature selection, Support Vector Machine Recursive Feature Elimination (SVMRFE), and a recently developed method, Gradient based Leave-one-out Gene Selection (GLGS) that effectively performs microarray data analysis. We also compared several learning classifiers including K-Nearest Neighbor Classifier (KNNC), Naïve Bayes Classifier (NBC), Nearest Mean Scaled Classifier (NMSC), uncorrelated normal based quadratic Bayes Classifier recorded as UDC, Support Vector Machines, and a distance metric learning for Large Margin Nearest Neighbor classifier (LMNN) based on Mahanalobis distance. To compare, we conducted a comprehensive experimental study using three types of MALDI-MS data. CONCLUSION:Regarding feature selection, SVMRFE outperformed GLGS in classification. As for the learning classifiers, when classification models derived from the best training were compared, SVMs performed the best with respect to the expected testing accuracy. However, the distance metric learning LMNN outperformed SVMs and other classifiers on evaluating the best testing. In such cases, the optimum classification model based on LMNN is worth investigating for future study.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Liu Q,Sung AH,Qiao M,Chen Z,Yang JY,Yang MQ,Huang X,Deng Y

doi

10.1186/1471-2164-10-S1-S3

subject

Has Abstract

pub_date

2009-07-07 00:00:00

pages

S3

issn

1471-2164

pii

1471-2164-10-S1-S3

journal_volume

10 Suppl 1

pub_type

杂志文章
  • Genome-wide association study of eating and cooking qualities in different subpopulations of rice (Oryza sativa L.).

    abstract:BACKGROUND:Starch and protein are two major components of polished rice, and the amylose and protein contents affect eating and cooking qualities (ECQs). In the present study, genome-wide association study with high-quality re-sequencing data was performed for 10 ECQs in a panel of 227 non-glutinous rice accessions and...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3000-z

    authors: Xu F,Bao J,He Q,Park YJ

    更新日期:2016-08-20 00:00:00

  • The prediction of protein-protein interaction networks in rice blast fungus.

    abstract:BACKGROUND:Protein-protein interaction (PPI) maps are useful tools for investigating the cellular functions of genes. Thus far, large-scale PPI mapping projects have not been implemented for the rice blast fungus Magnaporthe grisea, which is responsible for the most severe rice disease. Inspired by recent advances in P...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-519

    authors: He F,Zhang Y,Chen H,Zhang Z,Peng YL

    更新日期:2008-11-02 00:00:00

  • Involvement of potential pathways in malignant transformation from oral leukoplakia to oral squamous cell carcinoma revealed by proteomic analysis.

    abstract:BACKGROUND:Oral squamous cell carcinoma (OSCC) is one of the most common forms of cancer associated with the presence of precancerous oral leukoplakia. Given the poor prognosis associated with oral leukoplakia, and the difficulties in distinguishing it from cancer lesions, there is an urgent need to elucidate the molec...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-383

    authors: Wang Z,Feng X,Liu X,Jiang L,Zeng X,Ji N,Li J,Li L,Chen Q

    更新日期:2009-08-19 00:00:00

  • Hierarchical transcriptional control regulates Plasmodium falciparum sexual differentiation.

    abstract:BACKGROUND:Malaria pathogenesis relies on sexual gametocyte forms of the malaria parasite to be transmitted between the infected human and the mosquito host but the molecular mechanisms controlling gametocytogenesis remains poorly understood. Here we provide a high-resolution transcriptome of Plasmodium falciparum as i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-6322-9

    authors: van Biljon R,van Wyk R,Painter HJ,Orchard L,Reader J,Niemand J,Llinás M,Birkholtz LM

    更新日期:2019-12-03 00:00:00

  • Distinct gene loci control the host response to influenza H1N1 virus infection in a time-dependent manner.

    abstract:BACKGROUND:There is strong but mostly circumstantial evidence that genetic factors modulate the severity of influenza infection in humans. Using genetically diverse but fully inbred strains of mice it has been shown that host sequence variants have a strong influence on the severity of influenza A disease progression. ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-411

    authors: Nedelko T,Kollmus H,Klawonn F,Spijker S,Lu L,Heßman M,Alberts R,Williams RW,Schughart K

    更新日期:2012-08-20 00:00:00

  • Testis-Specific GTPase (TSG): An oligomeric protein.

    abstract:BACKGROUND:Ras-related proteins in brain (Rab)-family proteins are key members of the membrane trafficking pathway in cells. In addition, these proteins have been identified to have diverse functions such as cross-talking with different kinases and playing a role in cellular signaling. However, only a few Rab proteins ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3145-9

    authors: Kumar S,Lee HJ,Park HS,Lee K

    更新日期:2016-10-10 00:00:00

  • High-throughput sequencing of Astrammina rara: sampling the giant genome of a giant foraminiferan protist.

    abstract:BACKGROUND:Foraminiferan protists, which are significant players in most marine ecosystems, are also genetic innovators, harboring unique modifications to proteins that make up the basic eukaryotic cell machinery. Despite their ecological and evolutionary importance, foraminiferan genomes are poorly understood due to t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-169

    authors: Habura A,Hou Y,Reilly AA,Bowser SS

    更新日期:2011-03-31 00:00:00

  • Comparative mitogenome analyses uncover mitogenome features and phylogenetic implications of the subfamily Cobitinae.

    abstract:BACKGROUND:Loaches of Cobitinae, widely distributed in Eurasian continent, have high economic, ornamental and scientific value. However, the phylogeny of Cobitinae fishes within genera or family level remains complex and controversial. Up to now, about 60 Cobitinae mitogenomes had been deposited in GenBank, but their i...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07360-w

    authors: Yu P,Zhou L,Yang WT,Miao LJ,Li Z,Zhang XJ,Wang Y,Gui JF

    更新日期:2021-01-14 00:00:00

  • Complete chloroplast genome sequence of Barleria prionitis, comparative chloroplast genomics and phylogenetic relationships among Acanthoideae.

    abstract:BACKGROUND:The plastome of medicinal and endangered species in Kingdom of Saudi Arabia, Barleria prionitis was sequenced. The plastome was compared with that of seven Acanthoideae species in order to describe the plastome, spot the microsatellite, assess the dissimilarities within the sampled plastomes and to infer the...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06798-2

    authors: Alzahrani DA,Yaradua SS,Albokhari EJ,Abba A

    更新日期:2020-06-06 00:00:00

  • Differential representation of sunflower ESTs in enriched organ-specific cDNA libraries in a small scale sequencing project.

    abstract:BACKGROUND:Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of sm...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-4-40

    authors: Fernández P,Paniego N,Lew S,Hopp HE,Heinz RA

    更新日期:2003-09-30 00:00:00

  • Co-expression network analysis of duplicate genes in maize (Zea mays L.) reveals no subgenome bias.

    abstract:BACKGROUND:Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant s...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3194-0

    authors: Li L,Briskine R,Schaefer R,Schnable PS,Myers CL,Flagel LE,Springer NM,Muehlbauer GJ

    更新日期:2016-11-04 00:00:00

  • Divergence in function and expression of the NOD26-like intrinsic proteins in plants.

    abstract:BACKGROUND:NOD26-like intrinsic proteins (NIPs) that belong to the aquaporin superfamily are plant-specific and exhibit a similar three-dimensional structure. Experimental evidences however revealed that functional divergence should have extensively occurred among NIP genes. It is therefore intriguing to further invest...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-313

    authors: Liu Q,Wang H,Zhang Z,Wu J,Feng Y,Zhu Z

    更新日期:2009-07-15 00:00:00

  • Inferring microbial interaction networks from metagenomic data using SgLV-EKF algorithm.

    abstract:BACKGROUND:Inferring the microbial interaction networks (MINs) and modeling their dynamics are critical in understanding the mechanisms of the bacterial ecosystem and designing antibiotic and/or probiotic therapies. Recently, several approaches were proposed to infer MINs using the generalized Lotka-Volterra (gLV) mode...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3605-x

    authors: Alshawaqfeh M,Serpedin E,Younes AB

    更新日期:2017-03-27 00:00:00

  • In silico identification of genes involved in selenium metabolism: evidence for a third selenium utilization trait.

    abstract:BACKGROUND:Selenium (Se) is a trace element that occurs in proteins in the form of selenocysteine (Sec) and in tRNAs in the form of selenouridine (SeU). Selenophosphate synthetase (SelD) is required for both utilization traits. However, previous research also revealed SelDs in two organisms lacking Sec and SeU, suggest...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-251

    authors: Zhang Y,Turanov AA,Hatfield DL,Gladyshev VN

    更新日期:2008-05-29 00:00:00

  • Unlocking the bovine genome.

    abstract::The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...

    journal_title:BMC genomics

    pub_type: 社论

    doi:10.1186/1471-2164-10-193

    authors: Tellam RL,Lemay DG,Van Tassell CP,Lewin HA,Worley KC,Elsik CG

    更新日期:2009-04-24 00:00:00

  • Systems genomics evaluation of the SH-SY5Y neuroblastoma cell line as a model for Parkinson's disease.

    abstract:BACKGROUND:The human neuroblastoma cell line, SH-SY5Y, is a commonly used cell line in studies related to neurotoxicity, oxidative stress, and neurodegenerative diseases. Although this cell line is often used as a cellular model for Parkinson's disease, the relevance of this cellular model in the context of Parkinson's...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-1154

    authors: Krishna A,Biryukov M,Trefois C,Antony PM,Hussong R,Lin J,Heinäniemi M,Glusman G,Köglsberger S,Boyd O,van den Berg BH,Linke D,Huang D,Wang K,Hood L,Tholey A,Schneider R,Galas DJ,Balling R,May P

    更新日期:2014-12-20 00:00:00

  • Genomes of Helicobacter pylori from native Peruvians suggest admixture of ancestral and modern lineages and reveal a western type cag-pathogenicity island.

    abstract:BACKGROUND:Helicobacter pylori is presumed to be co-evolved with its human host and is a highly diverse gastric pathogen at genetic levels. Ancient origins of H. pylori in the New World are still debatable. It is not clear how different waves of human migrations in South America contributed to the evolution of strain d...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-191

    authors: Devi SM,Ahmed I,Khan AA,Rahman SA,Alvi A,Sechi LA,Ahmed N

    更新日期:2006-07-27 00:00:00

  • Transcriptome analysis of the oil-rich seed of the bioenergy crop Jatropha curcas L.

    abstract:BACKGROUND:To date, oil-rich plants are the main source of biodiesel products. Because concerns have been voiced about the impact of oil-crop cultivation on the price of food commodities, the interest in oil plants not used for food production and amenable to cultivation on non-agricultural land has soared. As a non-fo...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-462

    authors: Costa GG,Cardoso KC,Del Bem LE,Lima AC,Cunha MA,de Campos-Leite L,Vicentini R,Papes F,Moreira RC,Yunes JA,Campos FA,Da Silva MJ

    更新日期:2010-08-06 00:00:00

  • Comparative proteome analysis of Saccharomyces cerevisiae: a global overview of in vivo targets of the yeast activator protein 1.

    abstract:BACKGROUND:The activity of the yeast activator protein 1 (Yap1p) increases under stress conditions, which leads to enhanced transcription of a number of genes encoding protective enzymes or other proteins. To obtain a global overview of changes in expression of Yap1p-targeted proteins, we compared a Yap1p-overexpressin...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-230

    authors: Jun H,Kieselbach T,Jönsson LJ

    更新日期:2012-06-09 00:00:00

  • MRCNN: a deep learning model for regression of genome-wide DNA methylation.

    abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5488-5

    authors: Tian Q,Zou J,Tang J,Fang Y,Yu Z,Fan S

    更新日期:2019-04-04 00:00:00

  • RNAseq analysis of Aspergillus fumigatus in blood reveals a just wait and see resting stage behavior.

    abstract:BACKGROUND:Invasive aspergillosis is started after germination of Aspergillus fumigatus conidia that are inhaled by susceptible individuals. Fungal hyphae can grow in the lung through the epithelial tissue and disseminate hematogenously to invade into other organs. Low fungaemia indicates that fungal elements do not re...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1853-1

    authors: Irmer H,Tarazona S,Sasse C,Olbermann P,Loeffler J,Krappmann S,Conesa A,Braus GH

    更新日期:2015-08-27 00:00:00

  • Common inversion polymorphism at 17q21.31 affects expression of multiple genes in tissue-specific manner.

    abstract:BACKGROUND:Chromosome 17q21.31 contains a common inversion polymorphism of approximately 900 kb in populations with European ancestry. Two divergent MAPT haplotypes, H1 and H2 are described with distinct linkage disequilibrium patterns across the region reflecting the inversion status at this locus. The MAPT H1 haploty...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-13-458

    authors: de Jong S,Chepelev I,Janson E,Strengman E,van den Berg LH,Veldink JH,Ophoff RA

    更新日期:2012-09-06 00:00:00

  • Mycoplasma non-coding RNA: identification of small RNAs and targets.

    abstract:BACKGROUND:Bacterial non-coding RNAs act by base-pairing as regulatory elements in crucial biological processes. We performed the identification of trans-encoded small RNAs (sRNA) from the genomes of Mycoplama hyopneumoniae, Mycoplasma flocculare and Mycoplasma hyorhinis, which are Mycoplasma species that have been ide...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-3061-z

    authors: Siqueira FM,de Morais GL,Higashi S,Beier LS,Breyer GM,de Sá Godinho CP,Sagot MF,Schrank IS,Zaha A,de Vasconcelos AT

    更新日期:2016-10-25 00:00:00

  • Dual RNA-seq transcriptional analysis of wheat roots colonized by Azospirillum brasilense reveals up-regulation of nutrient acquisition and cell cycle genes.

    abstract:BACKGROUND:The rapid growth of the world's population demands an increase in food production that no longer can be reached by increasing amounts of nitrogenous fertilizers. Plant growth promoting bacteria (PGPB) might be an alternative to increase nitrogenous use efficiency (NUE) in important crops such wheat. Azospiri...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-378

    authors: Camilios-Neto D,Bonato P,Wassem R,Tadra-Sfeir MZ,Brusamarello-Santos LC,Valdameri G,Donatti L,Faoro H,Weiss VA,Chubatsu LS,Pedrosa FO,Souza EM

    更新日期:2014-05-16 00:00:00

  • Refinement of Bos taurus sequence assembly based on BAC-FISH experiments.

    abstract:BACKGROUND:The sequencing of the cow genome was recently published (Btau_4.0 assembly). A second, alternate cow genome assembly (UMD2), based on the same raw sequence data, was also published. The two assemblies have been subsequently updated to Btau_4.2 and UMD3.1, respectively. RESULTS:We compared the Btau_4.2 and U...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-639

    authors: Partipilo G,D'Addabbo P,Lacalandra GM,Liu GE,Rocchi M

    更新日期:2011-12-30 00:00:00

  • Universal Reference RNA as a standard for microarray experiments.

    abstract:BACKGROUND:Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are varia...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-5-20

    authors: Novoradovskaya N,Whitfield ML,Basehore LS,Novoradovsky A,Pesich R,Usary J,Karaca M,Wong WK,Aprelikova O,Fero M,Perou CM,Botstein D,Braman J

    更新日期:2004-03-09 00:00:00

  • High-throughput proteomic profiling of the fish liver following bacterial infection.

    abstract:BACKGROUND:High-throughput proteomics was used to determine the role of the fish liver in defense responses to bacterial infection. This was done using a rainbow trout (Oncorhynchus mykiss) model following infection with Aeromonas salmonicida, the causative agent of furunculosis. The vertebrate liver has multifaceted f...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-5092-0

    authors: Causey DR,Pohl MAN,Stead DA,Martin SAM,Secombes CJ,Macqueen DJ

    更新日期:2018-10-01 00:00:00

  • Monophyly of clade III nematodes is not supported by phylogenetic analysis of complete mitochondrial genome sequences.

    abstract:BACKGROUND:The orders Ascaridida, Oxyurida, and Spirurida represent major components of zooparasitic nematode diversity, including many species of veterinary and medical importance. Phylum-wide nematode phylogenetic hypotheses have mainly been based on nuclear rDNA sequences, but more recently complete mitochondrial (m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-392

    authors: Park JK,Sultana T,Lee SH,Kang S,Kim HK,Min GS,Eom KS,Nadler SA

    更新日期:2011-08-03 00:00:00

  • The ubiquitin-conjugating enzyme HR6B is required for maintenance of X chromosome silencing in mouse spermatocytes and spermatids.

    abstract:BACKGROUND:The ubiquitin-conjugating enzyme HR6B is required for spermatogenesis in mouse. Loss of HR6B results in aberrant histone modification patterns on the trancriptionally silenced X and Y chromosomes (XY body) and on centromeric chromatin in meiotic prophase. We studied the relationship between these chromatin m...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-367

    authors: Mulugeta Achame E,Wassenaar E,Hoogerbrugge JW,Sleddens-Linkels E,Ooms M,Sun ZW,van IJcken WF,Grootegoed JA,Baarends WM

    更新日期:2010-06-10 00:00:00

  • A multi-treatment experimental system to examine photosynthetic differentiation in the maize leaf.

    abstract:BACKGROUND:The establishment of C4 photosynthesis in maize is associated with differential accumulation of gene transcripts and proteins between bundle sheath and mesophyll photosynthetic cell types. We have physically separated photosynthetic cell types in the leaf blade to characterize differences in gene expression ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-12

    authors: Sawers RJ,Liu P,Anufrikova K,Hwang JT,Brutnell TP

    更新日期:2007-01-09 00:00:00