Abstract:
INTRODUCTION:In the classification of Mass Spectrometry (MS) proteomics data, peak detection, feature selection, and learning classifiers are critical to classification accuracy. To better understand which methods are more accurate when classifying data, some publicly available peak detection algorithms for Matrix assisted Laser Desorption Ionization Mass Spectrometry (MALDI-MS) data were recently compared; however, the issue of different feature selection methods and different classification models as they relate to classification performance has not been addressed. With the application of intelligent computing, much progress has been made in the development of feature selection methods and learning classifiers for the analysis of high-throughput biological data. The main objective of this paper is to compare the methods of feature selection and different learning classifiers when applied to MALDI-MS data and to provide a subsequent reference for the analysis of MS proteomics data. RESULTS:We compared a well-known method of feature selection, Support Vector Machine Recursive Feature Elimination (SVMRFE), and a recently developed method, Gradient based Leave-one-out Gene Selection (GLGS) that effectively performs microarray data analysis. We also compared several learning classifiers including K-Nearest Neighbor Classifier (KNNC), Naïve Bayes Classifier (NBC), Nearest Mean Scaled Classifier (NMSC), uncorrelated normal based quadratic Bayes Classifier recorded as UDC, Support Vector Machines, and a distance metric learning for Large Margin Nearest Neighbor classifier (LMNN) based on Mahanalobis distance. To compare, we conducted a comprehensive experimental study using three types of MALDI-MS data. CONCLUSION:Regarding feature selection, SVMRFE outperformed GLGS in classification. As for the learning classifiers, when classification models derived from the best training were compared, SVMs performed the best with respect to the expected testing accuracy. However, the distance metric learning LMNN outperformed SVMs and other classifiers on evaluating the best testing. In such cases, the optimum classification model based on LMNN is worth investigating for future study.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Liu Q,Sung AH,Qiao M,Chen Z,Yang JY,Yang MQ,Huang X,Deng Ydoi
10.1186/1471-2164-10-S1-S3subject
Has Abstractpub_date
2009-07-07 00:00:00pages
S3issn
1471-2164pii
1471-2164-10-S1-S3journal_volume
10 Suppl 1pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Starch and protein are two major components of polished rice, and the amylose and protein contents affect eating and cooking qualities (ECQs). In the present study, genome-wide association study with high-quality re-sequencing data was performed for 10 ECQs in a panel of 227 non-glutinous rice accessions and...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3000-z
更新日期:2016-08-20 00:00:00
abstract:BACKGROUND:Protein-protein interaction (PPI) maps are useful tools for investigating the cellular functions of genes. Thus far, large-scale PPI mapping projects have not been implemented for the rice blast fungus Magnaporthe grisea, which is responsible for the most severe rice disease. Inspired by recent advances in P...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-519
更新日期:2008-11-02 00:00:00
abstract:BACKGROUND:Oral squamous cell carcinoma (OSCC) is one of the most common forms of cancer associated with the presence of precancerous oral leukoplakia. Given the poor prognosis associated with oral leukoplakia, and the difficulties in distinguishing it from cancer lesions, there is an urgent need to elucidate the molec...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-383
更新日期:2009-08-19 00:00:00
abstract:BACKGROUND:Malaria pathogenesis relies on sexual gametocyte forms of the malaria parasite to be transmitted between the infected human and the mosquito host but the molecular mechanisms controlling gametocytogenesis remains poorly understood. Here we provide a high-resolution transcriptome of Plasmodium falciparum as i...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6322-9
更新日期:2019-12-03 00:00:00
abstract:BACKGROUND:There is strong but mostly circumstantial evidence that genetic factors modulate the severity of influenza infection in humans. Using genetically diverse but fully inbred strains of mice it has been shown that host sequence variants have a strong influence on the severity of influenza A disease progression. ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-411
更新日期:2012-08-20 00:00:00
abstract:BACKGROUND:Ras-related proteins in brain (Rab)-family proteins are key members of the membrane trafficking pathway in cells. In addition, these proteins have been identified to have diverse functions such as cross-talking with different kinases and playing a role in cellular signaling. However, only a few Rab proteins ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3145-9
更新日期:2016-10-10 00:00:00
abstract:BACKGROUND:Foraminiferan protists, which are significant players in most marine ecosystems, are also genetic innovators, harboring unique modifications to proteins that make up the basic eukaryotic cell machinery. Despite their ecological and evolutionary importance, foraminiferan genomes are poorly understood due to t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-169
更新日期:2011-03-31 00:00:00
abstract:BACKGROUND:Loaches of Cobitinae, widely distributed in Eurasian continent, have high economic, ornamental and scientific value. However, the phylogeny of Cobitinae fishes within genera or family level remains complex and controversial. Up to now, about 60 Cobitinae mitogenomes had been deposited in GenBank, but their i...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07360-w
更新日期:2021-01-14 00:00:00
abstract:BACKGROUND:The plastome of medicinal and endangered species in Kingdom of Saudi Arabia, Barleria prionitis was sequenced. The plastome was compared with that of seven Acanthoideae species in order to describe the plastome, spot the microsatellite, assess the dissimilarities within the sampled plastomes and to infer the...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-06798-2
更新日期:2020-06-06 00:00:00
abstract:BACKGROUND:Subtractive hybridization methods are valuable tools for identifying differentially regulated genes in a given tissue avoiding redundant sequencing of clones representing the same expressed genes, maximizing detection of low abundant transcripts and thus, affecting the efficiency and cost effectiveness of sm...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-4-40
更新日期:2003-09-30 00:00:00
abstract:BACKGROUND:Gene duplication is prevalent in many species and can result in coding and regulatory divergence. Gene duplications can be classified as whole genome duplication (WGD), tandem and inserted (non-syntenic). In maize, WGD resulted in the subgenomes maize1 and maize2, of which maize1 is considered the dominant s...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3194-0
更新日期:2016-11-04 00:00:00
abstract:BACKGROUND:NOD26-like intrinsic proteins (NIPs) that belong to the aquaporin superfamily are plant-specific and exhibit a similar three-dimensional structure. Experimental evidences however revealed that functional divergence should have extensively occurred among NIP genes. It is therefore intriguing to further invest...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-313
更新日期:2009-07-15 00:00:00
abstract:BACKGROUND:Inferring the microbial interaction networks (MINs) and modeling their dynamics are critical in understanding the mechanisms of the bacterial ecosystem and designing antibiotic and/or probiotic therapies. Recently, several approaches were proposed to infer MINs using the generalized Lotka-Volterra (gLV) mode...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-3605-x
更新日期:2017-03-27 00:00:00
abstract:BACKGROUND:Selenium (Se) is a trace element that occurs in proteins in the form of selenocysteine (Sec) and in tRNAs in the form of selenouridine (SeU). Selenophosphate synthetase (SelD) is required for both utilization traits. However, previous research also revealed SelDs in two organisms lacking Sec and SeU, suggest...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-251
更新日期:2008-05-29 00:00:00
abstract::The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...
journal_title:BMC genomics
pub_type: 社论
doi:10.1186/1471-2164-10-193
更新日期:2009-04-24 00:00:00
abstract:BACKGROUND:The human neuroblastoma cell line, SH-SY5Y, is a commonly used cell line in studies related to neurotoxicity, oxidative stress, and neurodegenerative diseases. Although this cell line is often used as a cellular model for Parkinson's disease, the relevance of this cellular model in the context of Parkinson's...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-1154
更新日期:2014-12-20 00:00:00
abstract:BACKGROUND:Helicobacter pylori is presumed to be co-evolved with its human host and is a highly diverse gastric pathogen at genetic levels. Ancient origins of H. pylori in the New World are still debatable. It is not clear how different waves of human migrations in South America contributed to the evolution of strain d...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-191
更新日期:2006-07-27 00:00:00
abstract:BACKGROUND:To date, oil-rich plants are the main source of biodiesel products. Because concerns have been voiced about the impact of oil-crop cultivation on the price of food commodities, the interest in oil plants not used for food production and amenable to cultivation on non-agricultural land has soared. As a non-fo...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-462
更新日期:2010-08-06 00:00:00
abstract:BACKGROUND:The activity of the yeast activator protein 1 (Yap1p) increases under stress conditions, which leads to enhanced transcription of a number of genes encoding protective enzymes or other proteins. To obtain a global overview of changes in expression of Yap1p-targeted proteins, we compared a Yap1p-overexpressin...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-230
更新日期:2012-06-09 00:00:00
abstract:BACKGROUND:Determination of genome-wide DNA methylation is significant for both basic research and drug development. As a key epigenetic modification, this biochemical process can modulate gene expression to influence the cell differentiation which can possibly lead to cancer. Due to the involuted biochemical mechanism...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5488-5
更新日期:2019-04-04 00:00:00
abstract:BACKGROUND:Invasive aspergillosis is started after germination of Aspergillus fumigatus conidia that are inhaled by susceptible individuals. Fungal hyphae can grow in the lung through the epithelial tissue and disseminate hematogenously to invade into other organs. Low fungaemia indicates that fungal elements do not re...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1853-1
更新日期:2015-08-27 00:00:00
abstract:BACKGROUND:Chromosome 17q21.31 contains a common inversion polymorphism of approximately 900 kb in populations with European ancestry. Two divergent MAPT haplotypes, H1 and H2 are described with distinct linkage disequilibrium patterns across the region reflecting the inversion status at this locus. The MAPT H1 haploty...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-458
更新日期:2012-09-06 00:00:00
abstract:BACKGROUND:Bacterial non-coding RNAs act by base-pairing as regulatory elements in crucial biological processes. We performed the identification of trans-encoded small RNAs (sRNA) from the genomes of Mycoplama hyopneumoniae, Mycoplasma flocculare and Mycoplasma hyorhinis, which are Mycoplasma species that have been ide...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3061-z
更新日期:2016-10-25 00:00:00
abstract:BACKGROUND:The rapid growth of the world's population demands an increase in food production that no longer can be reached by increasing amounts of nitrogenous fertilizers. Plant growth promoting bacteria (PGPB) might be an alternative to increase nitrogenous use efficiency (NUE) in important crops such wheat. Azospiri...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-378
更新日期:2014-05-16 00:00:00
abstract:BACKGROUND:The sequencing of the cow genome was recently published (Btau_4.0 assembly). A second, alternate cow genome assembly (UMD2), based on the same raw sequence data, was also published. The two assemblies have been subsequently updated to Btau_4.2 and UMD3.1, respectively. RESULTS:We compared the Btau_4.2 and U...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-639
更新日期:2011-12-30 00:00:00
abstract:BACKGROUND:Obtaining reliable and reproducible two-color microarray gene expression data is critically important for understanding the biological significance of perturbations made on a cellular system. Microarray design, RNA preparation and labeling, hybridization conditions and data acquisition and analysis are varia...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-5-20
更新日期:2004-03-09 00:00:00
abstract:BACKGROUND:High-throughput proteomics was used to determine the role of the fish liver in defense responses to bacterial infection. This was done using a rainbow trout (Oncorhynchus mykiss) model following infection with Aeromonas salmonicida, the causative agent of furunculosis. The vertebrate liver has multifaceted f...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5092-0
更新日期:2018-10-01 00:00:00
abstract:BACKGROUND:The orders Ascaridida, Oxyurida, and Spirurida represent major components of zooparasitic nematode diversity, including many species of veterinary and medical importance. Phylum-wide nematode phylogenetic hypotheses have mainly been based on nuclear rDNA sequences, but more recently complete mitochondrial (m...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-392
更新日期:2011-08-03 00:00:00
abstract:BACKGROUND:The ubiquitin-conjugating enzyme HR6B is required for spermatogenesis in mouse. Loss of HR6B results in aberrant histone modification patterns on the trancriptionally silenced X and Y chromosomes (XY body) and on centromeric chromatin in meiotic prophase. We studied the relationship between these chromatin m...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-367
更新日期:2010-06-10 00:00:00
abstract:BACKGROUND:The establishment of C4 photosynthesis in maize is associated with differential accumulation of gene transcripts and proteins between bundle sheath and mesophyll photosynthetic cell types. We have physically separated photosynthetic cell types in the leaf blade to characterize differences in gene expression ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-8-12
更新日期:2007-01-09 00:00:00