Abstract:
BACKGROUND:Dimension reduction is a critical issue in the analysis of microarray data, because the high dimensionality of gene expression microarray data set hurts generalization performance of classifiers. It consists of two types of methods, i.e. feature selection and feature extraction. Principle component analysis (PCA) and partial least squares (PLS) are two frequently used feature extraction methods, and in the previous works, the top several components of PCA or PLS are selected for modeling according to the descending order of eigenvalues. While in this paper, we prove that not all the top features are useful, but features should be selected from all the components by feature selection methods. RESULTS:We demonstrate a framework for selecting feature subsets from all the newly extracted components, leading to reduced classification error rates on the gene expression microarray data. Here we have considered both an unsupervised method PCA and a supervised method PLS for extracting new components, genetic algorithms for feature selection, and support vector machines and k nearest neighbor for classification. Experimental results illustrate that our proposed framework is effective to select feature subsets and to reduce classification error rates. CONCLUSION:Not only the top features newly extracted by PCA or PLS are important, therefore, feature selection should be performed to select subsets from new features to improve generalization performance of classifiers.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Li GZ,Bu HL,Yang MQ,Zeng XQ,Yang JYdoi
10.1186/1471-2164-9-S2-S24subject
Has Abstractpub_date
2008-09-16 00:00:00pages
S24issn
1471-2164pii
1471-2164-9-S2-S24journal_volume
9 Suppl 2pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:Rice blast, caused by the fungal pathogen Magnaporthe grisea, is a devastating disease causing tremendous yield loss in rice production. The public availability of the complete genome sequence of M. grisea provides ample opportunities to understand the molecular mechanism of its pathogenesis on rice plants a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-310
更新日期:2006-12-08 00:00:00
abstract:BACKGROUND:Ocean acidification (OA), a change in ocean chemistry due to the absorption of atmospheric CO2 into surface oceans, challenges biogenic calcification in many marine organisms. Ocean acidification is expected to rapidly progress in polar seas, with regions of the Southern Ocean expected to experience severe O...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4161-0
更新日期:2017-10-23 00:00:00
abstract:BACKGROUND:The neonatal bovine mammary fat pad (MFP) surrounding the mammary parenchyma (PAR) is thought to exert proliferative effects on the PAR through secretion of local modulators of growth induced by systemic hormones. We used bioinformatics to characterize transcriptomics differences between PAR and MFP from app...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-331
更新日期:2010-05-26 00:00:00
abstract:BACKGROUND:RNA sequencing (RNA-seq) and microarrays are two transcriptomics techniques aimed at the quantification of transcribed genes and their isoforms. Here we compare the latest Affymetrix HTA 2.0 microarray with Illumina 2000 RNA-seq for the analysis of patient samples - normal lung epithelium tissue and squamous...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-3819-y
更新日期:2017-06-06 00:00:00
abstract:BACKGROUND:New gene emergence is so far assumed to be mostly driven by duplication and divergence of existing genes. The possibility that entirely new genes could emerge out of the non-coding genomic background was long thought to be almost negligible. With the increasing availability of fully sequenced genomes across ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-117
更新日期:2013-02-21 00:00:00
abstract:BACKGROUND:Bacteria belonging to the Rhodococcus genus play an important role in the degradation of many contaminants, including methylbenzenes. These bacteria, widely distributed in the environment, are known to be a powerhouse of numerous degradation functions, due to their ability to metabolize a wide range of organ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4965-6
更新日期:2018-08-06 00:00:00
abstract:BACKGROUND:The genus Populus includes poplars, aspens and cottonwoods, which will be collectively referred to as poplars hereafter unless otherwise specified. Poplars are the dominant tree species in many forest ecosystems in the Northern Hemisphere and are of substantial economic value in plantation forestry. Poplar h...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-57
更新日期:2008-01-29 00:00:00
abstract:BACKGROUND:Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene d...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1259-0
更新日期:2015-02-14 00:00:00
abstract:BACKGROUND:Midgut invasion, a major bottleneck for malaria parasites transmission is considered as a potential target for vector-parasite interaction studies. New intervention strategies are required to explore the midgut proteins and their potential role in refractoriness for malaria control in Anopheles mosquitoes. T...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4729-3
更新日期:2018-05-08 00:00:00
abstract:BACKGROUND:The ciliate Paramecium bursaria harbors several hundred cells of the green-alga Chlorella sp. in their cytoplasm. Irrespective of the mutual relation between P. bursaria and the symbiotic algae, both cells retain the ability to grow without the partner. They can easily reestablish endosymbiosis when put in c...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-183
更新日期:2014-03-10 00:00:00
abstract:BACKGROUND:The mosquito Anopheles gambiae is a major vector of human malaria. Increasing evidence indicates that blood cells (hemocytes) comprise an essential arm of the mosquito innate immune response against both bacteria and malaria parasites. To further characterize the role of hemocytes in mosquito immunity, we un...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-257
更新日期:2009-06-05 00:00:00
abstract:BACKGROUND:Urechis unicinctus, an echiuran worm inhabiting the U-shaped burrows in the coastal mud flats, is an important commercial and ecological invertebrate in Northeast Asian countries, which has potential applications in the study of animal evolution, coastal sediment improvement and marine drug development. Furt...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-2094-z
更新日期:2015-10-21 00:00:00
abstract:BACKGROUND:Genome-wide association studies (GWAS) have identified many individual genes associated with brain imaging quantitative traits (QTs) in Alzheimer's disease (AD). However single marker level association discovery may not be able to address the underlying biological interactions with disease mechanism. RESULT...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07282-7
更新日期:2020-12-29 00:00:00
abstract:BACKGROUND:Nutritional quality of phytoplankton is a major determinant of the trophic transfer efficiency at the plant-herbivore interface in freshwater food webs. In particular, the phytoplankton's content of the essential polyunsaturated omega-3 fatty acid eicosapentaenoic acid (EPA) has been repeatedly shown to limi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6268-y
更新日期:2019-11-21 00:00:00
abstract:BACKGROUND:Gymnosporangium spp. are fungal plant pathogens causing rust disease and most of them are known to infect two different host plants (heteroecious) with four spore stages (demicyclic). In the present study, we sequenced the transcriptome of G. japonicum teliospores on its host plant Juniperus chinensis and we...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-6099-x
更新日期:2019-10-09 00:00:00
abstract:BACKGROUND:Aspartic proteases are known to play an important role in the biology of nematode parasitism. This role is best characterised in blood-feeding nematodes, where they digest haemoglobin, but they are also likely to play important roles in the biology of nematode parasites that do not feed on blood. In the pres...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-611
更新日期:2009-12-16 00:00:00
abstract:BACKGROUND:Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating l...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5796-9
更新日期:2019-06-03 00:00:00
abstract:BACKGROUND:Many reptiles exhibit temperature-dependent sex determination (TSD). The initial cue in TSD is incubation temperature, unlike genotypic sex determination (GSD) where it is determined by the presence of specific alleles (or genetic loci). We used patterns of gene expression to identify candidates for genes wi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-308
更新日期:2012-07-15 00:00:00
abstract:BACKGROUND:Bayesian mixture models in which the effects of SNP are assumed to come from normal distributions with different variances are attractive for simultaneous genomic prediction and QTL mapping. These models are usually implemented with Monte Carlo Markov Chain (MCMC) sampling, which requires long compute times ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3082-7
更新日期:2016-09-21 00:00:00
abstract:BACKGROUND:Lipopolysaccharide (LPS) from Gram-negative bacteria cause innate immune responses in animals and plants. The molecules involved in LPS signaling in animals are well studied, whereas those in plants are not yet as well documented. Recently, we identified Arabidopsis AtLBR-2, which binds to LPS from Pseudomon...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4372-4
更新日期:2017-12-29 00:00:00
abstract:BACKGROUND:Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defin...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2807-y
更新日期:2016-06-13 00:00:00
abstract:BACKGROUND:Oxidative stress is a common stress encountered by living organisms and is due to an imbalance between intracellular reactive oxygen and nitrogen species (ROS, RNS) and cellular antioxidant defence. To defend themselves against ROS/RNS, bacteria possess a subsystem of detoxification enzymes, which are classi...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-637
更新日期:2008-12-31 00:00:00
abstract:BACKGROUND:Thoroughbred horses are the most expensive domestic animals, and their running ability and knowledge about their muscle-related diseases are important in animal genetics. While the horse reference genome is available, there has been no large-scale functional annotation of the genome using expressed genes der...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-473
更新日期:2012-09-12 00:00:00
abstract:BACKGROUND:Protein phosphorylation by kinases plays crucial roles in various biological processes including signal transduction and tumorigenesis, thus a better understanding of protein phosphorylation events in cells is fundamental for studying protein functions and designing drugs to treat diseases caused by the malf...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-06895-2
更新日期:2020-08-04 00:00:00
abstract:BACKGROUND:The MYB superfamily is one of the most abundant transcription factor (TF) families in plants. MYB proteins include highly conserved N-terminal MYB repeats (1R, R2R3, 3R, and atypical) and various C-terminal sequences that confer extensive functions. However, the functions of most MYB genes are unknown, and h...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1216-y
更新日期:2015-01-23 00:00:00
abstract:BACKGROUND:Streptococcus uberis, a Gram positive bacterial pathogen responsible for a significant proportion of bovine mastitis in commercial dairy herds, colonises multiple body sites of the cow including the gut, genital tract and mammary gland. Comparative analysis of the complete genome sequence of S. uberis strain...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-54
更新日期:2009-01-28 00:00:00
abstract:BACKGROUND:Prolactin is a polypeptide hormone secreted by the anterior pituitary gland that plays an essential role in lactation, tissue growth, and suppressing apoptosis to increase cell survival. Prolactin serves as a key player in many life-critical processes, including immune system and reproduction. Prolactin is a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2785-0
更新日期:2016-06-29 00:00:00
abstract:BACKGROUND:Pharmacological and gene ablation studies have demonstrated the crucial role of the endocrine function of the heart as mediated by the polypeptide hormones ANF and BNP in the maintenance of cardiovascular homeostasis. The importance of these studies lies on the fact that hypertension and chronic congestive h...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-10-254
更新日期:2009-06-01 00:00:00
abstract:BACKGROUND:The Multinational Brassica rapa Genome Sequencing Project (BrGSP) has developed valuable genomic resources, including BAC libraries, BAC-end sequences, genetic and physical maps, and seed BAC sequences for Brassica rapa. An integrated linkage map between the amphidiploid B. napus and diploid B. rapa will fac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-594
更新日期:2010-10-22 00:00:00
abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-179
更新日期:2008-04-18 00:00:00