Abstract:
BACKGROUND:Missing data is an inevitable phenomenon in gene expression microarray experiments due to instrument failure or human error. It has a negative impact on performance of downstream analysis. Technically, most existing approaches suffer from this prevalent problem. Imputation is one of the frequently used methods for processing missing data. Actually many developments have been achieved in the research on estimating missing values. The challenging task is how to improve imputation accuracy for data with a large missing rate. METHODS:In this paper, induced by the thought of collaborative training, we propose a novel hybrid imputation method, called Recursive Mutual Imputation (RMI). Specifically, RMI exploits global correlation information and local structure in the data, captured by two popular methods, Bayesian Principal Component Analysis (BPCA) and Local Least Squares (LLS), respectively. Mutual strategy is implemented by sharing the estimated data sequences at each recursive process. Meanwhile, we consider the imputation sequence based on the number of missing entries in the target gene. Furthermore, a weight based integrated method is utilized in the final assembling step. RESULTS:We evaluate RMI with three state-of-art algorithms (BPCA, LLS, Iterated Local Least Squares imputation (ItrLLS)) on four publicly available microarray datasets. Experimental results clearly demonstrate that RMI significantly outperforms comparative methods in terms of Normalized Root Mean Square Error (NRMSE), especially for datasets with large missing rates and less complete genes. CONCLUSIONS:It is noted that our proposed hybrid imputation approach incorporates both global and local information of microarray genes, which achieves lower NRMSE values against to any single approach only. Besides, this study highlights the need for considering the imputing sequence of missing entries for imputation methods.
journal_name
BMC Genomicsjournal_title
BMC genomicsauthors
Li H,Zhao C,Shao F,Li GZ,Wang Xdoi
10.1186/1471-2164-16-S9-S1subject
Has Abstractpub_date
2015-01-01 00:00:00pages
S1issn
1471-2164pii
1471-2164-16-S9-S1journal_volume
16 Suppl 9pub_type
杂志文章相关文献
BMC GENOMICS文献大全abstract:BACKGROUND:The capacity of European pear fruit (Pyrus communis L.) to ripen after harvest develops during the final stages of growth on the tree. The objective of this study was to characterize changes in 'Bartlett' pear fruit physico-chemical properties and transcription profiles during fruit maturation leading to att...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1939-9
更新日期:2015-10-09 00:00:00
abstract:BACKGROUND:Boar taint is observed in a high proportion of uncastrated male pigs and is characterized by an unpleasant odor/flavor in cooked meat, primarily caused by elevated levels of androstenone and skatole. Androstenone is a steroid produced in the testis in parallel with biosynthesis of other sex steroids like tes...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-362
更新日期:2011-07-13 00:00:00
abstract:BACKGROUND:Due to the predominant usage of short-read sequencing to date, most bacterial genome sequences reported in the last years remain at the draft level. This precludes certain types of analyses, such as the in-depth analysis of genome plasticity. RESULTS:Here we report the finalized genome sequence of the envir...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4301-6
更新日期:2018-01-05 00:00:00
abstract:BACKGROUND:The endothelial PAS domain protein 1 (EPAS1) activates genes that are involved in erythropoiesis and angiogenesis, thus favoring a better delivery of oxygen to the tissues and is a plausible candidate to influence athletic performance. Using innovative statistical methods we compared genotype distributions a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-382
更新日期:2014-05-18 00:00:00
abstract:BACKGROUND:Cell lines are an indispensable tool in biomedical research and often used as surrogates for tissues. Although there are recognized important cellular and transcriptomic differences between cell lines and tissues, a systematic overview of the differences between the regulatory processes of a cell line and th...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-4111-x
更新日期:2017-09-12 00:00:00
abstract:BACKGROUND:Ubiquitination is an important post-translational modification involved in diverse biological processes. Therefore, genomewide representation of the ubiquitination system for a species is important. DESCRIPTION:SCUD is a web-based database for the ubiquitination system in Saccharomyces cerevisiae (Baker's y...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-9-440
更新日期:2008-09-24 00:00:00
abstract:BACKGROUND:Paired-tag sequencing approaches are commonly used for the analysis of genome structure. However, mammalian genomes have a complex organization with a variety of repetitive elements that complicate comprehensive genome-wide analyses. RESULTS:Here, we systematically assessed the utility of paired-end and mat...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-257
更新日期:2013-04-16 00:00:00
abstract:BACKGROUND:The NBS disease-related gene family coordinates the inherent immune system in plants in response to pathogen infections. Previous studies have identified NBS-encoding genes in Pyrus bretschneideri ('Dangshansuli', an Asian pear) and Pyrus communis ('Bartlett', a European pear) genomes, but the patterns of ge...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-020-07226-1
更新日期:2020-11-19 00:00:00
abstract:BACKGROUND:Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. RESULTS:Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic se...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-7-140
更新日期:2006-06-07 00:00:00
abstract:BACKGROUND:The SOS response is a well-known regulatory network present in most bacteria and aimed at addressing DNA damage. It has also been linked extensively to stress-induced mutagenesis, virulence and the emergence and dissemination of antibiotic resistance determinants. Recently, the SOS response has been shown to...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-58
更新日期:2012-02-03 00:00:00
abstract:BACKGROUND:Copy neutral loss of heterozygosity (CN-LOH) refers to a special case of LOH occurring without any resulting loss in copy number. These alterations is sometimes seen in tumors as a way to inactivate a tumor suppressor gene and have been found to be important in several types of cancer. RESULTS:We have used ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-443
更新日期:2011-09-07 00:00:00
abstract:BACKGROUND:Exosomes, endosome-derived membrane microvesicles, contain specific RNA transcripts that are thought to be involved in cell-cell communication. These RNA transcripts have great potential as disease biomarkers. To characterize exosomal RNA profiles systemically, we performed RNA sequencing analysis using thre...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-319
更新日期:2013-05-10 00:00:00
abstract:BACKGROUND:Prolactin is a polypeptide hormone secreted by the anterior pituitary gland that plays an essential role in lactation, tissue growth, and suppressing apoptosis to increase cell survival. Prolactin serves as a key player in many life-critical processes, including immune system and reproduction. Prolactin is a...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2785-0
更新日期:2016-06-29 00:00:00
abstract:BACKGROUND:The human microbiome plays a significant role in maintaining normal physiology. Changes in its composition have been associated with bowel disease, metabolic disorders and atherosclerosis. Sequences of microbial origin have been observed within small RNA sequencing data obtained from blood samples. The aim o...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-15-933
更新日期:2014-10-25 00:00:00
abstract:BACKGROUND:Litchi (Litchi chinensis Sonn.) is an economically important evergreen fruit tree widely cultivated in subtropical areas. Low temperature is absolutely required for floral induction of litchi, but its molecular mechanism is not fully understood. Leaves of litchi played a key role during floral induction and ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-017-3747-x
更新日期:2017-05-10 00:00:00
abstract:BACKGROUND:Sequencing data from The Cancer Genome Atlas (TGCA), the International Cancer Genome Consortium and other research institutes have revealed the presence of genetic alterations in several tumor types, including gastric cancer. These data have been combined into a catalog of significantly mutated genes for eac...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3166-4
更新日期:2016-10-26 00:00:00
abstract:BACKGROUND:Salmonella enterica is a significant foodborne pathogen, which can be transmitted via several distinct routes, and reports on acquisition of antimicrobial resistance (AMR) are increasing. To better understand the association between human Salmonella clinical isolates and the potential environmental/animal re...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-5137-4
更新日期:2018-11-06 00:00:00
abstract:BACKGROUND:The Gram-negative bacterium Chlamydia pneumoniae (Cpn) is the leading intracellular human pathogen responsible for respiratory infections such as pneumonia and bronchitis. Basic and applied research in pathogen biology, especially the elaboration of new mechanism-based anti-pathogen strategies, target discov...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-632
更新日期:2012-11-16 00:00:00
abstract:BACKGROUND:Alternative polyadenylation (APA) has emerged as a pervasive mechanism that contributes to the transcriptome complexity and dynamics of gene regulation. The current tsunami of whole genome poly(A) site data from various conditions generated by 3' end sequencing provides a valuable data source for the study o...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-019-5433-7
更新日期:2019-01-22 00:00:00
abstract:BACKGROUND:Natural accessions of Arabidopsis thaliana are characterized by a high level of phenotypic variation that can be used to investigate the extent and mode of selection on the primary metabolic traits. A collection of 54 A. thaliana natural accession-derived lines were subjected to deep genotyping through Singl...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-11-188
更新日期:2010-03-20 00:00:00
abstract:BACKGROUND:The preferred habitat of a given bacterium can provide a hint of which types of enzymes of potential industrial interest it might produce. These might include enzymes that are stable and active at very high or very low temperatures. Being able to accurately predict this based on a genomic sequence, would thu...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-13-S7-S3
更新日期:2012-01-01 00:00:00
abstract:BACKGROUND:The objective of this research was to investigate the reproducibility of cross-species microarray hybridisation. Comparisons between same- and cross-species hybridisations were also made. Nine hybridisations between a single pig skeletal muscle RNA sample and three human cDNA nylon microarrays were completed...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-3-27
更新日期:2002-09-27 00:00:00
abstract:BACKGROUND:Genome-scale functional genomic screens across large cell line panels provide a rich resource for discovering tumor vulnerabilities that can lead to the next generation of targeted therapies. Their data analysis typically has focused on identifying genes whose knockdown enhances response in various pre-defin...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2807-y
更新日期:2016-06-13 00:00:00
abstract:BACKGROUND:The TALLYHO/Jng (TH) mouse is a polygenic model for obesity and type 2 diabetes first described in the literature in 2001. The origin of the TH strain is an outbred colony of the Theiler Original strain and mice derived from this source were selectively bred for male hyperglycemia establishing an inbred stra...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-3245-6
更新日期:2016-11-11 00:00:00
abstract:BACKGROUND:Maintaining maximum genetic diversity and preserving breed viability in conserved populations necessitates the rigorous evaluation of conservation schemes. Three chicken breeds (Baier Yellow Chicken (BEC), Beijing You Chicken (BYC) and Langshan Chicken (LSC)) are currently in conservation programs in China. ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-018-4973-6
更新日期:2018-08-09 00:00:00
abstract:BACKGROUND:Marek's disease (MD) is a lymphoproliferative disease in chickens caused by Marek's disease virus (MDV) and characterized by T cell lymphoma and infiltration of lymphoid cells into various organs such as liver, spleen, peripheral nerves and muscle. Resistance to MD and disease risk have long been thought to ...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-12-501
更新日期:2011-10-12 00:00:00
abstract:BACKGROUND:Harvest index (HI), the ratio of grain yield to total biomass, is considered as a measure of biological success in partitioning assimilated photosynthate to the harvestable product. While crop production can be dramatically improved by increasing HI, the underlying molecular genetic mechanism of HI in rapese...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1607-0
更新日期:2015-05-12 00:00:00
abstract:BACKGROUND:Horizontal gene transfer has shaped the evolution of the ammonium transporter/ammonia permease gene family. Horizontal transfers of ammonium transporter/ammonia permease genes into the fungi include one transfer from archaea to the filamentous ascomycetes associated with the adaptive radiation of the leotiom...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/1471-2164-14-225
更新日期:2013-04-04 00:00:00
abstract:BACKGROUND:Wood formation affects the chemical and physical properties of wood, and thus affects its utility as a building material or a feedstock for biofuels, pulp and paper. To obtain genome-wide insights on the transcriptome changes and regulatory networks in wood formation, we used high-throughput RNA sequencing t...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-015-1390-y
更新日期:2015-03-10 00:00:00
abstract:BACKGROUND:Post-translational glycosylation of the flagellin protein is relatively common among Gram-negative bacteria, and has been linked to several phenotypes, including flagellar biosynthesis and motility, biofilm formation, host immune evasion and manipulation and virulence. However to date, despite extensive phys...
journal_title:BMC genomics
pub_type: 杂志文章
doi:10.1186/s12864-016-2735-x
更新日期:2016-05-20 00:00:00