Assisted clustering of gene expression data using ANCut.

Abstract:

BACKGROUND:In biomedical research, gene expression profiling studies have been extensively conducted. The analysis of gene expression data has led to a deeper understanding of human genetics as well as practically useful models. Clustering analysis has been a critical component of gene expression data analysis and can reveal the (previously unknown) interconnections among genes. With the high dimensionality of gene expression data, many of the existing clustering methods and results are not as satisfactory. Intuitively, this is caused by "a lack of information". In recent profiling studies, a prominent trend is to collect data on gene expressions as well as their regulators (copy number alteration, microRNA, methylation, etc.) on the same subjects, making it possible to borrow information from other types of omics measurements in gene expression analysis. METHODS:In this study, an ANCut approach is developed, which is built on the regularized estimation and NCut techniques. An effective R code that implements this approach is developed. RESULTS:Simulation shows that the proposed approach outperforms direct competitors. The analysis of TCGA (The Cancer Genome Atlas) data further demonstrates its satisfactory performance. CONCLUSIONS:We propose a more effective clustering analysis of gene expression data, with the assistance of information from regulators. It provides a new venue for analyzing gene expression data based on the assisted analysis strategy.

journal_name

BMC Genomics

journal_title

BMC genomics

authors

Teran Hidalgo SJ,Wu M,Ma S

doi

10.1186/s12864-017-3990-1

subject

Has Abstract

pub_date

2017-08-16 00:00:00

pages

623

issue

1

issn

1471-2164

pii

10.1186/s12864-017-3990-1

journal_volume

18

pub_type

杂志文章
  • Identification of proprotein convertase substrates using genome-wide expression correlation analysis.

    abstract:BACKGROUND:Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and lysine, which results in functional maturation of the target protein. In vit...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-12-618

    authors: Turpeinen H,Kukkurainen S,Pulkkinen K,Kauppila T,Ojala K,Hytönen VP,Pesu M

    更新日期:2011-12-20 00:00:00

  • SNP discovery and genetic mapping using genotyping by sequencing of whole genome genomic DNA from a pea RIL population.

    abstract:BACKGROUND:Progress in genetics and breeding in pea still suffers from the limited availability of molecular resources. SNP markers that can be identified through affordable sequencing processes, without the need for prior genome reduction or a reference genome to assemble sequencing data would allow the discovery and ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-016-2447-2

    authors: Boutet G,Alves Carvalho S,Falque M,Peterlongo P,Lhuillier E,Bouchez O,Lavaud C,Pilet-Nayel ML,Rivière N,Baranger A

    更新日期:2016-02-18 00:00:00

  • Linkage disequilibrium and genome-wide association analysis for anthocyanin pigmentation and fruit color in eggplant.

    abstract:BACKGROUND:The genome-wide association (GWA) approach represents an alternative to biparental linkage mapping for determining the genetic basis of trait variation. Both approaches rely on recombination to re-arrange the genome, and seek to establish correlations between phenotype and genotype. The major advantages of G...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-896

    authors: Cericola F,Portis E,Lanteri S,Toppino L,Barchi L,Acciarri N,Pulcini L,Sala T,Rotino GL

    更新日期:2014-10-14 00:00:00

  • A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

    abstract:BACKGROUND:Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) 'Hongy...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4656-3

    authors: Pilkington SM,Crowhurst R,Hilario E,Nardozza S,Fraser L,Peng Y,Gunaseelan K,Simpson R,Tahir J,Deroles SC,Templeton K,Luo Z,Davy M,Cheng C,McNeilage M,Scaglione D,Liu Y,Zhang Q,Datson P,De Silva N,Gardiner SE,Bas

    更新日期:2018-04-16 00:00:00

  • Genome-wide association analysis identified splicing single nucleotide polymorphism in CFLAR predictive of triptolide chemo-sensitivity.

    abstract:BACKGROUND:Triptolide is a therapeutic diterpenoid derived from the Chinese herb Tripterygium wilfordii Hook f. Triptolide has been shown to induce apoptosis by activation of pro-apoptotic proteins, inhibiting NFkB and c-KIT pathways, suppressing the Jak2 transcription, activating MAPK8/JNK signaling and modulating the...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1614-1

    authors: Chauhan L,Jenkins GD,Bhise N,Feldberg T,Mitra-Ghosh T,Fridley BL,Lamba JK

    更新日期:2015-06-30 00:00:00

  • The intestinal microbiome of fish under starvation.

    abstract:BACKGROUND:Starvation not only affects the nutritional and health status of the animals, but also the microbial composition in the host's intestine. Next-generation sequencing provides a unique opportunity to explore gut microbial communities and their interactions with hosts. However, studies on gut microbiomes have b...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-266

    authors: Xia JH,Lin G,Fu GH,Wan ZY,Lee M,Wang L,Liu XJ,Yue GH

    更新日期:2014-04-05 00:00:00

  • Revealing common disease mechanisms shared by tumors of different tissues of origin through semantic representation of genomic alterations and topic modeling.

    abstract:BACKGROUND:Cancer is a complex disease driven by somatic genomic alterations (SGAs) that perturb signaling pathways and consequently cellular function. Identifying patterns of pathway perturbations would provide insights into common disease mechanisms shared among tumors, which is important for guiding treatment and pr...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3494-z

    authors: Chen V,Paisley J,Lu X

    更新日期:2017-03-14 00:00:00

  • Simple models of genomic variation in human SNP density.

    abstract:BACKGROUND:Descriptive hierarchical Poisson models and population-genetic coalescent mixture models are used to describe the observed variation in single-nucleotide polymorphism (SNP) density from samples of size two across the human genome. RESULTS:Using empirical estimates of recombination rate across the human geno...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-8-146

    authors: Sainudiin R,Clark AG,Durrett RT

    更新日期:2007-06-06 00:00:00

  • De novo transcriptome sequencing in Bixa orellana to identify genes involved in methylerythritol phosphate, carotenoid and bixin biosynthesis.

    abstract:BACKGROUND:Bixin or annatto is a commercially important natural orange-red pigment derived from lycopene that is produced and stored in seeds of Bixa orellana L. An enzymatic pathway for bixin biosynthesis was inferred from homology of putative proteins encoded by differentially expressed seed cDNAs. Some activities we...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-2065-4

    authors: Cárdenas-Conejo Y,Carballo-Uicab V,Lieberman M,Aguilar-Espinosa M,Comai L,Rivera-Madrid R

    更新日期:2015-10-28 00:00:00

  • Transcriptome of the floral transition in Rosa chinensis 'Old Blush'.

    abstract:BACKGROUND:The floral transition plays a vital role in the life of ornamental plants. Despite progress in model plants, the molecular mechanisms of flowering regulation remain unknown in perennial plants. Rosa chinensis 'Old Blush' is a unique plant that can flower continuously year-round. In this study, gene expressio...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3584-y

    authors: Guo X,Yu C,Luo L,Wan H,Zhen N,Xu T,Tan J,Pan H,Zhang Q

    更新日期:2017-02-23 00:00:00

  • Small RNAs from plants, bacteria and fungi within the order Hypocreales are ubiquitous in human plasma.

    abstract:BACKGROUND:The human microbiome plays a significant role in maintaining normal physiology. Changes in its composition have been associated with bowel disease, metabolic disorders and atherosclerosis. Sequences of microbial origin have been observed within small RNA sequencing data obtained from blood samples. The aim o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-15-933

    authors: Beatty M,Guduric-Fuchs J,Brown E,Bridgett S,Chakravarthy U,Hogg RE,Simpson DA

    更新日期:2014-10-25 00:00:00

  • COMUS: Clinician-Oriented locus-specific MUtation detection and deposition System.

    abstract:BACKGROUND:A disease-causing mutation refers to a heritable genetic change that is associated with a specific phenotype (disease). The detection of a mutation from a patient's sample is critical for the diagnosis, treatment, and prognosis of the disease. There are numerous databases and applications with which to archi...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-10-S3-S35

    authors: Jho S,Kim BC,Ghang H,Kim JH,Park D,Kim HM,Jung SY,Yoo KY,Kim HJ,Lee S,Bhak J

    更新日期:2009-12-03 00:00:00

  • Gene expression analyses in Atlantic salmon challenged with infectious salmon anemia virus reveal differences between individuals with early, intermediate and late mortality.

    abstract:BACKGROUND:Infectious salmon anemia virus (ISAV) causes a multisystemic disease responsible for severe losses in salmon aquaculture. Better understanding of factors that explain variations in resistance between individuals and families is essential for development of strategies for disease control. To approach this, we...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-9-179

    authors: Jørgensen SM,Afanasyev S,Krasnov A

    更新日期:2008-04-18 00:00:00

  • Genome-wide binding of the orphan nuclear receptor TR4 suggests its general role in fundamental biological processes.

    abstract:BACKGROUND:The orphan nuclear receptor TR4 (human testicular receptor 4 or NR2C2) plays a pivotal role in a variety of biological and metabolic processes. With no known ligand and few known target genes, the mode of TR4 function was unclear. RESULTS:We report the first genome-wide identification and characterization o...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-689

    authors: O'Geen H,Lin YH,Xu X,Echipare L,Komashko VM,He D,Frietze S,Tanabe O,Shi L,Sartor MA,Engel JD,Farnham PJ

    更新日期:2010-12-02 00:00:00

  • Analysis of the dermatophyte Trichophyton rubrum expressed sequence tags.

    abstract:BACKGROUND:Dermatophytes are the primary causative agent of dermatophytoses, a disease that affects billions of individuals worldwide. Trichophyton rubrum is the most common of the superficial fungi. Although T. rubrum is a recognized pathogen for humans, little is known about how its transcriptional pattern is related...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-255

    authors: Wang L,Ma L,Leng W,Liu T,Yu L,Yang J,Yang L,Zhang W,Zhang Q,Dong J,Xue Y,Zhu Y,Xu X,Wan Z,Ding G,Yu F,Tu K,Li Y,Li R,Shen Y,Jin Q

    更新日期:2006-10-11 00:00:00

  • Protein acetylation in mitochondria plays critical functions in the pathogenesis of fatty liver disease.

    abstract:BACKGROUND:Fatty liver is a high incidence of perinatal disease in dairy cows caused by negative energy balance, which seriously threatens the postpartum health and milk production. It has been reported that lysine acetylation plays an important role in substance and energy metabolism. Predictably, most metabolic proce...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06837-y

    authors: Le-Tian Z,Cheng-Zhang H,Xuan Z,Zhang Q,Zhen-Gui Y,Qing-Qing W,Sheng-Xuan W,Zhong-Jin X,Ran-Ran L,Ting-Jun L,Zhong-Qu S,Zhong-Hua W,Ke-Rong S

    更新日期:2020-06-26 00:00:00

  • The Epc-N domain: a predicted protein-protein interaction domain found in select chromatin associated proteins.

    abstract:BACKGROUND:An underlying tenet of the epigenetic code hypothesis is the existence of protein domains that can recognize various chromatin structures. To date, two major candidates have emerged: (i) the bromodomain, which can recognize certain acetylation marks and (ii) the chromodomain, which can recognize certain meth...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-7-6

    authors: Perry J

    更新日期:2006-01-16 00:00:00

  • Sequence diversity and differential expression of major phenylpropanoid-flavonoid biosynthetic genes among three mango varieties.

    abstract:BACKGROUND:Mango fruits contain a broad spectrum of phenolic compounds which impart potential health benefits; their biosynthesis is catalysed by enzymes in the phenylpropanoid-flavonoid (PF) pathway. The aim of this study was to reveal the variability in genes involved in the PF pathway in three different mango variet...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1784-x

    authors: Hoang VL,Innes DJ,Shaw PN,Monteith GR,Gidley MJ,Dietzgen RG

    更新日期:2015-07-30 00:00:00

  • Comparative analysis of fungal protein kinases and associated domains.

    abstract:BACKGROUND:Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a c...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-133

    authors: Kosti I,Mandel-Gutfreund Y,Glaser F,Horwitz BA

    更新日期:2010-02-24 00:00:00

  • Proteomic analysis of Citrus sinensis roots and leaves in response to long-term magnesium-deficiency.

    abstract:BACKGROUND:Magnesium (Mg)-deficiency is frequently observed in Citrus plantations and is responsible for the loss of productivity and poor fruit quality. Knowledge on the effects of Mg-deficiency on upstream targets is scarce. Seedlings of 'Xuegan' [Citrus sinensis (L.) Osbeck] were irrigated with Mg-deficient (0 mM Mg...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-015-1462-z

    authors: Peng HY,Qi YP,Lee J,Yang LT,Guo P,Jiang HX,Chen LS

    更新日期:2015-03-31 00:00:00

  • Characterization of SR3 reveals abundance of non-LTR retrotransposons of the RTE clade in the genome of the human blood fluke, Schistosoma mansoni.

    abstract:BACKGROUND:It is becoming apparent that perhaps as much as half of the genome of the human blood fluke Schistosoma mansoni is constituted of mobile genetic element-related sequences. Non-long terminal repeat (LTR) retrotransposons, related to the LINE elements of mammals, comprise much of this repetitive component of t...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-6-154

    authors: Laha T,Kewgrai N,Loukas A,Brindley PJ

    更新日期:2005-11-04 00:00:00

  • RNA sequencing and transcriptome arrays analyses show opposing results for alternative splicing in patient derived samples.

    abstract:BACKGROUND:RNA sequencing (RNA-seq) and microarrays are two transcriptomics techniques aimed at the quantification of transcribed genes and their isoforms. Here we compare the latest Affymetrix HTA 2.0 microarray with Illumina 2000 RNA-seq for the analysis of patient samples - normal lung epithelium tissue and squamous...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-017-3819-y

    authors: Nazarov PV,Muller A,Kaoma T,Nicot N,Maximo C,Birembaut P,Tran NL,Dittmar G,Vallar L

    更新日期:2017-06-06 00:00:00

  • Blood-based epigenetic estimators of chronological age in human adults using DNA methylation data from the Illumina MethylationEPIC array.

    abstract:BACKGROUND:Epigenetic clocks have been recognized for their precise prediction of chronological age, age-related diseases, and all-cause mortality. Existing epigenetic clocks are based on CpGs from the Illumina HumanMethylation450 BeadChip (450 K) which has now been replaced by the latest platform, Illumina Methylation...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-07168-8

    authors: Lee Y,Haftorn KL,Denault WRP,Nustad HE,Page CM,Lyle R,Lee-Ødegård S,Moen GH,Prasad RB,Groop LC,Sletner L,Sommer C,Magnus MC,Gjessing HK,Harris JR,Magnus P,Håberg SE,Jugessur A,Bohlin J

    更新日期:2020-10-27 00:00:00

  • Codon usage patterns in Chinese bayberry (Myrica rubra) based on RNA-Seq data.

    abstract:BACKGROUND:Codon usage analysis has been a classical topic for decades and has significances for studies of evolution, mRNA translation, and new gene discovery, etc. While the codon usage varies among different members of the plant kingdom, indicating the necessity for species-specific study, this work has mostly been ...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-732

    authors: Feng C,Xu CJ,Wang Y,Liu WL,Yin XR,Li X,Chen M,Chen KS

    更新日期:2013-10-25 00:00:00

  • Heritability and genome-wide association analyses of fasting plasma glucose in Chinese adult twins.

    abstract:BACKGROUND:Currently, diabetes has become one of the leading causes of death worldwide. Fasting plasma glucose (FPG) levels that are higher than optimal, even if below the diagnostic threshold of diabetes, can also lead to increased morbidity and mortality. Here we intend to study the magnitude of the genetic influence...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-020-06898-z

    authors: Wang W,Zhang C,Liu H,Xu C,Duan H,Tian X,Zhang D

    更新日期:2020-07-18 00:00:00

  • ICPD-a new peak detection algorithm for LC/MS.

    abstract:BACKGROUND:The identification and quantification of proteins using label-free Liquid Chromatography/Mass Spectrometry (LC/MS) play crucial roles in biological and biomedical research. Increasing evidence has shown that biomarkers are often low abundance proteins. However, LC/MS systems are subject to considerable noise...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-11-S3-S8

    authors: Zhang J,Haskins W

    更新日期:2010-12-01 00:00:00

  • Correction to: Comparative transcriptomics reveals PrrABmediated control of metabolic, respiration, energy-generating, and dormancy pathways in Mycobacterium smegmatis.

    abstract::Following the publication of the original article [1], the authors reported an error in Fig. 2 of the PDF version of their article. ...

    journal_title:BMC genomics

    pub_type: 杂志文章,已发布勘误

    doi:10.1186/s12864-019-6419-1

    authors: Maarsingh JD,Yang S,Park JG,Haydel SE

    更新日期:2019-12-31 00:00:00

  • Stoichiometric gene-to-reaction associations enhance model-driven analysis performance: Metabolic response to chronic exposure to Aldrin in prostate cancer.

    abstract:BACKGROUND:Genome-scale metabolic models (GSMM) integrating transcriptomics have been widely used to study cancer metabolism. This integration is achieved through logical rules that describe the association between genes, proteins, and reactions (GPRs). However, current gene-to-reaction formulation lacks the stoichiome...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-019-5979-4

    authors: Marín de Mas I,Torrents L,Bedia C,Nielsen LK,Cascante M,Tauler R

    更新日期:2019-08-15 00:00:00

  • Transcriptome profiling of two maize inbreds with distinct responses to Gibberella ear rot disease to identify candidate resistance genes.

    abstract:BACKGROUND:Gibberella ear rot (GER) is one of the most economically important fungal diseases of maize in the temperate zone due to moldy grain contaminated with health threatening mycotoxins. To develop resistant genotypes and control the disease, understanding the host-pathogen interaction is essential. RESULTS:RNA-...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/s12864-018-4513-4

    authors: Kebede AZ,Johnston A,Schneiderman D,Bosnich W,Harris LJ

    更新日期:2018-02-09 00:00:00

  • Comprehensive analysis of the Corynebacterium glutamicum transcriptome using an improved RNAseq technique.

    abstract:BACKGROUND:The use of RNAseq to resolve the transcriptional organization of an organism was established in recent years and also showed the complexity and dynamics of bacterial transcriptomes. The aim of this study was to comprehensively investigate the transcriptome of the industrially relevant amino acid producer and...

    journal_title:BMC genomics

    pub_type: 杂志文章

    doi:10.1186/1471-2164-14-888

    authors: Pfeifer-Sancar K,Mentz A,Rückert C,Kalinowski J

    更新日期:2013-12-17 00:00:00