Minimum error calibration and normalization for genomic copy number analysis.

Abstract:

BACKGROUND:Copy number variations (CNV) are regional deviations from the normal autosomal bi-allelic DNA content. While germline CNVs are a major contributor to genomic syndromes and inherited diseases, the majority of cancers accumulate extensive "somatic" CNV (sCNV or CNA) during the process of oncogenetic transformation and progression. While specific sCNV have closely been associated with tumorigenesis, intriguingly many neoplasias exhibit recurrent sCNV patterns beyond the involvement of a few cancer driver genes. Currently, CNV profiles of tumor samples are generated using genomic micro-arrays or high-throughput DNA sequencing. Regardless of the underlying technology, genomic copy number data is derived from the relative assessment and integration of multiple signals, with the data generation process being prone to contamination from several sources. Estimated copy number values have no absolute or strictly linear correlation to their corresponding DNA levels, and the extent of deviation differs between sample profiles, which poses a great challenge for data integration and comparison in large scale genome analysis. RESULTS:In this study, we present a novel method named "Minimum Error Calibration and Normalization for Copy Numbers Analysis" (Mecan4CNA). It only requires CNV segmentation files as input, is platform independent, and has a high performance with limited hardware requirements. For a given multi-sample copy number dataset, Mecan4CNA can batch-normalize all samples to the corresponding true copy number levels of the main tumor clones. Experiments of Mecan4CNA on simulated data showed an overall accuracy of 93% and 91% in determining the normal level and single copy alteration (i.e. duplication or loss of one allele), respectively. Comparison of estimated normal levels and single copy alternations with existing methods and karyotyping data on the NCI-60 tumor cell line produced coherent results. To estimate the method's impact on downstream analyses, we performed GISTIC analyses on the original and Mecan4CNA normalized data from the Cancer Genome Atlas (TCGA) where the normalized data showed prominent improvements of both sensitivity and specificity in detecting focal regions. CONCLUSIONS:Mecan4CNA provides an advanced method for CNA data normalization, especially in meta-analyses involving large profile numbers and heterogeneous source data quality. With its informative output and visualization options, Mecan4CNA also can improve the interpretation of individual CNA profiles. Mecan4CNA is freely available as a Python package and through its code repository on Github.

journal_name

Genomics

journal_title

Genomics

authors

Gao B,Baudis M

doi

10.1016/j.ygeno.2020.05.008

subject

Has Abstract

pub_date

2020-09-01 00:00:00

pages

3331-3341

issue

5

eissn

0888-7543

issn

1089-8646

pii

S0888-7543(20)30168-3

journal_volume

112

pub_type

杂志文章

相关文献

GENOMICS文献大全
  • Next generation sequencing identifies abnormal Y chromosome and candidate causal variants in premature ovarian failure patients.

    abstract::Premature ovarian failure (POF) is characterized by heterogeneous genetic causes such as chromosomal abnormalities and variants in causal genes. Recently, development of techniques made next generation sequencing (NGS) possible to detect genome wide variants including chromosomal abnormalities. Among 37 Korean POF pat...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2016.10.006

    authors: Lee Y,Kim C,Park Y,Pyun JA,Kwack K

    更新日期:2016-12-01 00:00:00

  • The Sp4H deletion may contain a new locus essential for postimplantation development.

    abstract::Sp4H is a semi-dominant mutation that maps to mouse chromosome 1. Heterozygous mice exhibit white spotting of the belly, whereas the fate of the homozygous embryos is unknown. We have previously shown that the entire coding region of the Pax3 gene is deleted in the Sp4H mutant. In this study, we have analyzed the fate...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.0267

    authors: Fleming J,Pearce A,Brown SD,Steel KP

    更新日期:1996-06-01 00:00:00

  • Characterization of a human gene encoding nucleosomal binding protein NSBP1.

    abstract::We characterize the cDNA and genomic structure of NSBP1, and demonstrate that it is a nuclear protein and the homologue of mouse Nsbp1, which is known to encode a nucleosomal binding and transcriptional activating protein related to the HMG-14/-17 chromosomal proteins. The encoded NSBP1 protein has 86% amino acid simi...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2000.6443

    authors: King LM,Francomano CA

    更新日期:2001-01-15 00:00:00

  • MLL2: A new mammalian member of the trx/MLL family of genes.

    abstract::We have identified a gene at chromosome band 19q13.1, which is closely related to MLL. MLL is located in a region of chromosome 11q23 that has partial synteny with chromosome 19q. We have named this gene at 19q13.1, MLL2. MLL2 encodes a protein that exhibits a high level of similarity to MLL over several important pro...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1999.5860

    authors: FitzGerald KT,Diaz MO

    更新日期:1999-07-15 00:00:00

  • Identification of new translocation breakpoints at 12q13 in lipomas.

    abstract::Cytogenetic studies of banded chromosomes and fluorescence in situ hybridization (FISH) of several yeast artificial chromosomes (YACs) that are part of a 128-kb resolution physical map of a portion of 12q13 revealed that 4/14 (28%) lipomas have breakpoints in 12q13. These breakpoints are more than 10 Mb away from the ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4993

    authors: Merscher S,Marondel I,Pedeutour F,Gaudray P,Kucherlapati R,Turc-Carel C

    更新日期:1997-11-15 00:00:00

  • The gene encoding protein 4.2 is distinct from the mouse platelet storage pool deficiency mutation pallid.

    abstract::Previous studies identified the gene encoding the erythrocyte membrane protein 4.2 (Epb4.2) as a candidate for the mouse mutation pallid (pa); Epb4.2 genetically colocalized near pa on mouse Chromosome 2, and a truncated Epb4.2 transcript was present in tissues derived from pallid mice. We report here evidence that Ep...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4764

    authors: Gwynn B,Korsgren C,Cohen CM,Ciciotte SL,Peters LL

    更新日期:1997-06-15 00:00:00

  • A dicistronic gene pair within a cluster of "EF-hand" protein genes in the genomes of Drosophila species.

    abstract::Androcam is a Drosophila melanogaster calmodulin-related protein that functions specifically in the testis. We show that the Acam gene is part of a cluster of three intronless genes arranged in a head-to-tail manner. The additional genes also encode calmodulin-related proteins with testis-specific transcription. Acam ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2006.04.009

    authors: Pavlik P,Konduri V,Massa E,Simonette R,Beckingham KM

    更新日期:2006-09-01 00:00:00

  • Identification of an alternative transcript from the human iduronate-2-sulfatase (IDS) gene.

    abstract::Iduronate-2-sulfatase (IDS) is involved in the degradation of heparan sulfate and dermatan sulfate in the lysosomes, and a deficiency in this enzyme results in Hunter syndrome. A 2.3-kb cDNA clone that contains the entire coding sequence of IDS has previously been reported. Here we describe the identification of a 1.4...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1995.1249

    authors: Malmgren H,Carlberg BM,Pettersson U,Bondeson ML

    更新日期:1995-09-01 00:00:00

  • HERV-K-T47D-Related long terminal repeats mediate polyadenylation of cellular transcripts.

    abstract::The human genome harbors thousands of long terminal repeats (LTRs) that are derived from endogenous retroviruses and contain elements able to regulate the expression of neighboring cellular genes. We have investigated the ability of human endogenous retroviral (HERV)-K LTRs to provide transcriptional processing signal...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2000.6175

    authors: Baust C,Seifarth W,Germaier H,Hehlmann R,Leib-Mösch C

    更新日期:2000-05-15 00:00:00

  • C-T variant in a miRNA target site of BCL2 is associated with increased risk of human papilloma virus related cervical cancer--an in silico approach.

    abstract::MicroRNAs control gene expression at the posttranscriptional level by base-pairing to the 3'-UTR of their target mRNAs, thus leading to mRNA degradation of protein fabrication. We hypothesize, SNPs within miRNAs and their targets could be of significance to an individual's risk of developing cancer. We analyzed in sil...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2011.06.005

    authors: Reshmi G,Surya R,Jissa VT,Babu PS,Preethi NR,Santhi WS,Jayaprakash PG,Pillai MR

    更新日期:2011-09-01 00:00:00

  • Identification and functional characterization of SOC1-like genes in Pyrus bretschneideri.

    abstract::Flowering is a prerequisite for pear fruit production. Therefore, the development of flower buds and the control of flowering time are important for pear trees. However, the molecular mechanism of pear flowering is unclear. SOC1, a member of MADS-box family, is known as a flowering signal integrator in Arabidopsis. We...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2019.09.011

    authors: Liu Z,Wu X,Cheng M,Xie Z,Xiong C,Zhang S,Wu J,Wang P

    更新日期:2020-03-01 00:00:00

  • Exploring the multi-drug resistance in Escherichia coli O157:H7 by gene interaction network: A systems biology approach.

    abstract::In the present study, we have constructed an interaction network of 29 antibiotic resistant genes along with 777 interactions in E. coli O157:H7. Gene ontology analysis reveals that 94, 89 and 67 genes have roles in the cellular process, biological process and molecular function respectively. Gene complexes related to...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2018.06.002

    authors: Miryala SK,Ramaiah S

    更新日期:2019-07-01 00:00:00

  • Statistical power for identifying nucleotide markers associated with quantitative traits in genome-wide association analysis using a mixed model.

    abstract::Use of mixed models is in the spotlight as an emerging method for genome-wide association studies (GWASs). This study investigated the statistical power for identifying nucleotide variants associated with quantitative traits using the mixed model methodology. Quantitative traits were simulated through design of herita...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2014.11.001

    authors: Shin J,Lee C

    更新日期:2015-01-01 00:00:00

  • The MAS proto-oncogene is imprinted in human breast tissue.

    abstract::The human MAS proto-oncogene is situated at 6q25.3-q26, a region that is homologous to mouse chromosome 17 where two parentally imprinted genes (Mas and Igf2r) have previously been identified. We investigated the imprinting status of MAS in adult lesions to establish the imprinting status of this gene in humans, as ce...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.5063

    authors: Miller N,McCann AH,O'Connell D,Pedersen IS,Spiers V,Gorey T,Dervan PA

    更新日期:1997-12-15 00:00:00

  • Organization and evolutionary relatedness of OR37 olfactory receptor genes in mouse and human.

    abstract::We report a comprehensive comparative analysis of human and mouse olfactory receptor (OR) genes encoding OR37 subtypes to determine the repertoire, chromosomal organization, and relatedness of these genes. Two OR37 clusters were found in both mouse (chromosome 4) and human (chromosome 9); with five genes in cluster I ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/s0888-7543(03)00116-2

    authors: Hoppe R,Breer H,Strotmann J

    更新日期:2003-09-01 00:00:00

  • Genome-wide analysis of AP2/ERF transcription factors in pineapple reveals functional divergence during flowering induction mediated by ethylene and floral organ development.

    abstract::The APETALA2/ethylene-responsive factor (AP2/ERF) has important roles in regulating developmental processes and hormone signaling transduction in plants. Pineapple demonstrates a special sensitivity to ethylene, and AP2/ERFs may contribute to this distinct sensitivity of pineapples to ethylene. However, little informa...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2020.10.040

    authors: Zhang H,Pan X,Liu S,Lin W,Li Y,Zhang X

    更新日期:2021-01-20 00:00:00

  • A novel androgen-regulated gene, PMEPA1, located on chromosome 20q13 exhibits high level expression in prostate.

    abstract::Biologic effects of androgen on target cells are mediated in part by transcriptional regulation of androgen-regulated genes (ARGs) by androgen receptor. Using serial analysis of gene expression (SAGE), we have identified a comprehensive repertoire of ARGs in LNCaP cells. One of the SAGE-derived tags exhibiting homolog...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2000.6214

    authors: Xu LL,Shanmugam N,Segawa T,Sesterhenn IA,McLeod DG,Moul JW,Srivastava S

    更新日期:2000-06-15 00:00:00

  • Functional analysis of the murine Emr1 promoter identifies a novel purine-rich regulatory motif required for high-level gene expression in macrophages.

    abstract::This study has investigated the transcriptional regulation of the Emr1 gene in murine macrophages and defined an enhancer element within the proximal promoter that is necessary for Emr1 expression in myeloid cells. This element consists of an extended purine-rich sequence (PuRS) of 83 consecutive purine residues conta...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2004.08.016

    authors: O'Reilly D,Addley M,Quinn C,MacFarlane AJ,Gordon S,McKnight AJ,Greaves DR

    更新日期:2004-12-01 00:00:00

  • Construction of a gene map of the nephronophthisis type 1 (NPHP1) region on human chromosome 2q12-q13.

    abstract::A gene for the autosomal recessive kidney disorder juvenile nephronophthisis (NPH) is located on chromosome 2q between markers D2S1893 and D2S1888. Recently, the presence of large homozygous deletions was described in the majority of NPH patients. We constructed an integrated YAC/PAC contig of 54 markers and 30 PAC cl...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.5102

    authors: Nothwang HG,Stubanus M,Adolphs J,Hanusch H,Vossmerbäumer U,Denich D,Kübler M,Mincheva A,Lichter P,Hildebrandt F

    更新日期:1998-01-15 00:00:00

  • Identification of two evolutionarily conserved and functional regulatory elements in intron 2 of the human BRCA1 gene.

    abstract::Cross-species comparative genomics is a powerful strategy for identifying functional regulatory elements within noncoding DNA. In this paper, comparative analysis of human and mouse intronic sequences in the breast cancer susceptibility gene (BRCA1) revealed two evolutionarily conserved noncoding sequences (CNS) in in...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2005.05.006

    authors: Wardrop SL,Brown MA,kConFab Investigators.

    更新日期:2005-09-01 00:00:00

  • Sequences homologous to glutamic acid decarboxylase cDNA are present on mouse chromosomes 2 and 10.

    abstract::The chromosomal locations of mouse DNA sequences homologous to a feline cDNA clone encoding glutamic acid decarboxylase (GAD) were determined. Although cats and humans are thought to have only one gene for GAD, GAD cDNA sequences hybridize to two distinct chromosomal loci in the mouse, chromosomes 2 and 10. The chromo...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/0888-7543(90)90455-4

    authors: Brilliant MH,Szabo G,Katarova Z,Kozak CA,Glaser TM,Greenspan RJ,Housman DE

    更新日期:1990-01-01 00:00:00

  • Molecular cloning and characterization of the mouse carboxyl ester lipase gene and evidence for expression in the lactating mammary gland.

    abstract::DNA hybridization was used to isolate a 2.04-kb cDNA encoding carboxyl ester lipase (CEL) from a mouse lactating mammary gland, lambda gt10 cDNA library. The cDNA sequence translated into a protein of 599 amino acids, including 20 amino acids of a putative signal peptide. Comparison of the deduced amino acid sequence ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1995.1221

    authors: Lidmer AS,Kannius M,Lundberg L,Bjursell G,Nilsson J

    更新日期:1995-09-01 00:00:00

  • Work efficiency: a new criterion for comprehensive comparison and evaluation of statistical methods in large-scale identification of differentially expressed genes.

    abstract::Receiver operating characteristic (ROC) has been widely used to evaluate statistical methods, but a fatal problem is that ROC cannot evaluate estimation of the false discovery rate (FDR) of a statistical method and hence the area under of curve as a criterion cannot tell us if a statistical method is conservative. To ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2011.05.006

    authors: Tan YD

    更新日期:2011-11-01 00:00:00

  • Inactive allele-specific methylation and chromatin structure of the imprinted gene U2af1-rs1 on mouse chromosome 11.

    abstract::The imprinted U2af1-rs1 gene that maps to mouse chromosome 11 is predominately expressed from the paternal allele. We examined the methylation of genomic sequences in and around the U2af1-rs1 locus to establish the extent of sequence modifications that accompanied the silencing of the maternal allele. The analysis of ...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1996.0348

    authors: Shibata H,Yoshino K,Sunahara S,Gondo Y,Katsuki M,Ueda T,Kamiya M,Muramatsu M,Murakami Y,Kalcheva I,Plass C,Chapman VM,Hayashizaki Y

    更新日期:1996-07-01 00:00:00

  • A novel ribosomal S6-kinase (RSK4; RPS6KA6) is commonly deleted in patients with complex X-linked mental retardation.

    abstract::Large deletions in Xq21 often are associated with contiguous gene syndromes consisting of X-linked deafness type 3 (DFN3), mental retardation (MRX), and choroideremia (CHM). The identification of deletions associated with classic CHM or DFN3 facilitated the positional cloning of the underlying genes, REP-1 and POU3F4,...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1999.6004

    authors: Yntema HG,van den Helm B,Kissing J,van Duijnhoven G,Poppelaars F,Chelly J,Moraine C,Fryns JP,Hamel BC,Heilbronner H,Pander HJ,Brunner HG,Ropers HH,Cremers FP,van Bokhoven H

    更新日期:1999-12-15 00:00:00

  • Molecular characterization of the gene for human cartilage gp-39 (CHI3L1), a member of the chitinase protein family and marker for late stages of macrophage differentiation.

    abstract::We have previously reported that the expression of HC gp-39, a 39-kDa secretory glycoprotein and member of the chitinase protein family, is associated with late stages of monocyte to macrophage maturation. To allow further investigations of its unique expression pattern and to facilitate studies on the regulation of t...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1997.4778

    authors: Rehli M,Krause SW,Andreesen R

    更新日期:1997-07-15 00:00:00

  • Genetic and physical mapping of the dreher locus on mouse chromosome 1.

    abstract::Mutations in the mouse dreher (dr) gene cause skeletal defects, hyperactivity, abnormal gait, deafness, white belly spotting, and hypoplasia of Müllerian duct derivatives. To map dr to high resolution, we utilized two crosses. Initially, we analyzed an intersubspecific intercross to construct a detailed genetic map of...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1999.5873

    authors: Bergstrom DE,Gagnon LH,Eicher EM

    更新日期:1999-08-01 00:00:00

  • Physical mapping within the tuberous sclerosis linkage group in region 9q32-q34.

    abstract::Pulsed-field gel electrophoresis and flow dot-blot analysis have been used to construct a physical map of the q32-q34 region of chromosome 9, where one of the loci responsible for tuberous sclerosis (TSC1) has been mapped by genetic linkage. Five linked groups of markers have been defined by pulsed-field gel electroph...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.1993.1056

    authors: Harris RM,Carter NP,Griffiths B,Goudie D,Hampson RM,Yates JR,Affara NA,Ferguson-Smith MA

    更新日期:1993-02-01 00:00:00

  • A sequence-ready physical map of the region containing the human natural killer gene complex on chromosome 12p12.3-p13.2.

    abstract::We developed a sequence-ready physical map of a part of human chromosome 12p12.3-p13.2 where the natural killer gene complex (NKC) is located. The NKC includes a cluster of genes with structure similar to that of the Ca(2+)-dependent lectin superfamily of glycoproteins that are expressed on the surface of most natural...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1006/geno.2000.6163

    authors: Renedo M,Arce I,Montgomery K,Roda-Navarro P,Lee E,Kucherlapati R,Fernández-Ruiz E

    更新日期:2000-04-15 00:00:00

  • Cloning and characterization of FAM13A1--a gene near a milk protein QTL on BTA6: evidence for population-wide linkage disequilibrium in Israeli Holsteins.

    abstract::A cluster of genes coding for proteins of the extracellular matrix (ECM) containing sequence motifs essential for integrin-receptor interactions is located on HSA4q21 and on BTA6, within the critical region of a quantitative trait locus (QTL) affecting milk protein production. Genes within this cluster are involved in...

    journal_title:Genomics

    pub_type: 杂志文章

    doi:10.1016/j.ygeno.2004.03.005

    authors: Cohen M,Reichenstein M,Everts-van der Wind A,Heon-Lee J,Shani M,Lewin HA,Weller JI,Ron M,Seroussi E

    更新日期:2004-08-01 00:00:00