Controlling the signal: Practical privacy protection of genomic data sharing through Beacon services.

Abstract:

BACKGROUND:Genomic data is increasingly collected by a wide array of organizations. As such, there is a growing demand to make summary information about such collections available more widely. However, over the past decade, a series of investigations have shown that attacks, rooted in statistical inference methods, can be applied to discern the presence of a known individual's DNA sequence in the pool of subjects. Recently, it was shown that the Beacon Project of the Global Alliance for Genomics and Health, a web service for querying about the presence (or absence) of a specific allele, was vulnerable. The Integrating Data for Analysis, Anonymization, and Sharing (iDASH) Center modeled a track in their third Privacy Protection Challenge on how to mitigate the Beacon vulnerability. We developed the winning solution for this track. METHODS:This paper describes our computational method to optimize the tradeoff between the utility and the privacy of the Beacon service. We generalize the genomic data sharing problem beyond that which was introduced in the iDASH Challenge to be more representative of real world scenarios to allow for a more comprehensive evaluation. We then conduct a sensitivity analysis of our method with respect to several state-of-the-art methods using a dataset of 400,000 positions in Chromosome 10 for 500 individuals from Phase 3 of the 1000 Genomes Project. All methods are evaluated for utility, privacy and efficiency. RESULTS:Our method achieves better performance than all state-of-the-art methods, irrespective of how key factors (e.g., the allele frequency in the population, the size of the pool and utility weights) change from the original parameters of the problem. We further illustrate that it is possible for our method to exhibit subpar performance under special cases of allele query sequences. However, we show our method can be extended to address this issue when the query sequence is fixed and known a priori to the data custodian, so that they may plan stage their responses accordingly. CONCLUSIONS:This research shows that it is possible to thwart the attack on Beacon services, without substantially altering the utility of the system, using computational methods. The method we initially developed is limited by the design of the scenario and evaluation protocol for the iDASH Challenge; however, it can be improved by allowing the data custodian to act in a staged manner.

journal_name

BMC Med Genomics

journal_title

BMC medical genomics

authors

Wan Z,Vorobeychik Y,Kantarcioglu M,Malin B

doi

10.1186/s12920-017-0282-1

subject

Has Abstract

pub_date

2017-07-26 00:00:00

pages

39

issue

Suppl 2

issn

1755-8794

pii

10.1186/s12920-017-0282-1

journal_volume

10

pub_type

杂志文章
  • Genetic association and stress mediated down-regulation in trabecular meshwork implicates MPP7 as a novel candidate gene in primary open angle glaucoma.

    abstract:BACKGROUND:Glaucoma is the largest cause of irreversible blindness affecting more than 60 million people globally. The disease is defined as a gradual loss of peripheral vision due to death of Retinal Ganglion Cells (RGC). The RGC death is largely influenced by the rate of aqueous humor production by ciliary processes ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-016-0177-6

    authors: Vishal M,Sharma A,Kaurani L,Alfano G,Mookherjee S,Narta K,Agrawal J,Bhattacharya I,Roychoudhury S,Ray J,Waseem NH,Bhattacharya SS,Basu A,Sen A,Ray K,Mukhopadhyay A

    更新日期:2016-03-22 00:00:00

  • Development of a blood-based gene expression algorithm for assessment of obstructive coronary artery disease in non-diabetic patients.

    abstract:BACKGROUND:Alterations in gene expression in peripheral blood cells have been shown to be sensitive to the presence and extent of coronary artery disease (CAD). A non-invasive blood test that could reliably assess obstructive CAD likelihood would have diagnostic utility. RESULTS:Microarray analysis of RNA samples from...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-26

    authors: Elashoff MR,Wingrove JA,Beineke P,Daniels SE,Tingley WG,Rosenberg S,Voros S,Kraus WE,Ginsburg GS,Schwartz RS,Ellis SG,Tahirkheli N,Waksman R,McPherson J,Lansky AJ,Topol EJ

    更新日期:2011-03-28 00:00:00

  • Chronic insulin treatment of diabetes does not fully normalize alterations in the retinal transcriptome.

    abstract:BACKGROUND:Diabetic retinopathy (DR) is a leading cause of blindness in working age adults. Approximately 95% of patients with Type 1 diabetes develop some degree of retinopathy within 25 years of diagnosis despite normalization of blood glucose by insulin therapy. The goal of this study was to identify molecular chang...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-40

    authors: Bixler GV,Vanguilder HD,Brucklacher RM,Kimball SR,Bronson SK,Freeman WM

    更新日期:2011-05-15 00:00:00

  • Integration of genomic copy number variations and chemotherapy-response biomarkers in pediatric sarcoma.

    abstract:BACKGROUND:While most pediatric sarcomas respond to front-line therapy, some bone sarcomas do not show radiographic response like soft-tissue sarcomas (rhabdomyosarccomas) but do show 90% necrosis. Though, new therapies are urgently needed to improve survival and quality of life in pediatric patients with sarcomas. Com...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0456-5

    authors: Cheng L,Pandya PH,Liu E,Chandra P,Wang L,Murray ME,Carter J,Ferguson M,Saadatzadeh MR,Bijangi-Visheshsaraei K,Marshall M,Li L,Pollok KE,Renbarger JL

    更新日期:2019-01-31 00:00:00

  • Global transcriptome-wide analysis of CIK cells identify distinct roles of IL-2 and IL-15 in acquisition of cytotoxic capacity against tumor.

    abstract:BACKGROUND:Cytokine-induced killer (CIK) cells are an emerging approach of cancer treatment. Our previous study have shown that CIK cells stimulated with combination of IL-2 and IL-15 displayed improved proliferation capacity and tumor cytotoxicity. However, the mechanisms of CIK cell proliferation and acquisition of c...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-7-49

    authors: Wang W,Meng M,Zhang Y,Wei C,Xie Y,Jiang L,Wang C,Yang F,Tang W,Jin X,Chen D,Zong J,Hou Z,Li R

    更新日期:2014-08-09 00:00:00

  • CNAReporter: a GenePattern pipeline for the generation of clinical reports of genomic alterations.

    abstract:BACKGROUND:Genomic copy number alterations are widely associated with a broad range of human tumors and offer the potential to be used as a diagnostic tool. Especially in the emerging era of personalized medicine medical informatics tools that allow the fast visualization and analysis of genomic alterations of a patien...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-3-11

    authors: Kotliarov Y,Bozdag S,Cheng H,Wuchty S,Zenklusen JC,Fine HA

    更新日期:2010-04-09 00:00:00

  • Differential gene expression in disease: a comparison between high-throughput studies and the literature.

    abstract:BACKGROUND:Differential gene expression is important to understand the biological differences between healthy and diseased states. Two common sources of differential gene expression data are microarray studies and the biomedical literature. METHODS:With the aid of text mining and gene expression analysis we have exami...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0293-y

    authors: Rodriguez-Esteban R,Jiang X

    更新日期:2017-10-11 00:00:00

  • Exon array analysis reveals neuroblastoma tumors have distinct alternative splicing patterns according to stage and MYCN amplification status.

    abstract:BACKGROUND:Neuroblastoma (NB) tumors are well known for their pronounced clinical and molecular heterogeneity. The global gene expression and DNA copy number alterations have been shown to have profound differences in tumors of low or high stage and those with or without MYCN amplification. RNA splicing is an important...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-4-35

    authors: Guo X,Chen QR,Song YK,Wei JS,Khan J

    更新日期:2011-04-18 00:00:00

  • Genome scale analysis of pathogenic variants targetable for single base editing.

    abstract:BACKGROUND:Single nucleotide variants account for approximately 90% of all known pathogenic variants responsible for human diseases. Recently discovered CRISPR/Cas9 base editors can correct individual nucleotides without cutting DNA and inducing double-stranded breaks. We aimed to find all possible pathogenic variants ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00735-8

    authors: Lavrov AV,Varenikov GG,Skoblov MY

    更新日期:2020-09-18 00:00:00

  • wtest: an integrated R package for genetic epistasis testing.

    abstract:BACKGROUND:With the increasing amount of high-throughput genomic sequencing data, there is a growing demand for a robust and flexible tool to perform interaction analysis. The identification of SNP-SNP, SNP-CpG, and higher order interactions helps explain the genetic etiology of human diseases, yet genome-wide analysis...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0638-9

    authors: Sun R,Xia X,Chong KC,Zee BC,Wu WKK,Wang MH

    更新日期:2019-12-24 00:00:00

  • The Cancer Omics Atlas: an integrative resource for cancer omics annotations.

    abstract:BACKGROUND:The Cancer Genome Atlas (TCGA) is an important data resource for cancer biologists and oncologists. However, a lack of bioinformatics expertise often hinders experimental cancer biologists and oncologists from exploring the TCGA resource. Although a number of tools have been developed for facilitating cancer...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0381-7

    authors: Sun Q,Li M,Wang X

    更新日期:2018-08-08 00:00:00

  • LDSplitDB: a database for studies of meiotic recombination hotspots in MHC using human genomic data.

    abstract:BACKGROUND:Meiotic recombination happens during the process of meiosis when chromosomes inherited from two parents exchange genetic materials to generate chromosomes in the gamete cells. The recombination events tend to occur in narrow genomic regions called recombination hotspots. Its dysregulation could lead to serio...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0351-0

    authors: Guo J,Chen H,Yang P,Lee YT,Wu M,Przytycka TM,Kwoh CK,Zheng J

    更新日期:2018-04-20 00:00:00

  • Splice-site mutation causing partial retention of intron in the FLCN gene in Birt-Hogg-Dubé syndrome: a case report.

    abstract:BACKGROUND:Birt-Hogg-Dubé syndrome (BHD) is an autosomal dominant disorder caused by germline mutations in the folliculin gene (FLCN). Nearly 150 pathogenic mutations have been identified in FLCN. The most frequent pattern is a frameshift mutation within a coding exon. In addition, splice-site mutations have been repor...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0359-5

    authors: Furuya M,Kobayashi H,Baba M,Ito T,Tanaka R,Nakatani Y

    更新日期:2018-05-02 00:00:00

  • MySeq: privacy-protecting browser-based personal Genome analysis for genomics education and exploration.

    abstract:BACKGROUND:The complexity of genome informatics is a recurring challenge for genome exploration and analysis by students and other non-experts. This complexity creates a barrier to wider implementation of experiential genomics education, even in settings with substantial computational resources and expertise. Reducing ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-019-0615-3

    authors: Linderman MD,McElroy L,Chang L

    更新日期:2019-11-27 00:00:00

  • OncoRep: an n-of-1 reporting tool to support genome-guided treatment for breast cancer patients using RNA-sequencing.

    abstract:BACKGROUND:Breast cancer comprises multiple tumor entities associated with different biological features and clinical behaviors, making individualized medicine a powerful tool to bring the right drug to the right patient. Next generation sequencing of RNA (RNA-Seq) is a suitable method to detect targets for individuali...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0095-z

    authors: Meißner T,Fisch KM,Gioia L,Su AI

    更新日期:2015-05-21 00:00:00

  • Screening significantly hypermethylated genes in fetal tissues compared with maternal blood using a methylated-CpG island recovery assay-based microarray.

    abstract:BACKGROUND:The noninvasive prenatal diagnosis procedures that are currently used to detect genetic diseases do not achieve desirable levels of sensitivity and specificity. Recently, fetal methylated DNA biomarkers in maternal peripheral blood have been explored for the noninvasive prenatal detection of genetic disorder...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-26

    authors: Yin A,Zhang X,Wu J,Du L,He T,Zhang X

    更新日期:2012-06-18 00:00:00

  • Transcriptomic signatures in whole blood of patients who acquire a chronic inflammatory response syndrome (CIRS) following an exposure to the marine toxin ciguatoxin.

    abstract:BACKGROUND:Ciguatoxins (CTXs) are polyether marine neurotoxins found in multiple reef-fish species and are potent activators of voltage-gated sodium channels. It is estimated that up to 500,000 people annually experience acute ciguatera poisoning from consuming toxic fish and a small percentage of these victims will de...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-015-0089-x

    authors: Ryan JC,Wu Q,Shoemaker RC

    更新日期:2015-04-02 00:00:00

  • Detecting early-warning signals of type 1 diabetes and its leading biomolecular networks by dynamical network biomarkers.

    abstract:BACKGROUND:Type 1 diabetes (T1D) is a complex disease and harmful to human health, and most of the existing biomarkers are mainly to measure the disease phenotype after the disease onset (or drastic deterioration). Until now, there is no effective biomarker which can predict the upcoming disease (or pre-disease state) ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-6-S2-S8

    authors: Liu X,Liu R,Zhao XM,Chen L

    更新日期:2013-01-01 00:00:00

  • Genome-wide prediction and analysis of human tissue-selective genes using microarray expression data.

    abstract:BACKGROUND:Understanding how genes are expressed specifically in particular tissues is a fundamental question in developmental biology. Many tissue-specific genes are involved in the pathogenesis of complex human diseases. However, experimental identification of tissue-specific genes is time consuming and difficult. Th...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-6-S1-S10

    authors: Teng S,Yang JY,Wang L

    更新日期:2013-01-01 00:00:00

  • Integration analysis of long non-coding RNA (lncRNA) role in tumorigenesis of colon adenocarcinoma.

    abstract:BACKGROUND:Colon adenocarcinoma (COAD) is one of the most common gastrointestinal cancers globally. Molecular aberrations of tumor suppressors and/or oncogenes are the main contributors to tumorigenesis. However, the exact underlying mechanisms of COAD pathogenesis are clearly not known yet. In this regard, there is an...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00757-2

    authors: Poursheikhani A,Abbaszadegan MR,Nokhandani N,Kerachian MA

    更新日期:2020-07-29 00:00:00

  • Genome-wide association meta-analysis for early age-related macular degeneration highlights novel loci and insights for advanced disease.

    abstract:BACKGROUND:Advanced age-related macular degeneration (AMD) is a leading cause of blindness. While around half of the genetic contribution to advanced AMD has been uncovered, little is known about the genetic architecture of early AMD. METHODS:To identify genetic factors for early AMD, we conducted a genome-wide associ...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00760-7

    authors: Winkler TW,Grassmann F,Brandl C,Kiel C,Günther F,Strunz T,Weidner L,Zimmermann ME,Korb CA,Poplawski A,Schuster AK,Müller-Nurasyid M,Peters A,Rauscher FG,Elze T,Horn K,Scholz M,Cañadas-Garre M,McKnight AJ,Quinn N,H

    更新日期:2020-08-26 00:00:00

  • Fraternal twins with Phelan-McDermid syndrome not involving the SHANK3 gene: case report and literature review.

    abstract:BACKGROUND:Phelan-McDermid syndrome (PMS, OMIM#606232), or 22q13 deletion syndrome, is a rare genetic disorder caused by deletion of the distal long arm of chromosome 22 with a variety of clinical features that display considerably heterogeneous degrees of severity. The SHANK3 gene is understood to be the critical gene...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00802-0

    authors: Li S,Xi KW,Liu T,Zhang Y,Zhang M,Zeng LD,Li J

    更新日期:2020-10-06 00:00:00

  • Saliva samples are a viable alternative to blood samples as a source of DNA for high throughput genotyping.

    abstract:BACKGROUND:The increasing trend for incorporation of biological sample collection within clinical trials requires sample collection procedures which are convenient and acceptable for both patients and clinicians. This study investigated the feasibility of using saliva-extracted DNA in comparison to blood-derived DNA, a...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-5-19

    authors: Abraham JE,Maranian MJ,Spiteri I,Russell R,Ingle S,Luccarini C,Earl HM,Pharoah PP,Dunning AM,Caldas C

    更新日期:2012-05-30 00:00:00

  • What is the right sequencing approach? Solo VS extended family analysis in consanguineous populations.

    abstract:BACKGROUND:Testing strategies is crucial for genetics clinics and testing laboratories. In this study, we tried to compare the hit rate between solo and trio and trio plus testing and between trio and sibship testing. Finally, we studied the impact of extended family analysis, mainly in complex and unsolved cases. MET...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-020-00743-8

    authors: Alfares A,Alsubaie L,Aloraini T,Alaskar A,Althagafi A,Alahmad A,Rashid M,Alswaid A,Alothaim A,Eyaid W,Ababneh F,Albalwi M,Alotaibi R,Almutairi M,Altharawi N,Alsamer A,Abdelhakim M,Kafkas S,Mineta K,Cheung N,Abdall

    更新日期:2020-07-17 00:00:00

  • The International Conference on Intelligent Biology and Medicine (ICIBM) 2020: Data-driven analytics in biomedical genomics.

    abstract::This editorial summarizes eight research articles included in this supplement issue for the 2020 International Conference on Intelligent Biology and Medicine (ICIBM 2020) conference, that was held on August 9-10, 2020 (virtual conference), with a topic on data-driven analytics in biomedical genomics. These articles co...

    journal_title:BMC medical genomics

    pub_type: 社论

    doi:10.1186/s12920-020-00833-7

    authors: Shi X,Zhao Z,Wang K,Shen L

    更新日期:2020-12-28 00:00:00

  • HIP2: an online database of human plasma proteins from healthy individuals.

    abstract:BACKGROUND:With the introduction of increasingly powerful mass spectrometry (MS) techniques for clinical research, several recent large-scale MS proteomics studies have sought to characterize the entire human plasma proteome with a general objective for identifying thousands of proteins leaked from tissues in the circu...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-12

    authors: Saha S,Harrison SH,Shen C,Tang H,Radivojac P,Arnold RJ,Zhang X,Chen JY

    更新日期:2008-04-25 00:00:00

  • Within-pair differences of DNA methylation levels between monozygotic twins are different between male and female pairs.

    abstract:BACKGROUND:DNA methylation levels will be important for detection of epigenetic effects. However, there are few reports showing sex-related differences in the sensitivity to DNA methylation. To evaluate their sex-related individual differences in the sensitivity to methylation rigorously, we performed a systematic anal...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-016-0217-2

    authors: Watanabe M,Honda C,Osaka Twin Research Group.,Iwatani Y,Yorifuji S,Iso H,Kamide K,Hatazawa J,Kihara S,Sakai N,Watanabe H,Makimoto K,Watanabe M,Honda C,Iwatani Y

    更新日期:2016-08-26 00:00:00

  • Biological processes, properties and molecular wiring diagrams of candidate low-penetrance breast cancer susceptibility genes.

    abstract:BACKGROUND:Recent advances in whole-genome association studies (WGASs) for human cancer risk are beginning to provide the part lists of low-penetrance susceptibility genes. However, statistical analysis in these studies is complicated by the vast number of genetic variants examined and the weak effects observed, as a r...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/1755-8794-1-62

    authors: Bonifaci N,Berenguer A,Díez J,Reina O,Medina I,Dopazo J,Moreno V,Pujana MA

    更新日期:2008-12-18 00:00:00

  • Clinical utility of the low-density Infinium QC genotyping Array in a genomics-based diagnostics laboratory.

    abstract:BACKGROUND:With 15,949 markers, the low-density Infinium QC Array-24 BeadChip enables linkage analysis, HLA haplotyping, fingerprinting, ethnicity determination, mitochondrial genome variations, blood groups and pharmacogenomics. It represents an attractive independent QC option for NGS-based diagnostic laboratories, a...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-017-0297-7

    authors: Ponomarenko P,Ryutov A,Maglinte DT,Baranova A,Tatarinova TV,Gai X

    更新日期:2017-10-06 00:00:00

  • African ancestry is associated with cluster-based childhood asthma subphenotypes.

    abstract:BACKGROUND:Childhood asthma is a syndrome composed of heterogeneous phenotypes; furthermore, intrinsic biologic variation among racial/ethnic populations suggests possible genetic ancestry variation in childhood asthma. The objective of the study is to identify clinically homogeneous asthma subphenotypes in a diverse s...

    journal_title:BMC medical genomics

    pub_type: 杂志文章

    doi:10.1186/s12920-018-0367-5

    authors: Ding L,Li D,Wathen M,Altaye M,Mersha TB

    更新日期:2018-05-31 00:00:00