Text-mining clinically relevant cancer biomarkers for curation into the CIViC database.

Abstract:

BACKGROUND:Precision oncology involves analysis of individual cancer samples to understand the genes and pathways involved in the development and progression of a cancer. To improve patient care, knowledge of diagnostic, prognostic, predisposing, and drug response markers is essential. Several knowledgebases have been created by different groups to collate evidence for these associations. These include the open-access Clinical Interpretation of Variants in Cancer (CIViC) knowledgebase. These databases rely on time-consuming manual curation from skilled experts who read and interpret the relevant biomedical literature. METHODS:To aid in this curation and provide the greatest coverage for these databases, particularly CIViC, we propose the use of text mining approaches to extract these clinically relevant biomarkers from all available published literature. To this end, a group of cancer genomics experts annotated sentences that discussed biomarkers with their clinical associations and achieved good inter-annotator agreement. We then used a supervised learning approach to construct the CIViCmine knowledgebase. RESULTS:We extracted 121,589 relevant sentences from PubMed abstracts and PubMed Central Open Access full-text papers. CIViCmine contains over 87,412 biomarkers associated with 8035 genes, 337 drugs, and 572 cancer types, representing 25,818 abstracts and 39,795 full-text publications. CONCLUSIONS:Through integration with CIVIC, we provide a prioritized list of curatable clinically relevant cancer biomarkers as well as a resource that is valuable to other knowledgebases and precision cancer analysts in general. All data is publically available and distributed with a Creative Commons Zero license. The CIViCmine knowledgebase is available at http://bionlp.bcgsc.ca/civicmine/.

journal_name

Genome Med

journal_title

Genome medicine

authors

Lever J,Jones MR,Danos AM,Krysiak K,Bonakdar M,Grewal JK,Culibrk L,Griffith OL,Griffith M,Jones SJM

doi

10.1186/s13073-019-0686-y

subject

Has Abstract

pub_date

2019-12-03 00:00:00

pages

78

issue

1

issn

1756-994X

pii

10.1186/s13073-019-0686-y

journal_volume

11

pub_type

杂志文章,meta分析
  • Microbiome mediation of infections in the cancer setting.

    abstract::Infections encountered in the cancer setting may arise from intensive cancer treatments or may result from the cancer itself, leading to risk of infections through immune compromise, disruption of anatomic barriers, and exposure to nosocomial (hospital-acquired) pathogens. Consequently, cancer-related infections are u...

    journal_title:Genome medicine

    pub_type: 杂志文章,评审

    doi:10.1186/s13073-016-0306-z

    authors: Taur Y,Pamer EG

    更新日期:2016-04-18 00:00:00

  • Comprehensive promoter level expression quantitative trait loci analysis of the human frontal lobe.

    abstract:BACKGROUND:Expression quantitative trait loci (eQTL) analysis is a powerful method to detect correlations between gene expression and genomic variants and is widely used to interpret the biological mechanism underlying identified genome wide association studies (GWAS) risk loci. Numerous eQTL studies have been performe...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-016-0320-1

    authors: Blauwendraat C,Francescatto M,Gibbs JR,Jansen IE,Simón-Sánchez J,Hernandez DG,Dillman AA,Singleton AB,Cookson MR,Rizzu P,Heutink P

    更新日期:2016-06-10 00:00:00

  • Bridging the gap between systems biology and medicine.

    abstract::Systems biology has matured considerably as a discipline over the last decade, yet some of the key challenges separating current research efforts in systems biology and clinically useful results are only now becoming apparent. As these gaps are better defined, the new discipline of systems medicine is emerging as a tr...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm88

    authors: Clermont G,Auffray C,Moreau Y,Rocke DM,Dalevi D,Dubhashi D,Marshall DR,Raasch P,Dehne F,Provero P,Tegner J,Aronow BJ,Langston MA,Benson M

    更新日期:2009-09-29 00:00:00

  • Wnt-regulated lncRNA discovery enhanced by in vivo identification and CRISPRi functional validation.

    abstract:BACKGROUND:Wnt signaling is an evolutionarily conserved developmental pathway that is frequently hyperactivated in cancer. While multiple protein-coding genes regulated by Wnt signaling are known, the functional lncRNAs regulated by Wnt signaling have not been systematically characterized. METHODS:We comprehensively m...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00788-5

    authors: Liu S,Harmston N,Glaser TL,Wong Y,Zhong Z,Madan B,Virshup DM,Petretto E

    更新日期:2020-10-22 00:00:00

  • Haploinsufficiency of Hedgehog interacting protein causes increased emphysema induced by cigarette smoke through network rewiring.

    abstract:BACKGROUND:The HHIP gene, encoding Hedgehog interacting protein, has been implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS), and our subsequent studies identified a functional upstream genetic variant that decreased HHIP transcription. However, little is known about ho...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-015-0137-3

    authors: Lao T,Glass K,Qiu W,Polverino F,Gupta K,Morrow J,Mancini JD,Vuong L,Perrella MA,Hersh CP,Owen CA,Quackenbush J,Yuan GC,Silverman EK,Zhou X

    更新日期:2015-02-14 00:00:00

  • Pancreatic cancer genomics: insights and opportunities for clinical translation.

    abstract::Pancreatic cancer is a highly lethal tumor type for which there are few viable therapeutic options. It is also caused by the accumulation of mutations in a variety of genes. These genetic alterations can be grouped into those that accumulate during pancreatic intraepithelial neoplasia (precursor lesions) and thus are ...

    journal_title:Genome medicine

    pub_type: 杂志文章,评审

    doi:10.1186/gm430

    authors: Makohon-Moore A,Brosnan JA,Iacobuzio-Donahue CA

    更新日期:2013-03-28 00:00:00

  • Using inactivating mutations to provide insight into drug action.

    abstract::The role of ezetimibe in lowering plasma cholesterol has been established; however, controversy remains about its clinical benefit. A recent study utilizes naturally occurring genetic variation within the NPC1-like 1 gene (NPC1L1) to demonstrate the potential for pharmacologic inhibition of the protein to reduce the r...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-015-0130-x

    authors: Corbin LJ,Timpson NJ

    更新日期:2015-01-28 00:00:00

  • A Klebsiella pneumoniae ST307 outbreak clone from Germany demonstrates features of extensive drug resistance, hypermucoviscosity, and enhanced iron acquisition.

    abstract:BACKGROUND:Antibiotic-resistant Klebsiella pneumoniae are a major cause of hospital- and community-acquired infections, including sepsis, liver abscess, and pneumonia, driven mainly by the emergence of successful high-risk clonal lineages. The K. pneumoniae sequence type (ST) 307 lineage has appeared in several differe...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00814-6

    authors: Heiden SE,Hübner NO,Bohnert JA,Heidecke CD,Kramer A,Balau V,Gierer W,Schaefer S,Eckmanns T,Gatermann S,Eger E,Guenther S,Becker K,Schaufler K

    更新日期:2020-12-09 00:00:00

  • Epigenetics of renal cell carcinoma: the path towards new diagnostics and therapeutics.

    abstract::Aberrant DNA methylation, in particular promoter hypermethylation and transcriptional silencing of tumor suppressor genes, has an important role in the development of many human cancers, including renal cell carcinoma (RCC). Indeed, apart from mutations in the well studied von Hippel-Lindau gene (VHL), the mutation fr...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm180

    authors: Morris MR,Maher ER

    更新日期:2010-09-03 00:00:00

  • A population-based gene expression signature of molecular clock phase from a single epidermal sample.

    abstract:BACKGROUND:For circadian medicine to influence health, such as when to take a drug or undergo a procedure, a biomarker of molecular clock phase is required--one that is easily measured and generalizable across a broad population. It is not clear that any circadian biomarker yet satisfies these criteria. METHODS:We ana...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00768-9

    authors: Wu G,Ruben MD,Francey LJ,Smith DF,Sherrill JD,Oblong JE,Mills KJ,Hogenesch JB

    更新日期:2020-08-21 00:00:00

  • Ultradeep analysis of tumor heterogeneity in regions of somatic hypermutation.

    abstract::Tumor heterogeneity is of growing importance in the treatment of cancers. Mutational hot spots are prime locations for determining number and proportions of low variant allele frequency (VAF) tumor subclones by next generation sequencing. Low VAF detection is complicated by poor mapping efficiency in regions with high...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-015-0147-1

    authors: Spence JM,Spence JP,Abumoussa A,Burack WR

    更新日期:2015-03-12 00:00:00

  • Mining the literature: new methods to exploit keyword profiles.

    abstract:UNLABELLED:Bibliographic records in the PubMed database of biomedical literature are annotated with Medical Subject Headings (MeSH) by curators, which summarize the content of the articles. Two recent publications explain how to generate profiles of MeSH terms for a set of bibliographic records and to use them to defin...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm382

    authors: Andrade-Navarro MA

    更新日期:2012-10-30 00:00:00

  • Functional profiling of the gut microbiome in disease-associated inflammation.

    abstract::The microbial residents of the human gut are a major factor in the development and lifelong maintenance of health. The gut microbiota differs to a large degree from person to person and has an important influence on health and disease due to its interaction with the human immune system. Its overall composition and mic...

    journal_title:Genome medicine

    pub_type: 杂志文章,评审

    doi:10.1186/gm469

    authors: Börnigen D,Morgan XC,Franzosa EA,Ren B,Xavier RJ,Garrett WS,Huttenhower C

    更新日期:2013-07-31 00:00:00

  • Enabling multiplexed testing of pooled donor cells through whole-genome sequencing.

    abstract::We describe a method that enables the multiplex screening of a pool of many different donor cell lines. Our method accurately predicts each donor proportion from the pool without requiring the use of unique DNA barcodes as markers of donor identity. Instead, we take advantage of common single nucleotide polymorphisms,...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-018-0541-6

    authors: Chan Y,Chan YK,Goodman DB,Guo X,Chavez A,Lim ET,Church GM

    更新日期:2018-04-19 00:00:00

  • A phylogeny-based sampling strategy and power calculator informs genome-wide associations study design for microbial pathogens.

    abstract::Whole genome sequencing is increasingly used to study phenotypic variation among infectious pathogens and to evaluate their relative transmissibility, virulence, and immunogenicity. To date, relatively little has been published on how and how many pathogen strains should be selected for studies associating phenotype a...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-014-0101-7

    authors: Farhat MR,Shapiro BJ,Sheppard SK,Colijn C,Murray M

    更新日期:2014-11-15 00:00:00

  • The pan-cancer landscape of prognostic germline variants in 10,582 patients.

    abstract:BACKGROUND:While clinical factors such as age, grade, stage, and histological subtype provide physicians with information about patient prognosis, genomic data can further improve these predictions. Previous studies have shown that germline variants in known cancer driver genes are predictive of patient outcome, but no...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-0718-7

    authors: Chatrath A,Przanowska R,Kiran S,Su Z,Saha S,Wilson B,Tsunematsu T,Ahn JH,Lee KY,Paulsen T,Sobierajska E,Kiran M,Tang X,Li T,Kumar P,Ratan A,Dutta A

    更新日期:2020-02-17 00:00:00

  • Identification of epigenome-wide DNA methylation differences between carriers of APOE ε4 and APOE ε2 alleles.

    abstract:BACKGROUND:The apolipoprotein E (APOE) ε4 allele is the strongest genetic risk factor for late onset Alzheimer's disease, whilst the ε2 allele confers protection. Previous studies report differential DNA methylation of APOE between ε4 and ε2 carriers, but associations with epigenome-wide methylation have not previously...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00808-4

    authors: Walker RM,Vaher K,Bermingham ML,Morris SW,Bretherick AD,Zeng Y,Rawlik K,Amador C,Campbell A,Haley CS,Hayward C,Porteous DJ,McIntosh AM,Marioni RE,Evans KL

    更新日期:2021-01-04 00:00:00

  • Open science versus commercialization: a modern research conflict?

    abstract:BACKGROUND:Efforts to improve research outcomes have resulted in genomic researchers being confronted with complex and seemingly contradictory instructions about how to perform their tasks. Over the past decade, there has been increasing pressure on university researchers to commercialize their work. Concurrently, they...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm316

    authors: Caulfield T,Harmon SH,Joly Y

    更新日期:2012-02-27 00:00:00

  • Clinical laboratory test-wide association scan of polygenic scores identifies biomarkers of complex disease.

    abstract:BACKGROUND:Clinical laboratory (lab) tests are used in clinical practice to diagnose, treat, and monitor disease conditions. Test results are stored in electronic health records (EHRs), and a growing number of EHRs are linked to patient DNA, offering unprecedented opportunities to query relationships between genetic ri...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00820-8

    authors: Dennis JK,Sealock JM,Straub P,Lee YH,Hucks D,Actkins K,Faucon A,Feng YA,Ge T,Goleva SB,Niarchou M,Singh K,Morley T,Smoller JW,Ruderfer DM,Mosley JD,Chen G,Davis LK

    更新日期:2021-01-13 00:00:00

  • Cord blood DNA methylome in newborns later diagnosed with autism spectrum disorder reflects early dysregulation of neurodevelopmental and X-linked genes.

    abstract:BACKGROUND:Autism spectrum disorder (ASD) is a neurodevelopmental disorder with complex heritability and higher prevalence in males. The neonatal epigenome has the potential to reflect past interactions between genetic and environmental factors during early development and influence future health outcomes. METHODS:We ...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00785-8

    authors: Mordaunt CE,Jianu JM,Laufer BI,Zhu Y,Hwang H,Dunaway KW,Bakulski KM,Feinberg JI,Volk HE,Lyall K,Croen LA,Newschaffer CJ,Ozonoff S,Hertz-Picciotto I,Fallin MD,Schmidt RJ,LaSalle JM

    更新日期:2020-10-14 00:00:00

  • The futility of genomic counseling: essential role of electronic health records.

    abstract::Technological advances over the past several years have dramatically reduced the cost of whole-genome sequencing. At the same time, understanding of the functional significance of genetic variation has advanced considerably. The routine generation of whole-genome sequence data for individual patients will soon be suff...

    journal_title:Genome medicine

    pub_type: 社论

    doi:10.1186/gm48

    authors: Belmont J,McGuire AL

    更新日期:2009-05-08 00:00:00

  • A comparison of epigenetic mitotic-like clocks for cancer risk prediction.

    abstract:BACKGROUND:DNA methylation changes that accrue in the stem cell pool of an adult tissue in line with the cumulative number of cell divisions may contribute to the observed variation in cancer risk among tissues and individuals. Thus, the construction of epigenetic "mitotic" clocks that can measure the lifetime number o...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-020-00752-3

    authors: Teschendorff AE

    更新日期:2020-06-24 00:00:00

  • The prognostic potential of alternative transcript isoforms across human tumors.

    abstract:BACKGROUND:Phenotypic changes during cancer progression are associated with alterations in gene expression, which can be exploited to build molecular signatures for tumor stage identification and prognosis. However, it is not yet known whether the relative abundance of transcript isoforms may be informative for clinica...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-016-0339-3

    authors: Trincado JL,Sebestyén E,Pagés A,Eyras E

    更新日期:2016-08-17 00:00:00

  • DNA methylation is associated with downregulation of the organic cation transporter OCT1 (SLC22A1) in human hepatocellular carcinoma.

    abstract:BACKGROUND:Organic cation transporters (OCTs) determine not only physiological processes but are also involved in the cellular uptake of anticancer agents. Based on microarray analyses in hepatocellular carcinoma (HCC), SLC22A1/OCT1 mRNA seems to be downregulated, but systematic protein expression data are currently mi...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm298

    authors: Schaeffeler E,Hellerbrand C,Nies AT,Winter S,Kruck S,Hofmann U,van der Kuip H,Zanger UM,Koepsell H,Schwab M

    更新日期:2011-12-23 00:00:00

  • Multi-locus models of genetic risk of disease.

    abstract:BACKGROUND:Evidence for genetic contribution to complex diseases is described by recurrence risks to relatives of diseased individuals. Genome-wide association studies allow a description of the genetics of the same diseases in terms of risk loci, their effects and allele frequencies. To reconcile the two descriptions ...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm131

    authors: Wray NR,Goddard ME

    更新日期:2010-02-02 00:00:00

  • Next-generation carrier screening: are we ready?

    abstract::Next-generation sequencing (NGS) methodology allows for a major expansion in current carrier screening tests. NGS testing has been shown to be analytically accurate and cost-effective, but major challenges include educational and counseling issues. ...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-014-0062-x

    authors: Prior TW

    更新日期:2014-08-26 00:00:00

  • Prenatal diagnosis of fetal aneuploidies: post-genomic developments.

    abstract::Prenatal diagnosis of fetal aneuploidies and chromosomal anomalies is likely to undergo a profound change in the near future. On the one hand this is mediated by new technical developments, such as chromosomal microarrays, which allow a much more precise delineation of minute sub-microscopic chromosomal aberrancies th...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm171

    authors: Hahn S,Jackson LG,Zimmermann BG

    更新日期:2010-08-05 00:00:00

  • Personalizing carbamazepine therapy.

    abstract::The anticonvulsant carbamazepine has a high incidence of cutaneous adverse drug reactions. A recent prospective clinical trial in Taiwan has indicated that HLA-B*1502 screening will reduce the incidence of life-threatening adverse reactions to carbamazepine, while a genome-wide association study has identified the HLA...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm243

    authors: Mushiroda T,Nakamura Y

    更新日期:2011-05-30 00:00:00

  • Environment-driven somatic mosaicism in brain disorders.

    abstract::The identification of somatic mosaicism in the brain lends a new perspective to our understanding of the role of gene and environment interactions in psychiatric disease risk. Somatic mutations, such as retrotransposon insertions, that are precipitated by modern environmental factors may alter neuronal function and ne...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/s13073-016-0317-9

    authors: Bedrosian TA,Linker S,Gage FH

    更新日期:2016-05-23 00:00:00

  • Genetic and epigenetic insights into fetal alcohol spectrum disorders.

    abstract::The magnitude of the detrimental effects following in utero alcohol exposure, including fetal alcohol syndrome and other fetal alcohol spectrum disorders (FASD), is globally underestimated. The effects include irreversible cognitive and behavioral disabilities as a result of abnormal brain development, pre- and postna...

    journal_title:Genome medicine

    pub_type: 杂志文章

    doi:10.1186/gm148

    authors: Ramsay M

    更新日期:2010-04-28 00:00:00