TOD-CUP: a gene expression rank-based majority vote algorithm for tissue origin diagnosis of cancers of unknown primary.

Abstract:

:Gene expression profiling holds great potential as a new approach to histological diagnosis and precision medicine of cancers of unknown primary (CUP). Batch effects and different data types greatly decrease the predictive performance of biomarker-based algorithms, and few methods have been widely applied to identify tissue origin of CUP up to now. To address this problem and assist in more precise diagnosis, we have developed a gene expression rank-based majority vote algorithm for tissue origin diagnosis of CUP (TOD-CUP) of most common cancer types. Based on massive tissue-specific RNA-seq data sets (10 553) found in The Cancer Genome Atlas (TCGA), 538 feature genes (biomarkers) were selected based on their gene expression ranks and used to predict tissue types. The top scoring pairs (TSPs) classifier of the tumor type was optimized by the TCGA training samples. To test the prediction accuracy of our TOD-CUP algorithm, we analyzed (1) two microarray data sets (1029 Agilent and 2277 Affymetrix/Illumina chips) and found 91% and 94% prediction accuracy, respectively, (2) RNA-seq data from five cancer types derived from 141 public metastatic cancer tumor samples and achieved 94% accuracy and (3) a total of 25 clinical cancer samples (including 14 metastatic cancer samples) were able to classify 24/25 samples correctly (96.0% accuracy). Taken together, the TOD-CUP algorithm provides a powerful and robust means to accurately identify the tissue origin of 24 cancer types across different data platforms. To make the TOD-CUP algorithm easily accessible for clinical application, we established a Web-based server for tumor tissue origin diagnosis (http://ibi. zju.edu.cn/todcup/).

journal_name

Brief Bioinform

authors

Shen Y,Chu Q,Yin X,He Y,Bai P,Wang Y,Fang W,Timko MP,Fan L,Jiang W

doi

10.1093/bib/bbaa031

subject

Has Abstract

pub_date

2020-04-08 00:00:00

eissn

1467-5463

issn

1477-4054

pii

5817479

pub_type

杂志文章
  • Investigating microRNA-mediated regulation of the nascent nuclear transcripts in plants: a bioinformatics workflow.

    abstract::Most of the microRNAs (miRNAs) play their regulatory roles through posttranscriptional target decay or translational inhibition. For both plants and animals, these regulatory events were previously considered to take place in cytoplasm, as mature miRNAs were observed to be exported to the cytoplasm for Argonaute (AGO)...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx069

    authors: Yu D,Tang Z,Shao C,Ma X,Xiang T,Fan Z,Wang H,Meng Y

    更新日期:2018-11-27 00:00:00

  • Gene set analysis methods for the functional interpretation of non-mRNA data-Genomic range and ncRNA data.

    abstract::Gene set analysis (GSA) is one of the methods of choice for analyzing the results of current omics studies; however, it has been mainly developed to analyze mRNA (microarray, RNA-Seq) data. The following review includes an update regarding general methods and resources for GSA and then emphasizes GSA methods and tools...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz090

    authors: Mora A

    更新日期:2020-09-25 00:00:00

  • In silico signaling modeling to understand cancer pathways and treatment responses.

    abstract::Precision medicine has changed thinking in cancer therapy, highlighting a better understanding of the individual clinical interventions. But what role do the drivers and pathways identified from pan-cancer genome analysis play in the tumor? In this letter, we will highlight the importance of in silico modeling in prec...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz033

    authors: Kunz M,Jeromin J,Fuchs M,Christoph J,Veronesi G,Flentje M,Nietzer S,Dandekar G,Dandekar T

    更新日期:2020-05-21 00:00:00

  • RNA-mediated translation regulation in viral genomes: computational advances in the recognition of sequences and structures.

    abstract::RNA structures are widely distributed across all life forms. The global conformation of these structures is defined by a variety of constituent structural units such as helices, hairpin loops, kissing-loop motifs and pseudoknots, which often behave in a modular way. Their ubiquitous distribution is associated with a v...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz054

    authors: Gupta A,Bansal M

    更新日期:2020-07-15 00:00:00

  • Opportunities for community awareness platforms in personal genomics and bioinformatics education.

    abstract::Precision and personalized medicine will be increasingly based on the integration of various type of information, particularly electronic health records and genome sequences. The availability of cheap genome sequencing services and the information interoperability will increase the role of online bioinformatics analys...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw078

    authors: Bianchi L,Liò P

    更新日期:2017-11-01 00:00:00

  • Bioinformatics education--perspectives and challenges out of Africa.

    abstract::The discipline of bioinformatics has developed rapidly since the complete sequencing of the first genomes in the 1990s. The development of many high-throughput techniques during the last decades has ensured that bioinformatics has grown into a discipline that overlaps with, and is required for, the modern practice of ...

    journal_title:Briefings in bioinformatics

    pub_type: 历史文章,杂志文章

    doi:10.1093/bib/bbu022

    authors: Tastan Bishop Ö,Adebiyi EF,Alzohairy AM,Everett D,Ghedira K,Ghouila A,Kumuthini J,Mulder NJ,Panji S,Patterton HG,H3ABioNet Consortium.,H3Africa Consortium.

    更新日期:2015-03-01 00:00:00

  • Are dropout imputation methods for scRNA-seq effective for scHi-C data?

    abstract::The prevalence of dropout events is a serious problem for single-cell Hi-C (scHiC) data due to insufficient sequencing depth and data coverage, which brings difficulties in downstream studies such as clustering and structural analysis. Complicating things further is the fact that dropouts are confounded with structura...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa289

    authors: Han C,Xie Q,Lin S

    更新日期:2020-11-17 00:00:00

  • Proteome-scale analysis of phase-separated proteins in immunofluorescence images.

    abstract::Phase separation is an important mechanism that mediates the spatial distribution of proteins in different cellular compartments. While phase-separated proteins share certain sequence characteristics, including intrinsically disordered regions (IDRs) and prion-like domains, such characteristics are insufficient for ma...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa187

    authors: Yu C,Shen B,You K,Huang Q,Shi M,Wu C,Chen Y,Zhang C,Li T

    更新日期:2020-09-02 00:00:00

  • Irinotecan and vandetanib create synergies for treatment of pancreatic cancer patients with concomitant TP53 and KRAS mutations.

    abstract:BACKGROUND:The most frequently mutated gene pairs in pancreatic adenocarcinoma (PAAD) are KRAS and TP53, and our goal is to illustrate the multiomics and molecular dynamics landscapes of KRAS/TP53 mutation and also to obtain prospective novel drugs for KRAS- and TP53-mutated PAAD patients. Moreover, we also made an att...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa149

    authors: Kaushik AC,Wang YJ,Wang X,Wei DQ

    更新日期:2020-07-31 00:00:00

  • The microRNA target site landscape is a novel molecular feature associating alternative polyadenylation with immune evasion activity in breast cancer.

    abstract::Alternative polyadenylation (APA) in breast tumor samples results in the removal/addition of cis-regulatory elements such as microRNA (miRNA) target sites in the 3'-untranslated region (3'-UTRs) of genes. Although previous computational APA studies focused on a subset of genes strongly affected by APA (APA genes), we ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa191

    authors: Kim S,Bai Y,Fan Z,Diergaarde B,Tseng GC,Park HJ

    更新日期:2020-08-26 00:00:00

  • Comparative genome assembly.

    abstract::One of the most complex and computationally intensive tasks of genome sequence analysis is genome assembly. Even today, few centres have the resources, in both software and hardware, to assemble a genome from the thousands or millions of individual sequences generated in a whole-genome shotgun sequencing project. With...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/5.3.237

    authors: Pop M,Phillippy A,Delcher AL,Salzberg SL

    更新日期:2004-09-01 00:00:00

  • Comprehensive characterization of tissue-specific circular RNAs in the human and mouse genomes.

    abstract::Circular RNA (circRNA) is a group of RNA family generated by RNA circularization, which was discovered ubiquitously across different species and tissues. However, there is no global view of tissue specificity for circRNAs to date. Here we performed the comprehensive analysis to characterize the features of human and m...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw081

    authors: Xia S,Feng J,Lei L,Hu J,Xia L,Wang J,Xiang Y,Liu L,Zhong S,Han L,He C

    更新日期:2017-11-01 00:00:00

  • iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.

    abstract::Regulation of proteolysis plays a critical role in a myriad of important cellular processes. The key to better understanding the mechanisms that control this process is to identify the specific substrates that each protease targets. To address this, we have developed iProt-Sub, a powerful bioinformatics tool for the a...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bby028

    authors: Song J,Wang Y,Li F,Akutsu T,Rawlings ND,Webb GI,Chou KC

    更新日期:2019-03-25 00:00:00

  • Drug response in association with pharmacogenomics and pharmacomicrobiomics: towards a better personalized medicine.

    abstract::Researchers have long been presented with the challenge imposed by the role of genetic heterogeneity in drug response. For many years, Pharmacogenomics and pharmacomicrobiomics has been investigating the influence of an individual's genetic background to drug response and disposition. More recently, the human gut micr...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa292

    authors: Hassan R,Allali I,Agamah FE,Elsheikh SSM,Thomford NE,Dandara C,Chimusa ER

    更新日期:2020-12-01 00:00:00

  • Extended application of genomic selection to screen multiomics data for prognostic signatures of prostate cancer.

    abstract::Prognostic tests using expression profiles of several dozen genes help provide treatment choices for prostate cancer (PCa). However, these tests require improvement to meet the clinical need for resolving overtreatment, which continues to be a pervasive problem in PCa management. Genomic selection (GS) methodology, wh...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa197

    authors: Li R,Wang S,Cui Y,Qu H,Chater JM,Zhang L,Wei J,Wang M,Xu Y,Yu L,Lu J,Feng Y,Zhou R,Huang Y,Ma R,Zhu J,Zhong W,Jia Z

    更新日期:2020-09-08 00:00:00

  • Mutational analysis in RNAs: comparing programs for RNA deleterious mutation prediction.

    abstract::Programs for RNA mutational analysis that are structure-based and rely on secondary structure prediction have been developed and expanded in the past several years. They can be used for a variety of purposes, such as in suggesting point mutations that will alter RNA virus replication or translation initiation, investi...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbq059

    authors: Barash D,Churkin A

    更新日期:2011-03-01 00:00:00

  • Toward more realistic drug-target interaction predictions.

    abstract::A number of supervised machine learning models have recently been introduced for the prediction of drug-target interactions based on chemical structure and genomic sequence information. Although these models could offer improved means for many network pharmacology applications, such as repositioning of drugs for new t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbu010

    authors: Pahikkala T,Airola A,Pietilä S,Shakyawar S,Szwajda A,Tang J,Aittokallio T

    更新日期:2015-03-01 00:00:00

  • SARS-CoV-2 hot-spot mutations are significantly enriched within inverted repeats and CpG island loci.

    abstract::SARS-CoV-2 is an intensively investigated virus from the order Nidovirales (Coronaviridae family) that causes COVID-19 disease in humans. Through enormous scientific effort, thousands of viral strains have been sequenced to date, thereby creating a strong background for deep bioinformatics studies of the SARS-CoV-2 ge...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa385

    authors: Goswami P,Bartas M,Lexa M,Bohálová N,Volná A,Červeň J,Červeňová V,Pečinka P,Špunda V,Fojta M,Brázda V

    更新日期:2020-12-21 00:00:00

  • A practical guide for the functional annotation of genetic variations using SNPnexus.

    abstract::Broader functional annotation of known as well as putative genetic variations is a valuable mean for prioritizing targets in disease studies and large-scale genotyping projects. In this article, we present a practical guide to SNPnexus, a web-based tool that provides an aggregate set of functional annotations for geno...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbt004

    authors: Dayem Ullah AZ,Lemoine NR,Chelala C

    更新日期:2013-07-01 00:00:00

  • A review of bioinformatics education in the UK.

    abstract::If the completion of the first draft of the human genome represents the coming of age of bioinformatics, then the emergence of bioinformatics as a university degree subject represents its establishment. In this paper bioinformatics as a subject for formal study is discussed, rather than as a subject for research, and ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/4.1.7

    authors: Counsell D

    更新日期:2003-03-01 00:00:00

  • Towards scaling elementary flux mode computation.

    abstract::While elementary flux mode (EFM) analysis is now recognized as a cornerstone computational technique for cellular pathway analysis and engineering, EFM application to genome-scale models remains computationally prohibitive. This article provides a review of aspects of EFM computation that elucidates bottlenecks in sca...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz094

    authors: Ullah E,Yosafshahi M,Hassoun S

    更新日期:2020-12-01 00:00:00

  • Single-cell transcriptome-based multilayer network biomarker for predicting prognosis and therapeutic response of gliomas.

    abstract::Occurrence and development of cancers are governed by complex networks of interacting intercellular and intracellular signals. The technology of single-cell RNA sequencing (scRNA-seq) provides an unprecedented opportunity for dissecting the interplay between the cancer cells and the associated microenvironment. Here w...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz040

    authors: Zhang J,Guan M,Wang Q,Zhang J,Zhou T,Sun X

    更新日期:2020-05-21 00:00:00

  • Dr AFC: drug repositioning through anti-fibrosis characteristic.

    abstract::Fibrosis is a key component in the pathogenic mechanism of a variety of diseases. These diseases involving fibrosis may share common mechanisms and therapeutic targets, and therefore common intervention strategies and medicines may be applicable for these diseases. For this reason, deliberately introducing anti-fibros...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa115

    authors: Wu D,Gao W,Li X,Tian C,Jiao N,Fang S,Xiao J,Xu Z,Zhu L,Zhang G,Zhu R

    更新日期:2020-06-22 00:00:00

  • Characteristics and evolution of the ecosystem of software tools supporting research in molecular biology.

    abstract::Daily work in molecular biology presently depends on a large number of computational tools. An in-depth, large-scale study of that 'ecosystem' of Web tools, its characteristics, interconnectivity, patterns of usage/citation, temporal evolution and rate of decay is crucial for understanding the forces that shape it and...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby001

    authors: Pazos F,Chagoyen M

    更新日期:2019-07-19 00:00:00

  • Bioinformatic analysis of SMN1-ACE/ACE2 interactions hinted at a potential protective effect of spinal muscular atrophy against COVID-19-induced lung injury.

    abstract::Patients with spinal muscular atrophy (SMA) are susceptible to the respiratory infections and might be at a heightened risk of poor clinical outcomes upon contracting coronavirus disease 2019 (COVID-19). In the face of the COVID-19 pandemic, the potential associations of SMA with the susceptibility to and prognosticat...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa285

    authors: Li Z,Li X,Shen J,Tan H,Rong T,Lin Y,Feng E,Chen Z,Jiao Y,Liu G,Zhang L,Vai Chan MT,Kei Wu WK

    更新日期:2020-11-14 00:00:00

  • Strategies for calibrating models of biology.

    abstract::Computational and mathematical modelling has become a valuable tool for investigating biological systems. Modelling enables prediction of how biological components interact to deliver system-level properties and extrapolation of biological system performance to contexts and experimental conditions where this is unknow...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby092

    authors: Read MN,Alden K,Timmis J,Andrews PS

    更新日期:2018-09-18 00:00:00

  • Advanced bioinformatics methods for practical applications in proteomics.

    abstract::Mass spectrometry (MS)-based proteomics has undergone rapid advancements in recent years, creating challenging problems for bioinformatics. We focus on four aspects where bioinformatics plays a crucial role (and proteomics is needed for clinical application): peptide-spectra matching (PSM) based on the new data-indepe...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx128

    authors: Goh WWB,Wong L

    更新日期:2019-01-18 00:00:00

  • Understanding the unimodal distributions of cancer occurrence rates: it takes two factors for a cancer to occur.

    abstract::Data from the SEER reports reveal that the occurrence rate of a cancer type generally follows a unimodal distribution over age, peaking at an age that is cancer-type specific and ranges from 30+ through 70+. Previous studies attribute such bell-shaped distributions to the reduced proliferative potential in senior year...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa349

    authors: Qiu S,An Z,Tan R,He PA,Jing J,Li H,Wu S,Xu Y

    更新日期:2020-12-30 00:00:00

  • Identifying miRNAs, targets and functions.

    abstract::microRNAs (miRNAs) are small endogenous non-coding RNAs that function as the universal specificity factors in post-transcriptional gene silencing. Discovering miRNAs, identifying their targets and further inferring miRNA functions have been a critical strategy for understanding normal biological processes of miRNAs an...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbs075

    authors: Liu B,Li J,Cairns MJ

    更新日期:2014-01-01 00:00:00

  • A dynamic framework for quantifying the genetic architecture of phenotypic plasticity.

    abstract::Despite its central role in the adaptation and microevolution of traits, the genetic architecture of phenotypic plasticity, i.e. multiple phenotypes produced by a single genotype in changing environments, remains elusive. We know little about the genes that underlie the plastic response of traits to the environment, t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbs009

    authors: Wang Z,Pang X,Lv Y,Xu F,Zhou T,Li X,Feng S,Li J,Li Z,Wu R

    更新日期:2013-01-01 00:00:00