Advanced bioinformatics methods for practical applications in proteomics.

Abstract:

:Mass spectrometry (MS)-based proteomics has undergone rapid advancements in recent years, creating challenging problems for bioinformatics. We focus on four aspects where bioinformatics plays a crucial role (and proteomics is needed for clinical application): peptide-spectra matching (PSM) based on the new data-independent acquisition (DIA) paradigm, resolving missing proteins (MPs), dealing with biological and technical heterogeneity in data and statistical feature selection (SFS). DIA is a brute-force strategy that provides greater width and depth but, because it indiscriminately captures spectra such that signal from multiple peptides is mixed, getting good PSMs is difficult. We consider two strategies: simplification of DIA spectra to pseudo-data-dependent acquisition spectra or, alternatively, brute-force search of each DIA spectra against known reference libraries. The MP problem arises when proteins are never (or inconsistently) detected by MS. When observed in at least one sample, imputation methods can be used to guess the approximate protein expression level. If never observed at all, network/protein complex-based contextualization provides an independent prediction platform. Data heterogeneity is a difficult problem with two dimensions: technical (batch effects), which should be removed, and biological (including demography and disease subpopulations), which should be retained. Simple normalization is seldom sufficient, while batch effect-correction algorithms may create errors. Batch effect-resistant normalization methods are a viable alternative. Finally, SFS is vital for practical applications. While many methods exist, there is no best method, and both upstream (e.g. normalization) and downstream processing (e.g. multiple-testing correction) are performance confounders. We also discuss signal detection when class effects are weak.

journal_name

Brief Bioinform

authors

Goh WWB,Wong L

doi

10.1093/bib/bbx128

subject

Has Abstract

pub_date

2019-01-18 00:00:00

pages

347-355

issue

1

eissn

1467-5463

issn

1477-4054

pii

4372411

journal_volume

20

pub_type

杂志文章
  • An open-pollinated design for mapping imprinting genes in natural populations.

    abstract::With the increasing recognition of its role in trait and disease development, it is crucial to account for genetic imprinting to illustrate the genetic architecture of complex traits. Genetic mapping can be innovated to test and estimate effects of genetic imprinting in a segregating population derived from experiment...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbu019

    authors: Sun L,Zhu X,Bo W,Xu F,Cheng T,Zhang Q,Wu R

    更新日期:2015-05-01 00:00:00

  • Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates.

    abstract::RNA-seq has been an increasingly popular high-throughput platform to identify differentially expressed (DE) genes, which is much more reproducible and accurate than the previous microarray technology. Yet, a number of statistical issues remain to be resolved in data analysis, largely due to the high-throughput data vo...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbv035

    authors: Gu X

    更新日期:2016-03-01 00:00:00

  • Opportunities for community awareness platforms in personal genomics and bioinformatics education.

    abstract::Precision and personalized medicine will be increasingly based on the integration of various type of information, particularly electronic health records and genome sequences. The availability of cheap genome sequencing services and the information interoperability will increase the role of online bioinformatics analys...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw078

    authors: Bianchi L,Liò P

    更新日期:2017-11-01 00:00:00

  • Towards a comprehensive picture of the genetic landscape of complex traits.

    abstract::The formation of phenotypic traits, such as biomass production, tumor volume and viral abundance, undergoes a complex process in which interactions between genes and developmental stimuli take place at each level of biological organization from cells to organisms. Traditional studies emphasize the impact of genes by d...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbs049

    authors: Wang Z,Wang Y,Wang N,Wang J,Wang Z,Vallejos CE,Wu R

    更新日期:2014-01-01 00:00:00

  • Elucidating the editome: bioinformatics approaches for RNA editing detection.

    abstract::RNA editing is a widespread co/posttranscriptional mechanism affecting primary RNAs by specific nucleotide modifications, which plays relevant roles in molecular processes including regulation of gene expression and/or the processing of noncoding RNAs. In recent years, the detection of editing sites has been improved ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbx129

    authors: Diroma MA,Ciaccia L,Pesole G,Picardi E

    更新日期:2019-03-22 00:00:00

  • AlzRiskMR database: an online database for the impact of exposure factors on Alzheimer's disease.

    abstract::In view of great difficulties in the pathogenesis analysis of Alzheimer's disease (AD) presently, profiling the modifiable risk factors is crucial for early detection and intervention of AD. However, the causal associations among them have yet to be identified, and the effective integration and application of these da...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa213

    authors: Wang Z,Meng L,Liu H,Shen L,Ji HF

    更新日期:2020-09-21 00:00:00

  • In silico signaling modeling to understand cancer pathways and treatment responses.

    abstract::Precision medicine has changed thinking in cancer therapy, highlighting a better understanding of the individual clinical interventions. But what role do the drivers and pathways identified from pan-cancer genome analysis play in the tumor? In this letter, we will highlight the importance of in silico modeling in prec...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz033

    authors: Kunz M,Jeromin J,Fuchs M,Christoph J,Veronesi G,Flentje M,Nietzer S,Dandekar G,Dandekar T

    更新日期:2020-05-21 00:00:00

  • Agents in bioinformatics, computational and systems biology.

    abstract::The adoption of agent technologies and multi-agent systems constitutes an emerging area in bioinformatics. In this article, we report on the activity of the Working Group on Agents in Bioinformatics (BIOAGENTS) founded during the first AgentLink III Technical Forum meeting on the 2nd of July, 2004, in Rome. The meetin...

    journal_title:Briefings in bioinformatics

    pub_type:

    doi:10.1093/bib/bbl014

    authors: Merelli E,Armano G,Cannata N,Corradini F,d'Inverno M,Doms A,Lord P,Martin A,Milanesi L,Möller S,Schroeder M,Luck M

    更新日期:2007-01-01 00:00:00

  • Circular RNA identification based on multiple seed matching.

    abstract::Computational detection methods have been widely used in studies on the biogenesis and the function of circular RNAs (circRNAs). However, all of the existing tools showed disadvantages on certain aspects of circRNA detection. Here, we propose an improved multithreading detection tool, CIRI2, which used an adapted maxi...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx014

    authors: Gao Y,Zhang J,Zhao F

    更新日期:2018-09-28 00:00:00

  • Current development of integrated web servers for preclinical safety and pharmacokinetics assessments in drug development.

    abstract::In drug development, preclinical safety and pharmacokinetics assessments of candidate drugs to ensure the safety profile are a must. While in vivo and in vitro tests are traditionally used, experimental determinations have disadvantages, as they are usually time-consuming and costly. In silico predictions of these pre...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa160

    authors: Hsiao Y,Su BH,Tseng YJ

    更新日期:2020-08-07 00:00:00

  • Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

    abstract::Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa304

    authors: Manavalan B,Basith S,Shin TH,Lee G

    更新日期:2020-11-25 00:00:00

  • The computational challenges of applying comparative-based computational methods to whole genomes.

    abstract::The explosion in genomic sequence available in public databases has resulted in an unprecedented opportunity for computational whole genome analyses. A number of promising comparative-based approaches have been developed for gene finding, regulatory element discovery and other purposes, and it is clear that these tool...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/3.1.18

    authors: Dubchak I,Pachter L

    更新日期:2002-03-01 00:00:00

  • Benchmarking computational tools for polymorphic transposable element detection.

    abstract::Transposable elements (TEs) are an important source of human genetic variation with demonstrable effects on phenotype. Recently, a number of computational methods for the detection of polymorphic TE (polyTE) insertion sites from next-generation sequence data have been developed. The use of such tools will become incre...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw072

    authors: Rishishwar L,Mariño-Ramírez L,Jordan IK

    更新日期:2017-11-01 00:00:00

  • Methodological aspects of whole-genome bisulfite sequencing analysis.

    abstract::The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation beyond CpG sites and CpG islands. These technologies have opened new avenues to understand the interplay between epigenetic events, chromatin plasticity and gene regulation. ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbu016

    authors: Adusumalli S,Mohd Omar MF,Soong R,Benoukraf T

    更新日期:2015-05-01 00:00:00

  • Biodiversity informatics: organizing and linking information across the spectrum of life.

    abstract::Biological knowledge can be inferred from three major levels of information: molecules, organisms and ecologies. Bioinformatics is an established field that has made significant advances in the development of systems and techniques to organize contemporary molecular data; biodiversity informatics is an emerging discip...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbm037

    authors: Sarkar IN

    更新日期:2007-09-01 00:00:00

  • Hybrid modelling of biological systems using fuzzy continuous Petri nets.

    abstract::Integrated modelling of biological systems is challenged by composing components with sufficient kinetic data and components with insufficient kinetic data or components built only using experts' experience and knowledge. Fuzzy continuous Petri nets (FCPNs) combine continuous Petri nets with fuzzy inference systems, a...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz114

    authors: Liu F,Sun W,Heiner M,Gilbert D

    更新日期:2021-01-18 00:00:00

  • Bioinformatics education--perspectives and challenges out of Africa.

    abstract::The discipline of bioinformatics has developed rapidly since the complete sequencing of the first genomes in the 1990s. The development of many high-throughput techniques during the last decades has ensured that bioinformatics has grown into a discipline that overlaps with, and is required for, the modern practice of ...

    journal_title:Briefings in bioinformatics

    pub_type: 历史文章,杂志文章

    doi:10.1093/bib/bbu022

    authors: Tastan Bishop Ö,Adebiyi EF,Alzohairy AM,Everett D,Ghedira K,Ghouila A,Kumuthini J,Mulder NJ,Panji S,Patterton HG,H3ABioNet Consortium.,H3Africa Consortium.

    更新日期:2015-03-01 00:00:00

  • Cloud 3D-QSAR: a web tool for the development of quantitative structure-activity relationship models in drug discovery.

    abstract::Effective drug discovery contributes to the treatment of numerous diseases but is limited by high costs and long cycles. The Quantitative Structure-Activity Relationship (QSAR) method was introduced to evaluate the activity of a large number of compounds virtually, reducing the time and labor costs required for chemic...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa276

    authors: Wang YL,Wang F,Shi XX,Jia CY,Wu FX,Hao GF,Yang GF

    更新日期:2020-11-03 00:00:00

  • A feature-based approach to predict hot spots in protein-DNA binding interfaces.

    abstract::DNA-binding hot spot residues of proteins are dominant and fundamental interface residues that contribute most of the binding free energy of protein-DNA interfaces. As experimental methods for identifying hot spots are expensive and time consuming, computational approaches are urgently required in predicting hot spots...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz037

    authors: Zhang S,Zhao L,Zheng CH,Xia J

    更新日期:2020-05-21 00:00:00

  • Teaching the bioinformatics of signaling networks: an integrated approach to facilitate multi-disciplinary learning.

    abstract::The number of bioinformatics tools and resources that support molecular and cell biology approaches is continuously expanding. Moreover, systems and network biology analyses are accompanied more and more by integrated bioinformatics methods. Traditional information-centered university teaching methods often fail, as (...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbt024

    authors: Korcsmaros T,Dunai ZA,Vellai T,Csermely P

    更新日期:2013-09-01 00:00:00

  • Data-driven rational biosynthesis design: from molecules to cell factories.

    abstract::A proliferation of chemical, reaction and enzyme databases, new computational methods and software tools for data-driven rational biosynthesis design have emerged in recent years. With the coming of the era of big data, particularly in the bio-medical field, data-driven rational biosynthesis design could potentially b...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz065

    authors: Chen F,Yuan L,Ding S,Tian Y,Hu QN

    更新日期:2020-07-15 00:00:00

  • A computing platform to map ecological metabolism by integrating functional mapping and the metabolic theory of ecology.

    abstract::Whole-organism metabolic rate co-varies allometrically with body mass, and is also affected by temperature through different biochemical mechanisms. Here we implement a computational platform to map specific quantitative trait loci (QTLs) that govern the dependence of metabolic rate on size and temperature. The model ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbv116

    authors: Yan Q,Zhu X,Jiang L,Ye M,Sun L,Terblanche JS,Wu R

    更新日期:2017-01-01 00:00:00

  • Conceptual and computational framework for logical modelling of biological networks deregulated in diseases.

    abstract::Mathematical models can serve as a tool to formalize biological knowledge from diverse sources, to investigate biological questions in a formal way, to test experimental hypotheses, to predict the effect of perturbations and to identify underlying mechanisms. We present a pipeline of computational tools that performs ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx163

    authors: Montagud A,Traynard P,Martignetti L,Bonnet E,Barillot E,Zinovyev A,Calzone L

    更新日期:2019-07-19 00:00:00

  • Mutational analysis in RNAs: comparing programs for RNA deleterious mutation prediction.

    abstract::Programs for RNA mutational analysis that are structure-based and rely on secondary structure prediction have been developed and expanded in the past several years. They can be used for a variety of purposes, such as in suggesting point mutations that will alter RNA virus replication or translation initiation, investi...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbq059

    authors: Barash D,Churkin A

    更新日期:2011-03-01 00:00:00

  • iProt-Sub: a comprehensive package for accurately mapping and predicting protease-specific substrates and cleavage sites.

    abstract::Regulation of proteolysis plays a critical role in a myriad of important cellular processes. The key to better understanding the mechanisms that control this process is to identify the specific substrates that each protease targets. To address this, we have developed iProt-Sub, a powerful bioinformatics tool for the a...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bby028

    authors: Song J,Wang Y,Li F,Akutsu T,Rawlings ND,Webb GI,Chou KC

    更新日期:2019-03-25 00:00:00

  • Bioinformatic analysis of SMN1-ACE/ACE2 interactions hinted at a potential protective effect of spinal muscular atrophy against COVID-19-induced lung injury.

    abstract::Patients with spinal muscular atrophy (SMA) are susceptible to the respiratory infections and might be at a heightened risk of poor clinical outcomes upon contracting coronavirus disease 2019 (COVID-19). In the face of the COVID-19 pandemic, the potential associations of SMA with the susceptibility to and prognosticat...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa285

    authors: Li Z,Li X,Shen J,Tan H,Rong T,Lin Y,Feng E,Chen Z,Jiao Y,Liu G,Zhang L,Vai Chan MT,Kei Wu WK

    更新日期:2020-11-14 00:00:00

  • Architecture for interoperable software in biology.

    abstract::Understanding biological complexity demands a combination of high-throughput data and interdisciplinary skills. One way to bring to bear the necessary combination of data types and expertise is by encapsulating domain knowledge in software and composing that software to create a customized data analysis environment. T...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbs074

    authors: Bare JC,Baliga NS

    更新日期:2014-07-01 00:00:00

  • Comparing enrichment analysis and machine learning for identifying gene properties that discriminate between gene classes.

    abstract::Biologists very often use enrichment methods based on statistical hypothesis tests to identify gene properties that are significantly over-represented in a given set of genes of interest, by comparison with a 'background' set of genes. These enrichment methods, although based on rigorous statistical foundations, are n...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz028

    authors: Fabris F,Palmer D,de Magalhães JP,Freitas AA

    更新日期:2020-05-21 00:00:00

  • The Beta Workbench: a computational tool to study the dynamics of biological systems.

    abstract::We introduce the Beta Workbench (BWB), a scalable tool built on top of the newly defined BlenX language to model, simulate and analyse biological systems. We show the features and the incremental modelling process supported by the BWB on a running example based on the mitogen-activated kinase pathway. Finally, we prov...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbn023

    authors: Dematté L,Priami C,Romanel A

    更新日期:2008-09-01 00:00:00

  • The GTPB training programme in Portugal.

    abstract::The Gulbenkian Training Programme in Bioinformatics has been offering hands-on training courses in Oeiras, PT for more than a decade. This article is a review of its functional organization and evolution. We aim to share our experience with people considering setting-up similar training facilities elsewhere. More than...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbq063

    authors: Fernandes PL

    更新日期:2010-11-01 00:00:00