Abstract:
:The increasing ease with which massive genetic information can be obtained from patients or healthy individuals has stimulated the development of interpretive bioinformatics tools as aids in clinical practice. Most such tools analyze evolutionary information and simple physical-chemical properties to predict whether replacement of one amino acid residue with another will be tolerated or cause disease. Those approaches achieve up to 80-85% accuracy as binary classifiers (neutral/pathogenic). As such accuracy is insufficient for medical decision to be based on, and it does not appear to be increasing, more precise methods, such as full-atom molecular dynamics (MD) simulations in explicit solvent, are also discussed. Then, to describe the goal of interpreting human genetic variations at large scale through MD simulations, we restrictively refer to all possible protein variants carrying single-amino-acid substitutions arising from single-nucleotide variations as the human variome. We calculate its size and develop a simple model that allows calculating the simulation time needed to have a 0.99 probability of observing unfolding events of any unstable variant. The knowledge of that time enables performing a binary classification of the variants (stable-potentially neutral/unstable-pathogenic). Our model indicates that the human variome cannot be simulated with present computing capabilities. However, if they continue to increase as per Moore's law, it could be simulated (at 65°C) spending only 3 years in the task if we started in 2031. The simulation of individual protein variomes is achievable in short times starting at present. International coordination seems appropriate to embark upon massive MD simulations of protein variants.
journal_name
Brief Bioinformjournal_title
Briefings in bioinformaticsauthors
Galano-Frutos JJ,García-Cebollada H,Sancho Jdoi
10.1093/bib/bbz146subject
Has Abstractpub_date
2021-01-18 00:00:00pages
3-19issue
1eissn
1467-5463issn
1477-4054pii
5669862journal_volume
22pub_type
杂志文章abstract::If the completion of the first draft of the human genome represents the coming of age of bioinformatics, then the emergence of bioinformatics as a university degree subject represents its establishment. In this paper bioinformatics as a subject for formal study is discussed, rather than as a subject for research, and ...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章,评审
doi:10.1093/bib/4.1.7
更新日期:2003-03-01 00:00:00
abstract::Computational and mathematical modelling has become a valuable tool for investigating biological systems. Modelling enables prediction of how biological components interact to deliver system-level properties and extrapolation of biological system performance to contexts and experimental conditions where this is unknow...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bby092
更新日期:2018-09-18 00:00:00
abstract::In the business and healthcare sectors data warehousing has provided effective solutions for information usage and knowledge discovery from databases. However, data warehousing applications in the biological research and development (R&D) sector are lagging far behind. The fuzziness and complexity of biological data r...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/1.2.190
更新日期:2000-05-01 00:00:00
abstract::This Briefing reviews the widely used, currently active, up-to-date databases derived from the worldwide Protein Data Bank (PDB) to facilitate browsing, finding and exploring its entries. These databases contain visualization and analysis tools tailored to specific kinds of molecules and interactions, often including ...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbw049
更新日期:2017-07-01 00:00:00
abstract::Understanding the interconnections of microbial pathogenicity phenomena, such as biofilm formation, quorum sensing and antimicrobial resistance, is a tremendous open challenge for biomedical research. Progress made by wet-lab researchers and bioinformaticians in understanding the underlying regulatory phenomena has be...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章,评审
doi:10.1093/bib/bbt071
更新日期:2015-01-01 00:00:00
abstract::A class-imbalanced classifier is a decision rule to predict the class membership of new samples from an available data set where the class sizes differ considerably. When the class sizes are very different, most standard classification algorithms may favor the larger (majority) class resulting in poor accuracy in the ...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章,评审
doi:10.1093/bib/bbs006
更新日期:2013-01-01 00:00:00
abstract::Researchers have long been presented with the challenge imposed by the role of genetic heterogeneity in drug response. For many years, Pharmacogenomics and pharmacomicrobiomics has been investigating the influence of an individual's genetic background to drug response and disposition. More recently, the human gut micr...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbaa292
更新日期:2020-12-01 00:00:00
abstract::Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the l...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bby124
更新日期:2020-03-23 00:00:00
abstract::Despite its central role in the adaptation and microevolution of traits, the genetic architecture of phenotypic plasticity, i.e. multiple phenotypes produced by a single genotype in changing environments, remains elusive. We know little about the genes that underlie the plastic response of traits to the environment, t...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章,评审
doi:10.1093/bib/bbs009
更新日期:2013-01-01 00:00:00
abstract::With the availability of gene expression data by RNA-seq, powerful statistical approaches for grouping similar gene expression profiles across different environments have become increasingly important. We describe and assess a computational model for clustering genes into distinct groups based on the pattern of gene e...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbt029
更新日期:2014-07-01 00:00:00
abstract::Correlated reaction sets (Co-Sets) are mathematically defined modules in biochemical reaction networks which facilitate the study of biological processes by decomposing complex reaction networks into conceptually simple units. According to the degree of association, Co-Sets can be classified into three types: perfect,...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbp068
更新日期:2011-03-01 00:00:00
abstract:UNLABELLED:So-called next-generation sequencing (NGS) has provided the ability to sequence on a massive scale at low cost, enabling biologists to perform powerful experiments and gain insight into biological processes. BamView has been developed to visualize and analyse sequence reads from NGS platforms, which have bee...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbr073
更新日期:2013-03-01 00:00:00
abstract::Protein dynamics is central to all biological processes, including signal transduction, cellular regulation and biological catalysis. Among them, in-depth exploration of ligand-driven protein dynamics contributes to an optimal understanding of protein function, which is particularly relevant to drug discovery. Hence, ...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbz141
更新日期:2020-12-01 00:00:00
abstract::Verification in phylogenetics represents an extremely difficult subject. Phylogenetic analysis deals with the reconstruction of evolutionary histories of species, and as long as mankind is not able to travel in time, it will not be possible to verify deep evolutionary histories reconstructed with modern computational ...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbq079
更新日期:2011-05-01 00:00:00
abstract::Access to gene expression data has become increasingly common in recent years; however, analysis has become more difficult as it is often desirable to integrate data from different platforms. Probe mapping across microarray platforms is the first and most crucial step for data integration. In this article, we systemat...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbr076
更新日期:2012-09-01 00:00:00
abstract::Heterophylly, i.e. morphological changes in leaves along the axis of an individual plant, is regarded as a strategy used by plants to cope with environmental change. However, little is known of the extent to which heterophylly is controlled by genes and how each underlying gene exerts its effect on heterophyllous vari...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbx011
更新日期:2018-07-20 00:00:00
abstract::Circular RNA (circRNA) is a group of RNA family generated by RNA circularization, which was discovered ubiquitously across different species and tissues. However, there is no global view of tissue specificity for circRNAs to date. Here we performed the comprehensive analysis to characterize the features of human and m...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbw081
更新日期:2017-11-01 00:00:00
abstract::Synonymous mutations do not change the encoded amino acids but may alter the structure or function of an mRNA in ways that impact gene function. Advances in next generation sequencing technologies have detected numerous synonymous mutations in the human genome. Several computational models have been proposed to predic...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbz047
更新日期:2020-05-21 00:00:00
abstract:MOTIVATION:Computational methods accelerate drug discovery and play an important role in biomedicine, such as molecular property prediction and compound-protein interaction (CPI) identification. A key challenge is to learn useful molecular representation. In the early years, molecular properties are mainly calculated b...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbaa266
更新日期:2020-11-04 00:00:00
abstract::The use of multiple testing procedures in the context of gene-set testing is an important but relatively underexposed topic. If a multiple testing method is used, this is usually a standard familywise error rate (FWER) or false discovery rate (FDR) controlling procedure in which the logical relationships that exist be...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbv091
更新日期:2016-09-01 00:00:00
abstract::Phylogenomic databases provide orthology predictions for species with fully sequenced genomes. Although the goal seems well-defined, the content of these databases differs greatly. Seven ortholog databases (Ensembl Compara, eggNOG, HOGENOM, InParanoid, OMA, OrthoDB, Panther) were compared on the basis of reference tre...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbr034
更新日期:2011-09-01 00:00:00
abstract::Moonlighting proteins provide more options for cells to execute multiple functions without increasing the genome and transcriptome complexity. Although there have long been calls for computational methods for the prediction of moonlighting proteins, no method has been designed for determining moonlighting long noncodi...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbz154
更新日期:2021-01-18 00:00:00
abstract::With the increasing recognition of its role in trait and disease development, it is crucial to account for genetic imprinting to illustrate the genetic architecture of complex traits. Genetic mapping can be innovated to test and estimate effects of genetic imprinting in a segregating population derived from experiment...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbu019
更新日期:2015-05-01 00:00:00
abstract::SARS-CoV-2 is an intensively investigated virus from the order Nidovirales (Coronaviridae family) that causes COVID-19 disease in humans. Through enormous scientific effort, thousands of viral strains have been sequenced to date, thereby creating a strong background for deep bioinformatics studies of the SARS-CoV-2 ge...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbaa385
更新日期:2020-12-21 00:00:00
abstract::Gene expression data have played an essential role in many biomedical studies. When the number of genes is large and sample size is limited, there is a 'lack of information' problem, leading to low-quality findings. To tackle this problem, both horizontal and vertical data integrations have been developed, where verti...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbaa169
更新日期:2020-08-14 00:00:00
abstract::The understanding of complex biological networks often relies on both a dedicated layout and a topology. Currently, there are three major competing layout-aware systems biology formats, but there are no software tools or software libraries supporting all of them. This complicates the management of molecular network la...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbz067
更新日期:2020-07-15 00:00:00
abstract::Protein phosphorylation is a reversible and ubiquitous post-translational modification that primarily occurs at serine, threonine and tyrosine residues and regulates a variety of biological processes. In this paper, we first briefly summarized the current progresses in computational prediction of eukaryotic protein ph...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bby122
更新日期:2020-03-23 00:00:00
abstract::Cells are compartmentalized by numerous membrane-bounded organelles and membraneless organelles (MLOs) to ensure temporal and spatial regulation of various biological processes. A number of MLOs, such as nucleoli, nuclear speckles and stress granules, exist as liquid droplets within the cells and arise from the conden...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbaa271
更新日期:2020-10-30 00:00:00
abstract::One of the most complex and computationally intensive tasks of genome sequence analysis is genome assembly. Even today, few centres have the resources, in both software and hardware, to assemble a genome from the thousands or millions of individual sequences generated in a whole-genome shotgun sequencing project. With...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/5.3.237
更新日期:2004-09-01 00:00:00
abstract::The so-called 'omics' approaches used in modern biology aim at massively characterizing the molecular repertories of living systems at different levels. Metabolomics is one of the last additions to the 'omics' family and it deals with the characterization of the set of metabolites in a given biological system. As meta...
journal_title:Briefings in bioinformatics
pub_type: 杂志文章
doi:10.1093/bib/bbs055
更新日期:2013-11-01 00:00:00