The computational challenges of applying comparative-based computational methods to whole genomes.

Abstract:

:The explosion in genomic sequence available in public databases has resulted in an unprecedented opportunity for computational whole genome analyses. A number of promising comparative-based approaches have been developed for gene finding, regulatory element discovery and other purposes, and it is clear that these tools will play a fundamental role in analysing the enormous amount of new data that is currently being generated. The synthesis of computationally intensive comparative computational approaches with the requirement for whole genome analysis represents both an unprecedented challenge and opportunity for computational scientists. We focus on a few of these challenges, using by way of example the problems of alignment, gene finding and regulatory element discovery, and discuss the issues that have arisen in attempts to solve these problems in the context of whole genome analysis pipelines.

journal_name

Brief Bioinform

authors

Dubchak I,Pachter L

doi

10.1093/bib/3.1.18

keywords:

subject

Has Abstract

pub_date

2002-03-01 00:00:00

pages

18-22

issue

1

eissn

1467-5463

issn

1477-4054

journal_volume

3

pub_type

杂志文章
  • Cloud 3D-QSAR: a web tool for the development of quantitative structure-activity relationship models in drug discovery.

    abstract::Effective drug discovery contributes to the treatment of numerous diseases but is limited by high costs and long cycles. The Quantitative Structure-Activity Relationship (QSAR) method was introduced to evaluate the activity of a large number of compounds virtually, reducing the time and labor costs required for chemic...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa276

    authors: Wang YL,Wang F,Shi XX,Jia CY,Wu FX,Hao GF,Yang GF

    更新日期:2020-11-03 00:00:00

  • FINDSITE: a combined evolution/structure-based approach to protein function prediction.

    abstract::A key challenge of the post-genomic era is the identification of the function(s) of all the molecules in a given organism. Here, we review the status of sequence and structure-based approaches to protein function inference and ligand screening that can provide functional insights for a significant fraction of the appr...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbp017

    authors: Skolnick J,Brylinski M

    更新日期:2009-07-01 00:00:00

  • CyanoPATH: a knowledgebase of genome-scale functional repertoire for toxic cyanobacterial blooms.

    abstract::CyanoPATH is a database that curates and analyzes the common genomic functional repertoire for cyanobacteria harmful algal blooms (CyanoHABs) in eutrophic waters. Based on the literature of empirical studies and genome/protein databases, it summarizes four types of information: common biological functions (pathways) d...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa375

    authors: Du W,Li G,Ho N,Jenkins L,Hockaday D,Tan J,Cao H

    更新日期:2020-12-16 00:00:00

  • Molecular dynamics simulations for genetic interpretation in protein coding regions: where we are, where to go and when.

    abstract::The increasing ease with which massive genetic information can be obtained from patients or healthy individuals has stimulated the development of interpretive bioinformatics tools as aids in clinical practice. Most such tools analyze evolutionary information and simple physical-chemical properties to predict whether r...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz146

    authors: Galano-Frutos JJ,García-Cebollada H,Sancho J

    更新日期:2021-01-18 00:00:00

  • Computational recognition for long non-coding RNA (lncRNA): Software and databases.

    abstract::Since the completion of the Human Genome Project, it has been widely established that most DNA is not transcribed into proteins. These non-protein-coding regions are believed to be moderators within transcriptional and post-transcriptional processes, which play key roles in the onset of diseases. Long non-coding RNAs ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbv114

    authors: Yotsukura S,duVerle D,Hancock T,Natsume-Kitatani Y,Mamitsuka H

    更新日期:2017-01-01 00:00:00

  • Resolving the problem of multiple accessions of the same transcript deposited across various public databases.

    abstract::Maintaining the consistency of genomic annotations is an increasingly complex task because of the iterative and dynamic nature of assembly and annotation, growing numbers of biological databases and insufficient integration of annotations across databases. As information exchange among databases is poor, a 'novel' seq...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw017

    authors: Weirick T,John D,Uchida S

    更新日期:2017-03-01 00:00:00

  • CeRNASeek: an R package for identification and analysis of ceRNA regulation.

    abstract::Competitive endogenous RNA (ceRNA) represents a novel layer of gene regulation that controls both physiological and pathological processes. However, there is still lack of computational tools for quickly identifying ceRNA regulation. To address this problem, we presented an R-package, CeRNASeek, which allows identifyi...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa048

    authors: Zhang M,Jin X,Li J,Tian Y,Wang Q,Li X,Xu J,Li Y,Li X

    更新日期:2020-05-04 00:00:00

  • A solid quality-control analysis of AB SOLiD short-read sequencing data.

    abstract::Next generation sequencers have greatly improved our ability to mine polymorphisms and mutations out of entire (or portions of) genomes. The reliability of their outputs, though, showed to be very related to the sequencing chemistry and to deeply affect the quality of the downstream analyses. We focus here on the two-...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbs048

    authors: Castellana S,Romani M,Valente EM,Mazza T

    更新日期:2013-11-01 00:00:00

  • Structural database resources for biological macromolecules.

    abstract::This Briefing reviews the widely used, currently active, up-to-date databases derived from the worldwide Protein Data Bank (PDB) to facilitate browsing, finding and exploring its entries. These databases contain visualization and analysis tools tailored to specific kinds of molecules and interactions, often including ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw049

    authors: Abriata LA

    更新日期:2017-07-01 00:00:00

  • Multilevel heterogeneous omics data integration with kernel fusion.

    abstract::High-throughput omics data are generated almost with no limit nowadays. It becomes increasingly important to integrate different omics data types to disentangle the molecular machinery of complex diseases with the hope for better disease prevention and treatment. Since the relationship among different omics data featu...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby115

    authors: Yang H,Cao H,He T,Wang T,Cui Y

    更新日期:2018-11-29 00:00:00

  • Proteome-scale analysis of phase-separated proteins in immunofluorescence images.

    abstract::Phase separation is an important mechanism that mediates the spatial distribution of proteins in different cellular compartments. While phase-separated proteins share certain sequence characteristics, including intrinsically disordered regions (IDRs) and prion-like domains, such characteristics are insufficient for ma...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa187

    authors: Yu C,Shen B,You K,Huang Q,Shi M,Wu C,Chen Y,Zhang C,Li T

    更新日期:2020-09-02 00:00:00

  • The GTPB training programme in Portugal.

    abstract::The Gulbenkian Training Programme in Bioinformatics has been offering hands-on training courses in Oeiras, PT for more than a decade. This article is a review of its functional organization and evolution. We aim to share our experience with people considering setting-up similar training facilities elsewhere. More than...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbq063

    authors: Fernandes PL

    更新日期:2010-11-01 00:00:00

  • TOD-CUP: a gene expression rank-based majority vote algorithm for tissue origin diagnosis of cancers of unknown primary.

    abstract::Gene expression profiling holds great potential as a new approach to histological diagnosis and precision medicine of cancers of unknown primary (CUP). Batch effects and different data types greatly decrease the predictive performance of biomarker-based algorithms, and few methods have been widely applied to identify ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa031

    authors: Shen Y,Chu Q,Yin X,He Y,Bai P,Wang Y,Fang W,Timko MP,Fan L,Jiang W

    更新日期:2020-04-08 00:00:00

  • Public data and open source tools for multi-assay genomic investigation of disease.

    abstract::Molecular interrogation of a biological sample through DNA sequencing, RNA and microRNA profiling, proteomics and other assays, has the potential to provide a systems level approach to predicting treatment response and disease progression, and to developing precision therapies. Large publicly funded projects have gene...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbv080

    authors: Kannan L,Ramos M,Re A,El-Hachem N,Safikhani Z,Gendoo DM,Davis S,Gomez-Cabrero D,Castelo R,Hansen KD,Carey VJ,Morgan M,Culhane AC,Haibe-Kains B,Waldron L

    更新日期:2016-07-01 00:00:00

  • Evaluation of research in biomedical ontologies.

    abstract::Ontologies are now pervasive in biomedicine, where they serve as a means to standardize terminology, to enable access to domain knowledge, to verify data consistency and to facilitate integrative analyses over heterogeneous biomedical data. For this purpose, research on biomedical ontologies applies theories and metho...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbs053

    authors: Hoehndorf R,Dumontier M,Gkoutos GV

    更新日期:2013-11-01 00:00:00

  • Methodological aspects of whole-genome bisulfite sequencing analysis.

    abstract::The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation beyond CpG sites and CpG islands. These technologies have opened new avenues to understand the interplay between epigenetic events, chromatin plasticity and gene regulation. ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbu016

    authors: Adusumalli S,Mohd Omar MF,Soong R,Benoukraf T

    更新日期:2015-05-01 00:00:00

  • Common introns within orthologous genes: software and application to plants.

    abstract::The residence of spliceosomal introns within protein-coding genes can fluctuate over time, with genes gaining, losing or conserving introns in a complex process that is not entirely understood. One approach for studying intron evolution is to compare introns with respect to position and type within closely related gen...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbp051

    authors: Wilkerson MD,Ru Y,Brendel VP

    更新日期:2009-11-01 00:00:00

  • Conceptual framework and pilot study to benchmark phylogenomic databases based on reference gene trees.

    abstract::Phylogenomic databases provide orthology predictions for species with fully sequenced genomes. Although the goal seems well-defined, the content of these databases differs greatly. Seven ortholog databases (Ensembl Compara, eggNOG, HOGENOM, InParanoid, OMA, OrthoDB, Panther) were compared on the basis of reference tre...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbr034

    authors: Boeckmann B,Robinson-Rechavi M,Xenarios I,Dessimoz C

    更新日期:2011-09-01 00:00:00

  • Small noncoding RNA discovery and profiling with sRNAtools based on high-throughput sequencing.

    abstract::Small noncoding RNAs (sRNA/sncRNAs) are generated from different genomic loci and play important roles in biological processes, such as cell proliferation and the regulation of gene expression. Next-generation sequencing (NGS) has provided an unprecedented opportunity to discover and quantify diverse kinds of sncRNA, ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbz151

    authors: Liu Q,Ding C,Lang X,Guo G,Chen J,Su X

    更新日期:2021-01-18 00:00:00

  • Mutational analysis in RNAs: comparing programs for RNA deleterious mutation prediction.

    abstract::Programs for RNA mutational analysis that are structure-based and rely on secondary structure prediction have been developed and expanded in the past several years. They can be used for a variety of purposes, such as in suggesting point mutations that will alter RNA virus replication or translation initiation, investi...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbq059

    authors: Barash D,Churkin A

    更新日期:2011-03-01 00:00:00

  • Dynamics of transcriptional and post-transcriptional regulation.

    abstract::Despite gene expression programs being notoriously complex, RNA abundance is usually assumed as a proxy for transcriptional activity. Recently developed approaches, able to disentangle transcriptional and post-transcriptional regulatory processes, have revealed a more complex scenario. It is now possible to work out h...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa389

    authors: Furlan M,de Pretis S,Pelizzola M

    更新日期:2020-12-22 00:00:00

  • HVIDB: a comprehensive database for human-virus protein-protein interactions.

    abstract::While leading to millions of people's deaths every year the treatment of viral infectious diseases remains a huge public health challenge.Therefore, an in-depth understanding of human-virus protein-protein interactions (PPIs) as the molecular interface between a virus and its host cell is of paramount importance to ob...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa425

    authors: Yang X,Lian X,Fu C,Wuchty S,Yang S,Zhang Z

    更新日期:2021-01-30 00:00:00

  • Fuzzy Petri nets for modelling of uncertain biological systems.

    abstract::The modelling of biological systems is accompanied with epistemic uncertainties that range from structural uncertainty to parametric uncertainty due to such limitations as insufficient understanding of the underlying mechanism and incomplete measurement data of a system. Fuzzy logic approaches such as fuzzy Petri nets...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bby118

    authors: Liu F,Heiner M,Gilbert D

    更新日期:2018-12-27 00:00:00

  • Pioneering topological methods for network-based drug-target prediction by exploiting a brain-network self-organization theory.

    abstract::The bipartite network representation of the drug-target interactions (DTIs) in a biosystem enhances understanding of the drugs' multifaceted action modes, suggests therapeutic switching for approved drugs and unveils possible side effects. As experimental testing of DTIs is costly and time-consuming, computational pre...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbx041

    authors: Durán C,Daminelli S,Thomas JM,Haupt VJ,Schroeder M,Cannistraci CV

    更新日期:2018-11-27 00:00:00

  • Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

    abstract::Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to t...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa304

    authors: Manavalan B,Basith S,Shin TH,Lee G

    更新日期:2020-11-25 00:00:00

  • MloDisDB: a manually curated database of the relations between membraneless organelles and diseases.

    abstract::Cells are compartmentalized by numerous membrane-bounded organelles and membraneless organelles (MLOs) to ensure temporal and spatial regulation of various biological processes. A number of MLOs, such as nucleoli, nuclear speckles and stress granules, exist as liquid droplets within the cells and arise from the conden...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbaa271

    authors: Hou C,Xie H,Fu Y,Ma Y,Li T

    更新日期:2020-10-30 00:00:00

  • The digital revolution in phenotyping.

    abstract::Phenotypes have gained increased notoriety in the clinical and biological domain owing to their application in numerous areas such as the discovery of disease genes and drug targets, phylogenetics and pharmacogenomics. Phenotypes, defined as observable characteristics of organisms, can be seen as one of the bridges th...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbv083

    authors: Oellrich A,Collier N,Groza T,Rebholz-Schuhmann D,Shah N,Bodenreider O,Boland MR,Georgiev I,Liu H,Livingston K,Luna A,Mallon AM,Manda P,Robinson PN,Rustici G,Simon M,Wang L,Winnenburg R,Dumontier M

    更新日期:2016-09-01 00:00:00

  • Pattern recognition analysis on long noncoding RNAs: a tool for prediction in plants.

    abstract:MOTIVATION:Long noncoding RNAs (lncRNAs) correspond to a eukaryotic noncoding RNA class that gained great attention in the past years as a higher layer of regulation for gene expression in cells. There is, however, a lack of specific computational approaches to reliably predict lncRNA in plants, which contrast the vari...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bby034

    authors: Negri TDC,Alves WAL,Bugatti PH,Saito PTM,Domingues DS,Paschoal AR

    更新日期:2019-03-25 00:00:00

  • Opportunities for community awareness platforms in personal genomics and bioinformatics education.

    abstract::Precision and personalized medicine will be increasingly based on the integration of various type of information, particularly electronic health records and genome sequences. The availability of cheap genome sequencing services and the information interoperability will increase the role of online bioinformatics analys...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章

    doi:10.1093/bib/bbw078

    authors: Bianchi L,Liò P

    更新日期:2017-11-01 00:00:00

  • Class-imbalanced classifiers for high-dimensional data.

    abstract::A class-imbalanced classifier is a decision rule to predict the class membership of new samples from an available data set where the class sizes differ considerably. When the class sizes are very different, most standard classification algorithms may favor the larger (majority) class resulting in poor accuracy in the ...

    journal_title:Briefings in bioinformatics

    pub_type: 杂志文章,评审

    doi:10.1093/bib/bbs006

    authors: Lin WJ,Chen JJ

    更新日期:2013-01-01 00:00:00