New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.

Abstract:

With the development of next-generation sequencing (NGS) technologies, a large amount of short-read data has been generated. Assembly of these short reads can be challenging for genomes and metagenomes without template sequences, making alignment-based genome sequence comparison difficult. In addition, sequence reads from NGS can come from different regions of various genomes and they may not be alignable. Sequence signature-based methods for genome comparison based on the frequencies of word patterns in genomes and metagenomes can potentially be useful for the analysis of short-read data from NGS. Here we review the recent development of alignment-free genome and metagenome comparison based on the frequencies of word patterns, with emphasis on the dissimilarity measures between sequences, the statistical power of these measures when two sequences are related and the applications of these measures to NGS data.
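
As a rough illustration of comparing sequences by word-pattern frequencies, the sketch below counts k-mers (words of length k) in two sequences and computes the classical D2 statistic, the inner product of the two count vectors, together with a simple cosine-style dissimilarity derived from it. The function names and the cosine normalization are assumptions made for this example; they are not taken from the article, which reviews several measures that this sketch does not implement.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k):
    """Count every overlapping word of length k in seq (case-insensitive)."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(x, y):
    """D2 statistic: sum over words of (count in x) * (count in y)."""
    # Counter returns 0 for words that are absent, so no KeyError here.
    return sum(count * y[word] for word, count in x.items())


def cosine_dissimilarity(x, y):
    """Illustrative normalization: 1 - D2(x, y) / sqrt(D2(x, x) * D2(y, y))."""
    norm = sqrt(d2(x, x) * d2(y, y))
    return 1.0 - d2(x, y) / norm if norm else 1.0


if __name__ == "__main__":
    a = kmer_counts("ACGTACGTAC", 3)
    b = kmer_counts("ACGTTCGTAC", 3)
    print(d2(a, b), cosine_dissimilarity(a, b))
```

Because only word counts are used, the same functions can be applied to pooled counts from unassembled reads, which is the setting the abstract describes.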

journal_name

Brief Bioinform

authors

Song K,Ren J,Reinert G,Deng M,Waterman MS,Sun F

doi

10.1093/bib/bbt067

subject

Has Abstract

pub_date

2014-05-01 00:00:00

pages

343-53

issue

3

eissn

1477-4054

issn

1467-5463

pii

bbt067

journal_volume

15

pub_type

Journal Article, Review
  • The mechanistic, diagnostic and therapeutic novel nucleic acids for hepatocellular carcinoma emerging in past score years.

    abstract::Despite The Central Dogma states the destiny of gene as 'DNA makes RNA and RNA makes protein', the nucleic acids not only store and transmit genetic information but also, surprisingly, join in intracellular vital movement as a regulator of gene expression. Bioinformatics has contributed to knowledge for a series of em...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa023

    authors: Zhang S,Zhou Y,Wang Y,Wang Z,Xiao Q,Zhang Y,Lou Y,Qiu Y,Zhu F

    update_date:2020-04-06 00:00:00

  • Molecular dynamics simulations for genetic interpretation in protein coding regions: where we are, where to go and when.

    abstract::The increasing ease with which massive genetic information can be obtained from patients or healthy individuals has stimulated the development of interpretive bioinformatics tools as aids in clinical practice. Most such tools analyze evolutionary information and simple physical-chemical properties to predict whether r...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz146

    authors: Galano-Frutos JJ,García-Cebollada H,Sancho J

    update_date:2021-01-18 00:00:00

  • Methodological aspects of whole-genome bisulfite sequencing analysis.

    abstract::The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation beyond CpG sites and CpG islands. These technologies have opened new avenues to understand the interplay between epigenetic events, chromatin plasticity and gene regulation. ...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbu016

    authors: Adusumalli S,Mohd Omar MF,Soong R,Benoukraf T

    update_date:2015-05-01 00:00:00

  • Pattern recognition analysis on long noncoding RNAs: a tool for prediction in plants.

    abstract:MOTIVATION:Long noncoding RNAs (lncRNAs) correspond to a eukaryotic noncoding RNA class that gained great attention in the past years as a higher layer of regulation for gene expression in cells. There is, however, a lack of specific computational approaches to reliably predict lncRNA in plants, which contrast the vari...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bby034

    authors: Negri TDC,Alves WAL,Bugatti PH,Saito PTM,Domingues DS,Paschoal AR

    update_date:2019-03-25 00:00:00

  • Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.

    abstract::Functional annotation of protein sequence with high accuracy has become one of the most important issues in modern biomedical studies, and computational approaches of significantly accelerated analysis process and enhanced accuracy are greatly desired. Although a variety of methods have been developed to elevate prote...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz081

    authors: Hong J,Luo Y,Zhang Y,Ying J,Xue W,Xie T,Tao L,Zhu F

    update_date:2020-07-15 00:00:00

  • A brief history of bioinformatics.

    abstract::It is easy for today's students and researchers to believe that modern bioinformatics emerged recently to assist next-generation sequencing data analysis. However, the very beginnings of bioinformatics occurred more than 50 years ago, when desktop computers were still a hypothesis and DNA could not yet be sequenced. T...

    journal_title:Briefings in bioinformatics

    pub_type: Historical Article, Journal Article, Review

    doi:10.1093/bib/bby063

    authors: Gauthier J,Vincent AT,Charette SJ,Derome N

    update_date:2019-11-27 00:00:00

  • Understanding the unimodal distributions of cancer occurrence rates: it takes two factors for a cancer to occur.

    abstract::Data from the SEER reports reveal that the occurrence rate of a cancer type generally follows a unimodal distribution over age, peaking at an age that is cancer-type specific and ranges from 30+ through 70+. Previous studies attribute such bell-shaped distributions to the reduced proliferative potential in senior year...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa349

    authors: Qiu S,An Z,Tan R,He PA,Jing J,Li H,Wu S,Xu Y

    update_date:2020-12-30 00:00:00

  • Optimizing drug development in oncology by clinical trial simulation: Why and how?

    abstract::In therapeutic research, the safety and efficacy of pharmaceutical products are necessarily tested on humans via clinical trials after an extensive and expensive preclinical development period. Methodologies such as computer modeling and clinical trial simulation (CTS) might represent a valuable option to reduce anima...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbx055

    authors: Gal J,Milano G,Ferrero JM,Saâda-Bouzid E,Viotti J,Chabaud S,Gougis P,Le Tourneau C,Schiappa R,Paquet A,Chamorey E

    update_date:2018-11-27 00:00:00

  • Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools.

    abstract::Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the l...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby124

    authors: Su R,Hu J,Zou Q,Manavalan B,Wei L

    update_date:2020-03-23 00:00:00

  • Identifying drug-target interactions based on graph convolutional network and deep neural network.

    abstract::Identification of new drug-target interactions (DTIs) is an important but a time-consuming and costly step in drug discovery. In recent years, to mitigate these drawbacks, researchers have sought to identify DTIs using computational approaches. However, most existing methods construct drug networks and target networks...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa044

    authors: Zhao T,Hu Y,Valsdottir LR,Zang T,Peng J

    update_date:2020-05-04 00:00:00

  • Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

    abstract::Deoxyribonucleic acid replication is one of the most crucial tasks taking place in the cell, and it has to be precisely regulated. This process is initiated in the replication origins (ORIs), and thus it is essential to identify such sites for a deeper understanding of the cellular processes and functions related to t...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa304

    authors: Manavalan B,Basith S,Shin TH,Lee G

    update_date:2020-11-25 00:00:00

  • Privacy-preserving techniques of genomic data-a survey.

    abstract::Genomic data hold salient information about the characteristics of a living organism. Throughout the past decade, pinnacle developments have given us more accurate and inexpensive methods to retrieve genome sequences of humans. However, with the advancement of genomic research, there is a growing privacy concern regar...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbx139

    authors: Aziz MMA,Sadat MN,Alhadidi D,Wang S,Jiang X,Brown CL,Mohammed N

    update_date:2019-05-21 00:00:00

  • A network-based algorithm for the identification of moonlighting noncoding RNAs and its application in sepsis.

    abstract::Moonlighting proteins provide more options for cells to execute multiple functions without increasing the genome and transcriptome complexity. Although there have long been calls for computational methods for the prediction of moonlighting proteins, no method has been designed for determining moonlighting long noncodi...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz154

    authors: Liu X,Xu Y,Wang R,Liu S,Wang J,Luo Y,Leung KS,Cheng L

    update_date:2021-01-18 00:00:00

  • HVIDB: a comprehensive database for human-virus protein-protein interactions.

    abstract::While leading to millions of people's deaths every year the treatment of viral infectious diseases remains a huge public health challenge. Therefore, an in-depth understanding of human-virus protein-protein interactions (PPIs) as the molecular interface between a virus and its host cell is of paramount importance to ob...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa425

    authors: Yang X,Lian X,Fu C,Wuchty S,Yang S,Zhang Z

    update_date:2021-01-30 00:00:00

  • Pathogenicity phenomena in three model systems: from network mining to emerging system-level properties.

    abstract::Understanding the interconnections of microbial pathogenicity phenomena, such as biofilm formation, quorum sensing and antimicrobial resistance, is a tremendous open challenge for biomedical research. Progress made by wet-lab researchers and bioinformaticians in understanding the underlying regulatory phenomena has be...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbt071

    authors: Castelhano Santos N,Pereira MO,Lourenço A

    update_date:2015-01-01 00:00:00

  • Towards deep phenotyping pregnancy: a systematic review on artificial intelligence and machine learning methods to improve pregnancy outcomes.

    abstract:OBJECTIVE:Development of novel informatics methods focused on improving pregnancy outcomes remains an active area of research. The purpose of this study is to systematically review the ways that artificial intelligence (AI) and machine learning (ML), including deep learning (DL), methodologies can inform patient care d...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa369

    authors: Davidson L,Boland MR

    update_date:2021-01-06 00:00:00

  • HpQTL: a geometric morphometric platform to compute the genetic architecture of heterophylly.

    abstract::Heterophylly, i.e. morphological changes in leaves along the axis of an individual plant, is regarded as a strategy used by plants to cope with environmental change. However, little is known of the extent to which heterophylly is controlled by genes and how each underlying gene exerts its effect on heterophyllous vari...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbx011

    authors: Sun L,Wang J,Zhu X,Jiang L,Gosik K,Sang M,Sun F,Cheng T,Zhang Q,Wu R

    update_date:2018-07-20 00:00:00

  • Computational prediction and analysis of species-specific fungi phosphorylation via feature optimization strategy.

    abstract::Protein phosphorylation is a reversible and ubiquitous post-translational modification that primarily occurs at serine, threonine and tyrosine residues and regulates a variety of biological processes. In this paper, we first briefly summarized the current progresses in computational prediction of eukaryotic protein ph...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby122

    authors: Cao M,Chen G,Yu J,Shi S

    update_date:2020-03-23 00:00:00

  • A solid quality-control analysis of AB SOLiD short-read sequencing data.

    abstract::Next generation sequencers have greatly improved our ability to mine polymorphisms and mutations out of entire (or portions of) genomes. The reliability of their outputs, though, showed to be very related to the sequencing chemistry and to deeply affect the quality of the downstream analyses. We focus here on the two-...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbs048

    authors: Castellana S,Romani M,Valente EM,Mazza T

    update_date:2013-11-01 00:00:00

  • Survey of miRNA-miRNA cooperative regulation principles across cancer types.

    abstract::Cooperative regulation among multiple microRNAs (miRNAs) is a complex type of posttranscriptional regulation in human; however, the global view of the system-level regulatory principles across cancers is still unclear. Here, we investigated miRNA-miRNA cooperative regulatory landscape across 18 cancer types and summar...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bby038

    authors: Shao T,Wang G,Chen H,Xie Y,Jin X,Bai J,Xu J,Li X,Huang J,Jin Y,Li Y

    update_date:2019-09-27 00:00:00

  • MloDisDB: a manually curated database of the relations between membraneless organelles and diseases.

    abstract::Cells are compartmentalized by numerous membrane-bounded organelles and membraneless organelles (MLOs) to ensure temporal and spatial regulation of various biological processes. A number of MLOs, such as nucleoli, nuclear speckles and stress granules, exist as liquid droplets within the cells and arise from the conden...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbaa271

    authors: Hou C,Xie H,Fu Y,Ma Y,Li T

    update_date:2020-10-30 00:00:00

  • Comparison of haplotype-based tests for detecting gene-environment interactions with rare variants.

    abstract::Dissecting the genetic mechanism underlying a complex disease hinges on discovering gene-environment interactions (GXE). However, detecting GXE is a challenging problem especially when the genetic variants under study are rare. Haplotype-based tests have several advantages over the so-called collapsing tests for detec...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz031

    authors: Papachristou C,Biswas S

    update_date:2020-05-21 00:00:00

  • Iteratively reweighted LASSO for mapping multiple quantitative trait loci.

    abstract::The iteratively reweighted least square (IRLS) method is mostly identical to maximum likelihood (ML) method in terms of parameter estimation and power of quantitative trait locus (QTL) detection. But the IRLS is greatly superior to ML in terms of computing speed and the robustness of parameter estimation. In conjuncti...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbs062

    authors: Liu Y,Yang T,Li H,Yang R

    update_date:2014-01-01 00:00:00

  • Comparing enrichment analysis and machine learning for identifying gene properties that discriminate between gene classes.

    abstract::Biologists very often use enrichment methods based on statistical hypothesis tests to identify gene properties that are significantly over-represented in a given set of genes of interest, by comparison with a 'background' set of genes. These enrichment methods, although based on rigorous statistical foundations, are n...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbz028

    authors: Fabris F,Palmer D,de Magalhães JP,Freitas AA

    update_date:2020-05-21 00:00:00

  • BioModels.net Web Services, a free and integrated toolkit for computational modelling software.

    abstract::Exchanging and sharing scientific results are essential for researchers in the field of computational modelling. BioModels.net defines agreed-upon standards for model curation. A fundamental one, MIRIAM (Minimum Information Requested in the Annotation of Models), standardises the annotation and curation process of qua...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbp056

    authors: Li C,Courtot M,Le Novère N,Laibe C

    update_date:2010-05-01 00:00:00

  • Teaching the bioinformatics of signaling networks: an integrated approach to facilitate multi-disciplinary learning.

    abstract::The number of bioinformatics tools and resources that support molecular and cell biology approaches is continuously expanding. Moreover, systems and network biology analyses are accompanied more and more by integrated bioinformatics methods. Traditional information-centered university teaching methods often fail, as (...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbt024

    authors: Korcsmaros T,Dunai ZA,Vellai T,Csermely P

    update_date:2013-09-01 00:00:00

  • A practical guide for the functional annotation of genetic variations using SNPnexus.

    abstract::Broader functional annotation of known as well as putative genetic variations is a valuable mean for prioritizing targets in disease studies and large-scale genotyping projects. In this article, we present a practical guide to SNPnexus, a web-based tool that provides an aggregate set of functional annotations for geno...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bbt004

    authors: Dayem Ullah AZ,Lemoine NR,Chelala C

    update_date:2013-07-01 00:00:00

  • Mutational analysis in RNAs: comparing programs for RNA deleterious mutation prediction.

    abstract::Programs for RNA mutational analysis that are structure-based and rely on secondary structure prediction have been developed and expanded in the past several years. They can be used for a variety of purposes, such as in suggesting point mutations that will alter RNA virus replication or translation initiation, investi...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article, Review

    doi:10.1093/bib/bbq059

    authors: Barash D,Churkin A

    update_date:2011-03-01 00:00:00

  • HITS-PR-HHblits: protein remote homology detection by combining PageRank and Hyperlink-Induced Topic Search.

    abstract::As one of the most important fundamental problems in protein sequence analysis, protein remote homology detection is critical for both theoretical research (protein structure and function studies) and real world applications (drug design). Although several computational predictors have been proposed, their detection p...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby104

    authors: Liu B,Jiang S,Zou Q

    update_date:2018-11-07 00:00:00

  • Fuzzy Petri nets for modelling of uncertain biological systems.

    abstract::The modelling of biological systems is accompanied with epistemic uncertainties that range from structural uncertainty to parametric uncertainty due to such limitations as insufficient understanding of the underlying mechanism and incomplete measurement data of a system. Fuzzy logic approaches such as fuzzy Petri nets...

    journal_title:Briefings in bioinformatics

    pub_type: Journal Article

    doi:10.1093/bib/bby118

    authors: Liu F,Heiner M,Gilbert D

    update_date:2018-12-27 00:00:00