Reprint of "Abstraction for data integration: Fusing mammalian molecular, cellular and phenotype big datasets for better knowledge extraction".

Abstract:

:With advances in genomics, transcriptomics, metabolomics and proteomics, and more expansive electronic clinical record monitoring, as well as advances in computation, we have entered the Big Data era in biomedical research. Data gathering is growing rapidly while only a small fraction of this data is converted to useful knowledge or reused in future studies. To improve this, an important concept that is often overlooked is data abstraction. To fuse and reuse biomedical datasets from diverse resources, data abstraction is frequently required. Here we summarize some of the major Big Data biomedical research resources for genomics, proteomics and phenotype data, collected from mammalian cells, tissues and organisms. We then suggest simple data abstraction methods for fusing this diverse but related data. Finally, we demonstrate examples of the potential utility of such data integration efforts, while warning about the inherit biases that exist within such data.

journal_name

Comput Biol Chem

authors

Rouillard AD,Wang Z,Ma'ayan A

doi

10.1016/j.compbiolchem.2015.08.005

subject

Has Abstract

pub_date

2015-12-01 00:00:00

pages

123-38

eissn

1476-9271

issn

1476-928X

pii

S1476-9271(15)00083-3

journal_volume

59 Pt B

pub_type

杂志文章,评审
  • Structure based pharmacophore study to identify possible natural selective PARP-1 trapper as anti-cancer agent.

    abstract::Inhibition of poly(ADP-ribose) polymerase-1 (PARP-1) has turned out an innovative approach for cancer therapy due to its involvement in DNA repair pathways. Although several potent PARP-1 inhibitors have been identified, they exhibit high toxicity, resistivity and diverse pharmacological profile in clinical trials, wh...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.04.018

    authors: Kumar C,P T V L,Arunachalam A

    更新日期:2019-06-01 00:00:00

  • RNA-binding residues in sequence space: conservation and interaction patterns.

    abstract::RNA-binding proteins (RBPs) perform fundamental and diverse functions within the cell. Approximately 15% of proteins sequences are annotated as RNA-binding, but with a significant number of proteins without functional annotation, many RBPs are yet to be identified. A percentage of uncharacterised proteins can be annot...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2009.07.012

    authors: Spriggs RV,Jones S

    更新日期:2009-10-01 00:00:00

  • Optimal hybrid sequencing and assembly: Feasibility conditions for accurate genome reconstruction and cost minimization strategy.

    abstract::Recent advances in high-throughput genome sequencing technologies have enabled the systematic study of various genomes by making whole genome sequencing affordable. Modern sequencers generate a huge number of small sequence fragments called reads, where the read length and the per-base sequencing cost depend on the te...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.03.016

    authors: Chen CC,Ghaffari N,Qian X,Yoon BJ

    更新日期:2017-08-01 00:00:00

  • Exploring two-dimensional graphene and boron-nitride as potential nanocarriers for cytarabine and clofarabine anti-cancer drugs.

    abstract::Development in two-dimensional (2D) drug-delivery materials have quickly translated into biological and pharmacological fields. In this present work, pristine graphene (PG) and hexagonal boron nitride (h-BN) sheets are explored as a drug carrier for cytarabine (CYT) and clofarabine (CLF) anti-cancer drugs using densit...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107334

    authors: Saravanan V,Rajamani A,Subramani M,Ramasamy S

    更新日期:2020-10-01 00:00:00

  • Why does beta-secretase zymogen possess catalytic activity? Molecular modeling and molecular dynamics simulation studies.

    abstract::Beta-secretase is a potential target for inhibitory drugs against Alzheimer's disease as it cleaves amyloid precursor protein (APP) to form insoluble amyloid plaques and vascular deposits in the brain. Beta-secretase is matured from its precursor protein, called beta-secretase zymogen, which, different from most of ot...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2007.03.007

    authors: Zuo Z,Gang C,Zou H,Mok PC,Zhu W,Chen K,Jiang H

    更新日期:2007-06-01 00:00:00

  • Markovian encoding models in human splice site recognition using SVM.

    abstract::Splice site recognition is among the most significant and challenging tasks in bioinformatics due to its key role in gene annotation. Effective prediction of splice site requires nucleotide encoding methods that reveal the characteristics of DNA sequences to provide appropriate features to serve as input of machine le...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2018.02.005

    authors: Pashaei E,Aydin N

    更新日期:2018-04-01 00:00:00

  • Interactions of 2-phenyl-benzotriazole xenobiotic compounds with human Cytochrome P450-CYP1A1 by means of docking, molecular dynamics simulations and MM-GBSA calculations.

    abstract::2-phenyl-benzotriazole xenobiotic compounds (PBTA-4, PBTA-6, PBTA-7 and PBTA-8) that were previously isolated and identified in waters of the Yodo river, in Japan (Nukaya et al., 2001; Ohe et al., 2004; Watanabe et al., 2001) were characterized as powerful pro-mutagens. In order to predict the activation mechanism of ...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2018.04.004

    authors: Mena-Ulecia K,MacLeod-Carey D

    更新日期:2018-06-01 00:00:00

  • Ab-initio prediction and reliability of protein structural genomics by PROPAINOR algorithm.

    abstract::We have formulated the ab-initio prediction of the 3D-structure of proteins as a probabilistic programming problem where the inter-residue 3D-distances are treated as random variables. Lower and upper bounds for these random variables and the corresponding probabilities are estimated by nonparametric statistical metho...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/s0097-8485(02)00074-8

    authors: Joshi RR,Jyothi S

    更新日期:2003-07-01 00:00:00

  • Detection of nucleotide sequences capable of forming non-canonical DNA structures: Application of automata theory.

    abstract::In this study, we develop a program that allows us to reveal DNA receptors, i.e. nucleotide sequences that may form more than one non-canonical structure. The data obtained may be analysed either experimentally or using DNA banks, and refers to the coding, non-coding or promotor region of the gene. These results provi...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.04.009

    authors: Yurushkin MV,Gervich LR,Bachurin SS,Kletskii ME

    更新日期:2019-06-01 00:00:00

  • GeneMCL in microarray analysis.

    abstract::Accurately and reliably identifying the actual number of clusters present with a dataset of gene expression profiles, when no additional information on cluster structure is available, is a problem addressed by few algorithms. GeneMCL transforms microarray analysis data into a graph consisting of nodes connected by edg...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2005.07.002

    authors: Samuel Lattimore B,van Dongen S,Crabbe MJ

    更新日期:2005-10-01 00:00:00

  • Multi-group cancer outlier differential gene expression detection.

    abstract::It has recently been shown that cancer genes (oncogenes) tend to have heterogeneous expressions across disease samples. So it is reasonable to assume that in a microarray data only a subset of disease samples will be activated (often referred to as outliers), which presents some new challenges for statistical analysis...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2007.02.004

    authors: Liu F,Wu B

    更新日期:2007-04-01 00:00:00

  • CAMWI: Detecting protein complexes using weighted clustering coefficient and weighted density.

    abstract::Detection of protein complexes is very important to understand the principles of cellular organization and function. Recently, large protein-protein interactions (PPIs) networks have become available using high-throughput experimental techniques. These networks make it possible to develop computational methods for pro...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2015.07.012

    authors: Lakizadeh A,Jalili S,Marashi SA

    更新日期:2015-10-01 00:00:00

  • Pharmacoinformatics exploration of polyphenol oxidases leading to novel inhibitors by virtual screening and molecular dynamic simulation study.

    abstract::Polyphenol oxidases (PPOs)/tyrosinases are metal-dependent enzymes and known as important targets for melanogenesis. Although considerable attempts have been conducted to control the melanin-associated diseases by using various inhibitors. However, the exploration of the best anti-melanin inhibitor without side effect...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.02.012

    authors: Hassan M,Abbas Q,Ashraf Z,Moustafa AA,Seo SY

    更新日期:2017-06-01 00:00:00

  • Synthesis, biological evaluation, and computational studies of novel fused six-membered O-containing heterocycles as potential acetylcholinesterase inhibitors.

    abstract::An efficient, borax-catalyzed protocol for the synthesis of novel 4-aryl-substituted-4H-pyran derivatives fused to α-pyrone ring in a one-pot is described. By this achievement, some novel 4-aryl substituted 4H-pyrans fused to the α-pyrone ring as potential acetylcholinesterase inhibitors (AChEIs) with good to excellen...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.04.004

    authors: Pourshojaei Y,Abiri A,Eskandari R,Dourandish F,Eskandari K,Asadipour A

    更新日期:2019-06-01 00:00:00

  • On interaction of arginine, cysteine and guanine with a nano-TiO2 cluster.

    abstract::Nanoscopic properties of TiO2 augmented with its physicochemical properties and biocompatibility make it a material interest in the biomedical field. Efficient methods to design of such materials require a thorough understanding of associated nano-bio interfaces. In the present study, density functional theory calcula...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107236

    authors: Sai Phani Kumar V,Verma M,Deshpande PA

    更新日期:2020-06-01 00:00:00

  • A novel k-word relative measure for sequence comparison.

    abstract::In order to extract phylogenetic information from DNA sequences, the new normalized k-word average relative distance is proposed in this paper. The proposed measure was tested by discriminate analysis and phylogenetic analysis. The phylogenetic trees based on the Manhattan distance measure are reconstructed with k ran...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2014.10.007

    authors: Tang J,Hua K,Chen M,Zhang R,Xie X

    更新日期:2014-12-01 00:00:00

  • Workflow based framework for life science informatics.

    abstract::Workflow technology is a generic mechanism to integrate diverse types of available resources (databases, servers, software applications and different services) which facilitate knowledge exchange within traditionally divergent fields such as molecular biology, clinical research, computational science, physics, chemist...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章,评审

    doi:10.1016/j.compbiolchem.2007.08.009

    authors: Tiwari A,Sekhar AK

    更新日期:2007-10-01 00:00:00

  • Functional and structural insights into novel DREB1A transcription factors in common wheat (Triticum aestivum L.): A molecular modeling approach.

    abstract::Triticum aestivum L. known as common wheat is one of the most important cereal crops feeding a large and growing population. Various environmental stress factors including drought, high salinity and heat etc. adversely affect wheat production in a significant manner. Dehydration-responsive element-binding (DREB1A) fac...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2016.07.008

    authors: Kumar A,Kumar S,Kumar U,Suravajhala P,Gajula MN

    更新日期:2016-10-01 00:00:00

  • Genome-wide predicting disease-related protein complexes by walking on the heterogeneous network based on data integration and laplacian normalization.

    abstract:BACKGROUND:Associating protein complexes to human inherited diseases is critical for better understanding of biological processes and functional mechanisms of the disease. Many protein complexes have been identified and functionally annotated by computational and purification methods so far, however, the particular rol...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.04.007

    authors: Liu Z,Luo J

    更新日期:2017-08-01 00:00:00

  • In silico pharmacophore modeling and simulation studies for searching potent antileishmanials targeted against Leishmania donovani nicotinamidase.

    abstract::Nicotinamidase is a key enzyme for the salvage pathway catalyzing the first step for the conversion of nicotinamide (NAm) to nicotinic acid (NA) required for the synthesis of Nicotinamide Adenine Dinucleotide (NAD+) in the subsequent steps. Leishmania protozoan parasites are NAD+ auxotrophs and need precursors (nicoti...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2019.107150

    authors: Chauhan N,Poddar R

    更新日期:2019-12-01 00:00:00

  • Profiling of molecular pathways regulated by microRNA 601.

    abstract::MicroRNAs (miRNAs) have been implicated in complex vertebrate developmental and pathological systems as a versatile class of molecules involved in the regulation of various biological processes and molecular pathways. To elucidate the role of miRNAs in human somatic cells, an understanding of the molecular framework r...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2009.09.003

    authors: Ohdaira H,Nakagawa H,Yoshida K

    更新日期:2009-12-01 00:00:00

  • Automated prediction of three-way junction topological families in RNA secondary structures.

    abstract::We present an algorithm for automatically predicting the topological family of any RNA three-way junction, given only the information from the secondary structure: the sequence and the Watson-Crick pairings. The parameters of the algorithm have been determined on a data set of 33 three-way junctions whose 3D conformat...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2011.11.001

    authors: Lamiable A,Barth D,Denise A,Quessette F,Vial S,Westhof E

    更新日期:2012-04-01 00:00:00

  • Predicting microRNA biological functions based on genes discriminant analysis.

    abstract::Although thousands of microRNAs (miRNAs) have been identified in recent experimental efforts, it remains a challenge to explore their specific biological functions through molecular biological experiments. Since those members from same family share same or similar biological functions, classifying new miRNAs into thei...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.09.008

    authors: Ding T,Xu J,Sun M,Zhu S,Gao J

    更新日期:2017-12-01 00:00:00

  • Theoretical studies and NMR assay of coumarins and neoflavanones derivatives as potential inhibitors of acetylcholinesterase.

    abstract::Currently Alzheimer's disease (AD) is a devastating neurological disorder that mainly affects the elderly. The treatment of AD has as main objective to increase the levels of ACh in the synaptic cleft by inhibiting the cholinesterase enzymes, which are responsible for the degradation of ACh. Twenty one synthesized cou...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2020.107293

    authors: de Souza LG,Moraes PF,Leão RAC,Costa PRR,Soares RO,Pascutti PG,Figueroa-Villar JD,Rennó MN

    更新日期:2020-05-29 00:00:00

  • In silico analyses of a new group of fungal and plant RecQ4-homologous proteins.

    abstract::Bacterial and eukaryotic RecQ helicases comprise a family of homologous proteins necessary for maintaining genomic integrity during the cell cycle and DNA repair. There is one known bacterial RecQ helicase, and five eukaryotic RecQ helicases that have been described: RecQ1p, RecQ4p, RecQ5p, Bloom, and Werner. While th...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2008.07.005

    authors: Barea F,Tessaro S,Bonatto D

    更新日期:2008-10-01 00:00:00

  • A Computational workflow for the identification of the potent inhibitor of type II secretion system traffic ATPase of Pseudomonas aeruginosa.

    abstract::Bacterial type II secretion system has now become an attractive target for antivirulence drug development. The aim of the present study was to characterize the binding site of the type II secretion system traffic ATPase GspER of Pseudomonas aeruginosa, and identify potent inhibitors using extensive computational and v...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2018.07.012

    authors: Arifuzzaman M,Mitra S,Jahan SI,Jakaria M,Abeda T,Absar N,Dash R

    更新日期:2018-10-01 00:00:00

  • GADS software for parametric linkage analysis of quantitative traits distributed as a point-mass mixture.

    abstract::Often the quantitative data coming from proteomics and metabolomics studies have irregular distribution with a spike. None of the wide used methods for human QTL mapping are applicable to such traits. Researchers have to reduce the sample, excluding the spike, and analyze only continuous measurements. In this study, w...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2011.11.004

    authors: Axenovich TI,Zorkoltseva IV

    更新日期:2012-02-01 00:00:00

  • Biocomputational identification and validation of novel microRNAs predicted from bubaline whole genome shotgun sequences.

    abstract::MicroRNAs (miRNAs) are small (19-25 base long), non-coding RNAs that regulate post-transcriptional gene expression by cleaving targeted mRNAs in several eukaryotes. The miRNAs play vital roles in multiple biological and metabolic processes, including developmental timing, signal transduction, cell maintenance and diff...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2017.08.005

    authors: Manku HK,Dhanoa JK,Kaur S,Arora JS,Mukhopadhyay CS

    更新日期:2017-10-01 00:00:00

  • New insights on gene regulation in archaea.

    abstract::Archaea represent an important and vast domain of life. This cellular domain includes a large diversity of organisms characterized as prokaryotes with basal transcriptional machinery similar to eukarya. In this work we explore the most recent findings concerning the transcriptional regulatory organization in archaeal ...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章,评审

    doi:10.1016/j.compbiolchem.2011.10.006

    authors: Tenorio-Salgado S,Huerta-Saquero A,Perez-Rueda E

    更新日期:2011-12-14 00:00:00

  • The aspartate aminotransferase-like domain of Firmicutes MocR transcriptional regulators.

    abstract::Bacterial MocR transcriptional regulators possess an N-terminal DNA-binding domain containing a conserved helix-turn-helix module and an effector-binding and/or oligomerization domain at the C-terminus, homologous to fold type-I pyridoxal 5'-phosphate (PLP) enzymes. Since a comprehensive structural analysis of the Moc...

    journal_title:Computational biology and chemistry

    pub_type: 杂志文章

    doi:10.1016/j.compbiolchem.2015.05.003

    authors: Milano T,Contestabile R,Lo Presti A,Ciccozzi M,Pascarella S

    更新日期:2015-10-01 00:00:00