Prediction and mechanistic analysis of drug-induced liver injury (DILI) based on chemical structure.

Abstract:

BACKGROUND:Drug-induced liver injury (DILI) is a major safety concern characterized by a complex and diverse pathogenesis. To identify DILI early in drug development, a better understanding of the injury and models with better predictivity are urgently needed. One approach is in silico models, which aim to predict the risk of DILI from the compound structure. However, these models do not yet show sufficient predictive performance or interpretability to be useful for decision making by themselves. The limited performance stems partly from the underlying problem of labeling the in vivo DILI risk of compounds in a way that is meaningful for generating machine learning models.
RESULTS:As part of the Critical Assessment of Massive Data Analysis (CAMDA) "CMap Drug Safety Challenge" 2019 (http://camda2019.bioinf.jku.at), chemical structure-based models were generated using the binarized DILIrank annotations. Support Vector Machine (SVM) and Random Forest (RF) classifiers performed comparably to previously published models, with a mean balanced accuracy of 0.759 ± 0.027 on an external test set, averaged over models generated using 5-fold LOCO-CV inside a 10-fold training scheme. In the models that used predicted protein targets as compound descriptors, the most information-rich proteins agreed with the mechanisms of action and toxicity of nonsteroidal anti-inflammatory drugs (NSAIDs), one of the most important drug classes causing DILI, as well as with stress response via TP53 and with biotransformation. In addition, we identified multiple proteins involved in xenobiotic metabolism which could be novel DILI-related off-targets, such as CLK1 and DYRK2. Moreover, we derived potential structural alerts for DILI with high precision, including furan and hydrazine derivatives; however, all derived alerts were present in approved drugs and were overly specific, indicating the need to consider quantitative variables such as dose.
CONCLUSION:Using chemical structure-based descriptors such as structural fingerprints and predicted protein targets, DILI prediction models were built with a predictive performance comparable to previous literature. In addition, we derived insights on proteins and pathways statistically (and potentially causally) linked to DILI from these models and inferred new structural alerts related to this adverse endpoint.
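The evaluation metric and the cluster-aware splitting mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes that LOCO-CV denotes a cross-validation in which whole clusters of similar compounds are held out together (the abstract does not expand the acronym), and the function names `balanced_accuracy` and `cluster_folds` are hypothetical helpers introduced only for illustration.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Mean of sensitivity and specificity for binary labels (1 = DILI, 0 = no DILI)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present")
    return 0.5 * (tp / pos + tn / neg)


def cluster_folds(cluster_ids, n_folds):
    """Assign whole clusters to folds so that no cluster is split across folds.

    cluster_ids[i] is the (assumed, precomputed) cluster label of compound i.
    Returns a list of n_folds lists of compound indices.
    """
    members = defaultdict(list)
    for idx, cid in enumerate(cluster_ids):
        members[cid].append(idx)
    folds = [[] for _ in range(n_folds)]
    # Greedy balancing: largest clusters first, each into the currently smallest fold.
    for cid in sorted(members, key=lambda c: (-len(members[c]), str(c))):
        smallest = min(range(n_folds), key=lambda f: len(folds[f]))
        folds[smallest].extend(members[cid])
    return folds
```

Balanced accuracy is a natural choice when DILI-positive and DILI-negative compounds are unevenly represented, because each class contributes equally to the score. Holding out whole clusters gives a more conservative estimate than a random split, since structurally similar compounds cannot appear in both the training and the test fold.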

journal_name

Biol Direct

journal_title

Biology direct

authors

Liu A,Walter M,Wright P,Bartosik A,Dolciami D,Elbasir A,Yang H,Bender A

doi

10.1186/s13062-020-00285-0

pub_date

2021-01-18 00:00:00

pages

6

issue

1

issn

1745-6150

pii

10.1186/s13062-020-00285-0

journal_volume

16

pub_type

Journal Article
  • A computational approach to candidate gene prioritization for X-linked mental retardation using annotation-based binary filtering and motif-based linear discriminatory analysis.

    abstract:BACKGROUND:Several computational candidate gene selection and prioritization methods have recently been developed. These in silico selection and prioritization techniques are usually based on two central approaches--the examination of similarities to known disease genes and/or the evaluation of functional annotation of...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-30

    authors: Lombard Z,Park C,Makova KD,Ramsay M

    update_date:2011-06-13 00:00:00

  • Use of designed sequences in protein structure recognition.

    abstract:BACKGROUND:Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-018-0209-6

    authors: Kumar G,Mudgal R,Srinivasan N,Sandhya S

    update_date:2018-05-09 00:00:00

  • Interplay of recombination and selection in the genomes of Chlamydia trachomatis.

    abstract:BACKGROUND:Chlamydia trachomatis is an obligate intracellular bacterial parasite, which causes several severe and debilitating diseases in humans. This study uses comparative genomic analyses of 12 complete published C. trachomatis genomes to assess the contribution of recombination and selection in this pathogen and t...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-28

    authors: Joseph SJ,Didelot X,Gandhi K,Dean D,Read TD

    update_date:2011-05-26 00:00:00

  • Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data.

    abstract:BACKGROUND:The advent of metagenomic sequencing provides microbial abundance patterns that can be leveraged for sample origin prediction. Supervised machine learning classification approaches have been reported to predict sample origin accurately when the origin has been previously sampled. Using metagenomic datasets p...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-020-00287-y

    authors: Chen JC,Tyler AD

    update_date:2020-12-10 00:00:00

  • The multiple personalities of Watson and Crick strands.

    abstract:BACKGROUND:In genetics it is customary to refer to double-stranded DNA as containing a "Watson strand" and a "Crick strand." However, there seems to be no consensus in the literature on the exact meaning of these two terms, and the many usages contradict one another as well as the original definition. Here, we review t...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-7

    authors: Cartwright RA,Graur D

    update_date:2011-02-08 00:00:00

  • PEPstrMOD: structure prediction of peptides containing natural, non-natural and modified residues.

    abstract:BACKGROUND:In the past, many methods have been developed for peptide tertiary structure prediction but they are limited to peptides having natural amino acids. This study describes a method PEPstrMOD, which is an updated version of PEPstr, developed specifically for predicting the structure of peptides containing natur...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-015-0103-4

    authors: Singh S,Singh H,Tuknait A,Chaudhary K,Singh B,Kumaran S,Raghava GP

    update_date:2015-12-21 00:00:00

  • Comprehensive comparative-genomic analysis of type 2 toxin-antitoxin systems and related mobile stress response systems in prokaryotes.

    abstract:BACKGROUND:The prokaryotic toxin-antitoxin systems (TAS, also referred to as TA loci) are widespread, mobile two-gene modules that can be viewed as selfish genetic elements because they evolved mechanisms to become addictive for replicons and cells in which they reside, but also possess "normal" cellular functions in v...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-4-19

    authors: Makarova KS,Wolf YI,Koonin EV

    update_date:2009-06-03 00:00:00

  • xHMMER3x2: Utilizing HMMER3's speed and HMMER2's sensitivity and specificity in the glocal alignment mode for improved large-scale protein domain annotation.

    abstract:BACKGROUND:While the local-mode HMMER3 is notable for its massive speed improvement, the slower glocal-mode HMMER2 is more exact for domain annotation by enforcing full domain-to-sequence alignments. Since a unit of domain necessarily implies a unit of function, local-mode HMMER3 alone remains insufficient for precise ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-016-0163-0

    authors: Yap CK,Eisenhaber B,Eisenhaber F,Wong WC

    update_date:2016-11-29 00:00:00

  • The common ancestry of life.

    abstract:BACKGROUND:It is common belief that all cellular life forms on earth have a common origin. This view is supported by the universality of the genetic code and the universal conservation of multiple genes, particularly those that encode key components of the translation system. A remarkable recent study claims to provide...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-5-64

    authors: Koonin EV,Wolf YI

    update_date:2010-11-18 00:00:00

  • Orphan SelD proteins and selenium-dependent molybdenum hydroxylases.

    abstract:Bacterial and Archaeal cells use selenium structurally in selenouridine-modified tRNAs, in proteins translated with selenocysteine, and in the selenium-dependent molybdenum hydroxylases (SDMH). The first two uses both require the selenophosphate synthetase gene, selD. Examining over 500 complete prokaryotic genomes fi...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-3-4

    authors: Haft DH,Self WT

    update_date:2008-02-20 00:00:00

  • MutL homologs in restriction-modification systems and the origin of eukaryotic MORC ATPases.

    abstract:The provenance and biochemical roles of eukaryotic MORC proteins have remained poorly understood since the discovery of their prototype MORC1, which is required for meiotic nuclear division in animals. The MORC family contains a combination of a gyrase, histidine kinase, and MutL (GHKL) and S5 domains that together co...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-3-8

    authors: Iyer LM,Abhiman S,Aravind L

    update_date:2008-03-17 00:00:00

  • From tumors to species: a SCANDAL hypothesis.

    abstract:Some tumor cells can evolve into transmissible parasites. Notable examples include the Tasmanian devil facial tumor disease, the canine transmissible venereal tumor and transmissible cancers of mollusks. We present a hypothesis that such transmissible tumors existed in the past and that some modern animal taxa are ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-019-0233-1

    authors: Panchin AY,Aleoshin VV,Panchin YV

    update_date:2019-01-23 00:00:00

  • Activating and inhibiting connections in biological network dynamics.

    abstract:BACKGROUND:Many studies of biochemical networks have analyzed network topology. Such work has suggested that specific types of network wiring may increase network robustness and therefore confer a selective advantage. However, knowledge of network topology does not allow one to predict network dynamical behavior--for e...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-3-49

    authors: McDonald D,Waterbury L,Knight R,Betterton MD

    update_date:2008-12-04 00:00:00

  • Elusive data underlying debate at the prokaryote-eukaryote divide.

    abstract:BACKGROUND:The origin of eukaryotic cells was an important transition in evolution. The factors underlying the origin and evolutionary success of the eukaryote lineage are still discussed. One camp argues that mitochondria were essential for eukaryote origin because of the unique configuration of internalized bioenerge...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-018-0221-x

    authors: Gerlitz M,Knopp M,Kapust N,Xavier JC,Martin WF

    update_date:2018-10-03 00:00:00

  • Component retention in principal component analysis with application to cDNA microarray data.

    abstract:Shannon entropy is used to provide an estimate of the number of interpretable components in a principal component analysis. In addition, several ad hoc stopping rules for dimension determination are reviewed and a modification of the broken stick model is presented. The modification incorporates a test for the presenc...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-2-2

    authors: Cangelosi R,Goriely A

    update_date:2007-01-17 00:00:00

  • The UBR-box and its relationship to binuclear RING-like treble clef zinc fingers.

    abstract:BACKGROUND:The N-end rule pathway is a part of the ubiquitin-dependent proteolytic system wherein N-recognin proteins recognize the amino terminal degradation signals (N-degrons) of the substrate. The type 1 N-degron recognizing UBR-box domain of the eukaryotic Arg/N-end rule pathway is known to possess a novel three-z...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-015-0066-5

    authors: Kaur G,Subramanian S

    update_date:2015-07-17 00:00:00

  • Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.

    abstract:BACKGROUND:An evolutionary classification of genes from sequenced genomes that distinguishes between orthologs and paralogs is indispensable for genome annotation and evolutionary reconstruction. Shortly after multiple genome sequences of bacteria, archaea, and unicellular eukaryotes became available, an attempt on suc...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-2-33

    authors: Makarova KS,Sorokin AV,Novichkov PS,Wolf YI,Koonin EV

    update_date:2007-11-27 00:00:00

  • On origin of genetic code and tRNA before translation.

    abstract:BACKGROUND:Synthesis of proteins is based on the genetic code - a nearly universal assignment of codons to amino acids (aas). A major challenge to the understanding of the origins of this assignment is the archetypal "key-lock vs. frozen accident" dilemma. Here we re-examine this dilemma in light of 1) the fundamental ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-14

    authors: Rodin AS,Szathmáry E,Rodin SN

    update_date:2011-02-22 00:00:00

  • Assessment of urban microbiome assemblies with the help of targeted in silico gold standards.

    abstract:BACKGROUND:Microbial communities play a crucial role in our environment and may influence human health tremendously. Despite being the place where human interaction is most abundant we still know little about the urban microbiome. This is highlighted by the large amount of unclassified DNA reads found in urban metageno...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-018-0225-6

    authors: Gerner SM,Rattei T,Graf AB

    update_date:2018-10-12 00:00:00

  • Issues associated with the use of phosphospecific antibodies to localise active and inactive pools of GSK-3 in cells.

    abstract:BACKGROUND:Glycogen synthase kinase-3 (GSK-3) is a ubiquitously expressed serine/threonine (Ser/Thr) kinase comprising two isoforms, GSK-3α and GSK-3β. Both enzymes are similarly inactivated by serine phosphorylation (GSK-3α at Ser21 and GSK-3β at Ser9) and activated by tyrosine phosphorylation (GSK-3α at Tyr279 and GS...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-4

    authors: Campa VM,Kypta RM

    update_date:2011-01-24 00:00:00

  • Outer membrane protein genes and their small non-coding RNA regulator genes in Photorhabdus luminescens.

    abstract:INTRODUCTION:Three major outer membrane protein genes of Escherichia coli, ompF, ompC, and ompA respond to stress factors. Transcripts from these genes are regulated by the small non-coding RNAs micF, micC, and micA, respectively. Here we examine Photorhabdus luminescens, an organism that has a different habitat from E...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-1-12

    authors: Papamichail D,Delihas N

    update_date:2006-05-22 00:00:00

  • The origins of phagocytosis and eukaryogenesis.

    abstract:BACKGROUND:Phagocytosis, that is, engulfment of large particles by eukaryotic cells, is found in diverse organisms and is often thought to be central to the very origin of the eukaryotic cell, in particular, for the acquisition of bacterial endosymbionts including the ancestor of the mitochondrion. RESULTS:Comparisons...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-4-9

    authors: Yutin N,Wolf MY,Wolf YI,Koonin EV

    update_date:2009-02-26 00:00:00

  • Why eukaryotic cells use introns to enhance gene expression: splicing reduces transcription-associated mutagenesis by inhibiting topoisomerase I cutting activity.

    abstract:BACKGROUND:The costs and benefits of spliceosomal introns in eukaryotes have not been established. One recognized effect of intron splicing is its known enhancement of gene expression. However, the mechanism regulating such splicing-mediated expression enhancement has not been defined. Previous studies have shown that ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-6-24

    authors: Niu DK,Yang YF

    update_date:2011-05-18 00:00:00

  • IPC - Isoelectric Point Calculator.

    abstract:BACKGROUND:Accurate estimation of the isoelectric point (pI) based on the amino acid sequence is useful for many analytical biochemistry and proteomics techniques such as 2-D polyacrylamide gel electrophoresis, or capillary isoelectric focusing used in combination with high-throughput mass spectrometry. Additionally, p...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-016-0159-9

    authors: Kozlowski LP

    update_date:2016-10-21 00:00:00

  • A novel superfamily containing the beta-grasp fold involved in binding diverse soluble ligands.

    abstract:BACKGROUND:Domains containing the beta-grasp fold are utilized in a great diversity of physiological functions but their role, if any, in soluble or small molecule ligand recognition is poorly studied. RESULTS:Using sensitive sequence and structure similarity searches we identify a novel superfamily containing the bet...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-2-4

    authors: Burroughs AM,Balaji S,Iyer LM,Aravind L

    update_date:2007-01-24 00:00:00

  • The archaeo-eukaryotic GINS proteins and the archaeal primase catalytic subunit PriS share a common domain.

    abstract:Primase and GINS are essential factors for chromosomal DNA replication in eukaryotic and archaeal cells. Here we describe a previously undetected relationship between the C-terminal domain of the catalytic subunit (PriS) of archaeal primase and the B-domains of the archaeo-eukaryotic GINS proteins in the for...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-5-17

    authors: Swiatek A,Macneill SA

    update_date:2010-04-12 00:00:00

  • Rotational restriction of nascent peptides as an essential element of co-translational protein folding: possible molecular players and structural consequences.

    abstract:BACKGROUND:A basic tenet of protein science is that all information about the spatial structure of proteins is present in their sequences. Nonetheless, many proteins fail to attain native structure upon experimental denaturation and refolding in vitro, raising the question of the specific role of cellular machinery in ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/s13062-017-0186-1

    authors: Sorokina I,Mushegian A

    update_date:2017-05-31 00:00:00

  • Is pre-Darwinian evolution plausible?

    abstract:BACKGROUND:This essay highlights critical aspects of the plausibility of pre-Darwinian evolution. It is based on a critical review of some better-known open, far-from-equilibrium system-based scenarios supposed to explain processes that took place before Darwinian evolution had emerged and that resulted in the origin o...

    journal_title:Biology direct

    pub_type: Journal Article, Review

    doi:10.1186/s13062-018-0216-7

    authors: Tessera M

    update_date:2018-09-21 00:00:00

  • The fundamental units, processes and patterns of evolution, and the tree of life conundrum.

    abstract:BACKGROUND:The elucidation of the dominant role of horizontal gene transfer (HGT) in the evolution of prokaryotes led to a severe crisis of the Tree of Life (TOL) concept and intense debates on this subject. CONCEPT:Prompted by the crisis of the TOL, we attempt to define the primary units and the fundamental patterns ...

    journal_title:Biology direct

    pub_type: Journal Article

    doi:10.1186/1745-6150-4-33

    authors: Koonin EV,Wolf YI

    update_date:2009-09-29 00:00:00

  • Once upon a time the cell membranes: 175 years of cell boundary research.

    abstract:All modern cells are bounded by cell membranes best described by the fluid mosaic model. This statement is so widely accepted by biologists that little attention is generally given to the theoretical importance of cell membranes in describing the cell. This has not always been the case. When the Cell Theory was first ...

    journal_title:Biology direct

    pub_type: Historical Article, Journal Article, Review

    doi:10.1186/s13062-014-0032-7

    authors: Lombard J

    update_date:2014-12-19 00:00:00