Abstract:
:This study has assessed the use of consensus regression, as compared to single multiple linear regression, models for the development of quantitative structure-activity relationships (QSARs). To provide a comparison, four data sets of varying size and complexity were analyzed: silastic membrane flux, toxicity of phenols to Tetrahymena pyriformis, acute toxicity to the fathead minnow and flash point. For each data set, a genetic algorithm was used to develop a model population and the performance of consensus models was compared to that of the best single model. Two consensus models were developed, one using the top 10 models, and the other using a subset of models chosen to provide maximal coverage of model space. The results highlight the ability of the genetic algorithm to develop predictive models from a large descriptor pool. However, the consensus models were shown to offer no significant improvements over single regression models, which are as statistically robust as the equivalent consensus models. Consensus models developed from a selection of the best QSARs were shown not to be superior to a selection of diverse in "model space" QSARs. For the data sets analyzed in this study, and in light of the Organization for Economic Cooperation and Development principles for the validation of QSARs, the increase in model complexity when using consensus models does not seem warranted given the minimal improvement in model statistics.
journal_name
J Chem Inf Modeljournal_title
Journal of chemical information and modelingauthors
Hewitt M,Cronin MT,Madden JC,Rowe PH,Johnson C,Obi A,Enoch SJdoi
10.1021/ci700016dsubject
Has Abstractpub_date
2007-07-01 00:00:00pages
1460-8issue
4eissn
1549-9596issn
1549-960Xjournal_volume
47pub_type
杂志文章abstract::It is demonstrated that the fragmentation of druglike molecules by applying simplistic pseudo-retrosynthesis results in a stock of chemically meaningful building blocks for de novo molecule generation. A stochastic search algorithm in conjunction with ligand-based similarity scoring (Flux: fragment-based ligand builde...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0503560
更新日期:2006-03-01 00:00:00
abstract::Theoretical calculation of equilibrium dissociation constants is a very computationally demanding and time-consuming process since it requires an extremely accurate computation of the solvation free energy changes for each of the species involved. By correlating the minimum surface electrostatic potential (VS,min) on ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b01173
更新日期:2020-03-23 00:00:00
abstract::An index of the activation of Class A G-protein-coupled receptors (GPCRs) has been trained using interhelix distances from a series of microsecond molecular-dynamics simulations and tested for 268 published X-ray structures. In a three-class model that includes intermediate structures, 63% of the active structures are...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00604
更新日期:2019-09-23 00:00:00
abstract::The important role of water molecules in protein-ligand binding energetics has attracted wide attention in recent years. A range of computational methods has been developed to predict the favorable locations of water molecules in a protein binding pocket. Most of the current methods are based on extensive molecular dy...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00619
更新日期:2020-09-28 00:00:00
abstract::The spatial sign is a multivariate extension of the concept of sign. Recently multivariate estimators of covariance structures based on spatial signs have been examined by various authors. These new estimators are found to be robust to outlying observations. From a computational point of view, estimators based on spat...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050498u
更新日期:2006-05-01 00:00:00
abstract::SIRT2, which is a NAD+ (nicotinamide adenine dinucleotide) dependent deacetylase, has been demonstrated to play an important role in the occurrence and development of a variety of diseases such as cancer, ischemia-reperfusion, and neurodegenerative diseases. Small molecule inhibitors of SIRT2 are thought to be potenti...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00714
更新日期:2017-04-24 00:00:00
abstract::Side-chain modeling is critical for protein structure prediction since the uniqueness of the protein structure is largely determined by its side-chain packing conformation. In this paper, differing from most approaches that rely on rotamer library sampling, we first propose a novel side-chain rotamer prediction method...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00951
更新日期:2020-12-28 00:00:00
abstract::Predictive metabolism methods can be used in drug discovery projects to enhance the understanding of structure-metabolism relationships. The present study uses data mining methods to exploit biotransformation data that have been recorded in the MDL Metabolite database. Reacting center fingerprints were derived from a ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600376q
更新日期:2007-03-01 00:00:00
abstract::Physicochemical properties of compounds have been instrumental in selecting lead compounds with increased drug-likeness. However, the relationship between physicochemical properties of constituent drugs and the tendency to exhibit drug interaction has not been systematically studied. We assembled physicochemical descr...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci500276x
更新日期:2014-08-25 00:00:00
abstract::The chronic toxicity is fundamental for toxicological risk assessment, but its correlation with the chemical structures has been studied only little. This is partly due to the complexity of such an experimental test that embraces a plethora of different biological effects and mechanisms of action, making (Q)SAR studie...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci8001974
更新日期:2008-10-01 00:00:00
abstract::Increasing protein kinase C (PKC) activity is of potential therapeutic value. Its activation involves an interaction between the C1 domain and diacylglycerol (DAG) at intracellular membrane surfaces; DAG mimetics hold promise as new drugs. We previously developed the isophthalate derivative HMI-1a3, an effective but h...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c00624
更新日期:2020-11-23 00:00:00
abstract::The momentum gained by research on biologics has not been met yet with equal thrust on the informatics side. There is a noticeable lack of software for data management that empowers the bench scientists working on the development of biologic therapeutics. SARvision|Biologics is a tool to analyze data associated with b...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400333x
更新日期:2013-10-28 00:00:00
abstract::Machine learning has exhibited powerful capabilities in many areas. However, machine learning models are mostly database dependent, requiring a new model if the database changes. Therefore, a universal model is highly desired to accommodate the widest variety of databases. Fortunately, this universality may be achieve...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00878
更新日期:2019-05-28 00:00:00
abstract::Deep learning has drawn significant attention in different areas including drug discovery. It has been proposed that it could outperform other machine learning algorithms, especially with big data sets. In the field of pharmaceutical industry, machine learning models are built to understand quantitative structure-acti...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00671
更新日期:2019-03-25 00:00:00
abstract::The consistent handling of molecules is probably the most basic and important requirement in the field of cheminformatics. Reliable results can only be obtained if the underlying calculations are independent of the specific way molecules are represented in the input data. However, ensuring consistency is a complex tas...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci400724v
更新日期:2014-03-24 00:00:00
abstract::The configuring of a radial basis function network (RBFN) consists of selecting the network parameters (centers and widths in RBF units and weights between the hidden and output layers) and network architecture. The issues of suboptimum and overfitting, however, often occur in RBFN configuring. This paper presented a ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci600218d
更新日期:2006-11-01 00:00:00
abstract::Model lipid membranes made from binary mixtures of dimyristoylphosphatidylcholine/dipalmitoylphosphatidylcholine (DMPC/DPPC) and dimyristoylphosphatidylcholine/cholesterol (DMPC/Chol) exhibit coexistence of diverse lipid phases at appropriate temperature and composition. Since lipids in different phases show different...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0501793
更新日期:2005-11-01 00:00:00
abstract::In this DFT study, activities of 11 different N2O4, N2O3, and NO2 core containing Zr(IV) complexes, 4,13-diaza-18-crown-6 (I'N2O4), 1,4,10-trioxa-7,13-diazacyclopentadecane (I'N2O3), and 2-(2-methoxy)ethanol (I'NO2), respectively, and their analogues in peptide hydrolysis have been investigated. Based on the experimen...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.6b00781
更新日期:2017-05-22 00:00:00
abstract::We have developed a computer program for molecular dynamics (MD) simulation that implements the Split Integration Symplectic Method (SISM) and is designed to run on specialized parallel computers. The MD integration is performed by the SISM, which analytically treats high-frequency vibrational motion and thus enables ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050216q
更新日期:2005-11-01 00:00:00
abstract::We have performed a systematic study of the entropy term in the MM/GBSA (molecular mechanics combined with generalized Born and surface-area solvation) approach to calculate ligand-binding affinities. The entropies are calculated by a normal-mode analysis of harmonic frequencies from minimized snapshots of molecular d...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci3001919
更新日期:2012-08-27 00:00:00
abstract::Metal-ligand (M-L) bond lengths for a range of ligands (carboxylates, chlorides, pyridines, water, tertiary phosphines, and alkenes) and a variety of metals have been retrieved from the Cambridge Structural Database, CSD. Analysis of the factors which affect M-L bond lengths (for example, ligand coordination mode, oxi...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci0500785
更新日期:2005-11-01 00:00:00
abstract::While critically reviewing the current status of what is known about C28H14 and C30H14 benzenoid isomers, which are ubiquitous pyrolytic constituents, some new insights will be presented. Representative isomers belonging to these benzenoid hydrocarbons are at the crossroads to homologous series that extend to infinite...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci050298i
更新日期:2006-03-01 00:00:00
abstract::The protection exerted by 3,5-dihydroxy-4-methoxybenzyl alcohol (DHMBA), a phenolic compound recently isolated from the Pacific oyster, against oxidative stress (OS) is investigated using the density functional theory. Our results indicate that DHMBA is an outstanding peroxyl radical scavenger, being about 15 times an...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00513
更新日期:2015-12-28 00:00:00
abstract::The purpose of this investigation is to contribute to the development of new anticonvulsant drugs to treat patients with refractory epilepsy. We applied a virtual screening protocol that involved the search into molecular databases of new compounds and known drugs to find small molecules that interact with the open co...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.7b00721
更新日期:2018-07-23 00:00:00
abstract::Nonfibrillar neurotoxic amyloid β (Aβ) oligomer structures are typically rich in β-sheets, which could be promoted by metal ions like Zn(2+). Here, using molecular dynamics (MD) simulations, we systematically examined combinations of Aβ40 peptide conformations and Zn(2+) binding modes to probe the effects of secondary...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.5b00063
更新日期:2015-06-22 00:00:00
abstract::The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the ...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci200078f
更新日期:2011-09-26 00:00:00
abstract::Spin diffusion is a formidable problem when interpreting NMR data of chemical compounds. We developed a method to reconstruct the conformational ensemble of flexible molecules displaying spin diffusion, which minimizes the subjective bias in the interpretation of experimental data and which can be used routinely to ob...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.9b00259
更新日期:2019-06-24 00:00:00
abstract::Soluble low-molecular-weight oligomers formed during the early stage of amyloid aggregation are considered the major toxic species in amyloidosis. The structure-function relationship between oligomeric assemblies and the cytotoxicity in amyloid diseases are still elusive due to the heterogeneous and transient nature o...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.0c01319
更新日期:2021-01-14 00:00:00
abstract::Designing organic saccharide sensors for use in aqueous solution is a nontrivial endeavor. Incorporation of hydrogen bonding groups on a sensor's receptor unit to target saccharides is an obvious strategy but not one that is likely to ensure analyte-receptor interactions over analyte-solvent or receptor-solvent intera...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/acs.jcim.8b00987
更新日期:2019-05-28 00:00:00
abstract::Many of the conventional similarity methods assume that molecular fragments that do not relate to biological activity carry the same weight as the important ones. One possible approach to this problem is to use the Bayesian inference network (BIN), which models molecules and reference structures as probabilistic infer...
journal_title:Journal of chemical information and modeling
pub_type: 杂志文章
doi:10.1021/ci100232h
更新日期:2011-01-24 00:00:00