QSAR Models for Active Substances against <i>Pseudomonas aeruginosa</i> Using Disk-Diffusion Test Data

<i>Pseudomonas aeruginosa</i> is a Gram-negative bacillus included among the six “ESKAPE” microbial species with an outstanding ability to “escape” currently used antibiotics and developing new antibiotics against it is of the highest priority. Whereas minimum inhibitory concentration (M...

Full description

Bibliographic Details
Main Authors: Cosmin Alexandru Bugeac, Robert Ancuceanu, Mihaela Dinu
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/26/6/1734
Description
Summary:<i>Pseudomonas aeruginosa</i> is a Gram-negative bacillus included among the six “ESKAPE” microbial species with an outstanding ability to “escape” currently used antibiotics and developing new antibiotics against it is of the highest priority. Whereas minimum inhibitory concentration (MIC) values against <i>Pseudomonas aeruginosa</i> have been used previously for QSAR model development, disk diffusion results (inhibition zones) have not been apparently used for this purpose in the literature and we decided to explore their use in this sense. We developed multiple QSAR methods using several machine learning algorithms (support vector classifier, K nearest neighbors, random forest classifier, decision tree classifier, AdaBoost classifier, logistic regression and naïve Bayes classifier). We used four sets of molecular descriptors and fingerprints and three different methods of data balancing, together with the “native” data set. In total, 32 models were built for each set of descriptors or fingerprint and balancing method, of which 28 were selected and stacked to create meta-models. In terms of balanced accuracy, the best performance was provided by KNN, logistic regression and decision tree classifier, but the ensemble method had slightly superior results in nested cross-validation.
ISSN:1420-3049