Learning from the ligand: using ligand-based features to improve binding affinity prediction

Machine learning scoring functions for protein-ligand binding affinity prediction have been found to consistently outperform classical scoring functions. Structure-based scoring functions for universal affinity prediction typically use features describing interactions derived from the protein-ligand...

Full beskrivning

Bibliografiska uppgifter
Huvudupphovsmän:	Boyles, F, Deane, C, Morris, G
Materialtyp:	Journal article
Publicerad:	2019

_version_	1826282762340925440
author	Boyles, F Deane, C Morris, G
author_facet	Boyles, F Deane, C Morris, G
author_sort	Boyles, F
collection	OXFORD
description	Machine learning scoring functions for protein-ligand binding affinity prediction have been found to consistently outperform classical scoring functions. Structure-based scoring functions for universal affinity prediction typically use features describing interactions derived from the protein-ligand complex, with limited information about the chemical or topological properties of the ligand itself. We demonstrate that the performance of machine learning scoring functions are consistently improved by the inclusion of diverse ligand-based features. For example, a Random Forest combining the features of RF-Score v3 with RDKit molecular descriptors achieved Pearson correlation coefficients of up to 0.831, 0.785, and 0.821 on the PDBbind 2007, 2013, and 2016 core sets respectively, compared to 0.790, 0.737, and 0.797 when using the features of RF-Score v3 alone. Excluding proteins and/or ligands that are similar to those in the test sets from the training set has a significant effect on scoring function performance, but does not remove the predictive power of ligand-based features. Furthermore a Random Forest using only ligand-based features is predictive at a level similar to classical scoring functions and it appears to be predicting the mean binding affinity of a ligand for its protein targets.
first_indexed	2024-03-07T00:48:44Z
format	Journal article
id	oxford-uuid:859e4ea6-731f-4af3-8cfb-a31adec9ca8a
institution	University of Oxford
last_indexed	2024-03-07T00:48:44Z
publishDate	2019
record_format	dspace
spelling	oxford-uuid:859e4ea6-731f-4af3-8cfb-a31adec9ca8a2022-03-26T21:58:45ZLearning from the ligand: using ligand-based features to improve binding affinity predictionJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:859e4ea6-731f-4af3-8cfb-a31adec9ca8aSymplectic Elements at Oxford2019Boyles, FDeane, CMorris, GMachine learning scoring functions for protein-ligand binding affinity prediction have been found to consistently outperform classical scoring functions. Structure-based scoring functions for universal affinity prediction typically use features describing interactions derived from the protein-ligand complex, with limited information about the chemical or topological properties of the ligand itself. We demonstrate that the performance of machine learning scoring functions are consistently improved by the inclusion of diverse ligand-based features. For example, a Random Forest combining the features of RF-Score v3 with RDKit molecular descriptors achieved Pearson correlation coefficients of up to 0.831, 0.785, and 0.821 on the PDBbind 2007, 2013, and 2016 core sets respectively, compared to 0.790, 0.737, and 0.797 when using the features of RF-Score v3 alone. Excluding proteins and/or ligands that are similar to those in the test sets from the training set has a significant effect on scoring function performance, but does not remove the predictive power of ligand-based features. Furthermore a Random Forest using only ligand-based features is predictive at a level similar to classical scoring functions and it appears to be predicting the mean binding affinity of a ligand for its protein targets.
spellingShingle	Boyles, F Deane, C Morris, G Learning from the ligand: using ligand-based features to improve binding affinity prediction
title	Learning from the ligand: using ligand-based features to improve binding affinity prediction
title_full	Learning from the ligand: using ligand-based features to improve binding affinity prediction
title_fullStr	Learning from the ligand: using ligand-based features to improve binding affinity prediction
title_full_unstemmed	Learning from the ligand: using ligand-based features to improve binding affinity prediction
title_short	Learning from the ligand: using ligand-based features to improve binding affinity prediction
title_sort	learning from the ligand using ligand based features to improve binding affinity prediction
work_keys_str_mv	AT boylesf learningfromtheligandusingligandbasedfeaturestoimprovebindingaffinityprediction AT deanec learningfromtheligandusingligandbasedfeaturestoimprovebindingaffinityprediction AT morrisg learningfromtheligandusingligandbasedfeaturestoimprovebindingaffinityprediction

Learning from the ligand: using ligand-based features to improve binding affinity prediction

Liknande verk