Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery

<p>Drug discovery has witnessed an increase in the application of in silico methods to complement existing <em>in vitro</em> and <em>in vivo</em> experiments, in an attempt to 'fail fast' and reduce the high attrition rates of clinical phases. Computer algori...

Full description

Bibliographic Details
Main Author: Ebejer, JP
Other Authors: Deane, CM
Format: Thesis
Language:English
Published: 2014
Subjects:
_version_ 1797112603511619584
author Ebejer, JP
author2 Deane, CM
author_facet Deane, CM
Ebejer, JP
author_sort Ebejer, JP
collection OXFORD
description <p>Drug discovery has witnessed an increase in the application of in silico methods to complement existing <em>in vitro</em> and <em>in vivo</em> experiments, in an attempt to 'fail fast' and reduce the high attrition rates of clinical phases. Computer algorithms have been successfully employed for many tasks including biological target selection, hit identification, lead optimization, binding affinity determination, ADME and toxicity prediction, side-effect prediction, drug repurposing, and, in general, to direct experimental work.</p><p>This thesis describes a multifaceted approach to virtual screening, to computationally identify small-molecule inhibitors against a biological target of interest. Conformer generation is a critical step in all virtual screening methods that make use of atomic 3D data. We therefore analysed the ability of computational tools to reproduce high quality, experimentally resolved conformations of organic small-molecules. We selected the best performing method (RDKit), and developed a protocol that generates a non-redundant conformer ensemble which tends to contain low-energy structures close to those experimentally observed.</p><p>We then outline the steps we took to build a multi-million, small-molecule database (including molecule standardization and efficient exact, substructure and similarity searching capabilities), for use in our virtual screening experiments. We generated conformers and descriptors for the molecules in the database. We tagged a subset of the database as `drug-like' and clustered this to provide a reduced, diverse set of molecules for use in more computationally-intensive virtual screening protocols.</p><p>We next describe a novel virtual screening method we developed, called Ligity, that makes use of known protein-ligand <em>holo</em> structures as queries to search the small-molecule database for putative actives. Ligity has been validated against targets from the DUD-E dataset, and has shown, on average, better performance than other 3D methods. We also show that performance improved when we fused the results from multiple input structures. This bodes well for Ligity's future use, especially when considering that protein structure databases such as the Protein Data Bank are growing exponentially every year.</p><p>Lastly, we describe the fruitful application of structure-based and ligand-based virtual screening methods to <em>Plasmodium falciparum</em> Subtilisin-like Protease 1 (PfSUB1), an important drug target in the human stages of the life-cycle of the malaria parasite. Our ligand-based virtual screening study resulted in the discovery of novel PfSUB1 inhibitors. Further lead optimization of these compounds, to improve binding affinity in the nanomolar range, may promote them as drug candidates.</p><p>In this thesis we postulate that the accuracy of computational tools in drug discovery may be enhanced to take advantage of the exponential increase of experimental data and the availability of cheaper computational power such as cloud computing.</p>
first_indexed 2024-03-07T08:26:28Z
format Thesis
id oxford-uuid:96d73300-f767-4ed6-8dda-a13a4aeb40e0
institution University of Oxford
language English
last_indexed 2024-03-07T08:26:28Z
publishDate 2014
record_format dspace
spelling oxford-uuid:96d73300-f767-4ed6-8dda-a13a4aeb40e02024-02-19T12:10:35ZData driven approaches to improve the drug discovery process: a virtual screening quest in drug discoveryThesishttp://purl.org/coar/resource_type/c_db06uuid:96d73300-f767-4ed6-8dda-a13a4aeb40e0Computational chemistryBioinformatics (life sciences)EnglishOxford University Research Archive - Valet2014Ebejer, JPDeane, CMMorris, GM<p>Drug discovery has witnessed an increase in the application of in silico methods to complement existing <em>in vitro</em> and <em>in vivo</em> experiments, in an attempt to 'fail fast' and reduce the high attrition rates of clinical phases. Computer algorithms have been successfully employed for many tasks including biological target selection, hit identification, lead optimization, binding affinity determination, ADME and toxicity prediction, side-effect prediction, drug repurposing, and, in general, to direct experimental work.</p><p>This thesis describes a multifaceted approach to virtual screening, to computationally identify small-molecule inhibitors against a biological target of interest. Conformer generation is a critical step in all virtual screening methods that make use of atomic 3D data. We therefore analysed the ability of computational tools to reproduce high quality, experimentally resolved conformations of organic small-molecules. We selected the best performing method (RDKit), and developed a protocol that generates a non-redundant conformer ensemble which tends to contain low-energy structures close to those experimentally observed.</p><p>We then outline the steps we took to build a multi-million, small-molecule database (including molecule standardization and efficient exact, substructure and similarity searching capabilities), for use in our virtual screening experiments. We generated conformers and descriptors for the molecules in the database. We tagged a subset of the database as `drug-like' and clustered this to provide a reduced, diverse set of molecules for use in more computationally-intensive virtual screening protocols.</p><p>We next describe a novel virtual screening method we developed, called Ligity, that makes use of known protein-ligand <em>holo</em> structures as queries to search the small-molecule database for putative actives. Ligity has been validated against targets from the DUD-E dataset, and has shown, on average, better performance than other 3D methods. We also show that performance improved when we fused the results from multiple input structures. This bodes well for Ligity's future use, especially when considering that protein structure databases such as the Protein Data Bank are growing exponentially every year.</p><p>Lastly, we describe the fruitful application of structure-based and ligand-based virtual screening methods to <em>Plasmodium falciparum</em> Subtilisin-like Protease 1 (PfSUB1), an important drug target in the human stages of the life-cycle of the malaria parasite. Our ligand-based virtual screening study resulted in the discovery of novel PfSUB1 inhibitors. Further lead optimization of these compounds, to improve binding affinity in the nanomolar range, may promote them as drug candidates.</p><p>In this thesis we postulate that the accuracy of computational tools in drug discovery may be enhanced to take advantage of the exponential increase of experimental data and the availability of cheaper computational power such as cloud computing.</p>
spellingShingle Computational chemistry
Bioinformatics (life sciences)
Ebejer, JP
Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title_full Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title_fullStr Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title_full_unstemmed Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title_short Data driven approaches to improve the drug discovery process: a virtual screening quest in drug discovery
title_sort data driven approaches to improve the drug discovery process a virtual screening quest in drug discovery
topic Computational chemistry
Bioinformatics (life sciences)
work_keys_str_mv AT ebejerjp datadrivenapproachestoimprovethedrugdiscoveryprocessavirtualscreeningquestindrugdiscovery