Virtual Screening Using Binary Kernel Discrimination: Effect of Noisy Training Data and the Optimization of Performance
Virtual Screening Using Binary Kernel Discrimination: Effect of Noisy Training Data and the Optimization of Performance
Binary kernel discrimination (BKD) uses a training set of compounds, for which structural and qualitative activity data are available, to produce a model that can then be applied to the structures of other compounds in order to predict their likely activity. Experiments with the MDL Drug Data Report database show that the optimal value of the smoothing parameter, and hence the predictive power of BKD, is crucially dependent on the number of false positives in the training set. It is also shown that the best results for BKD are achieved using one particular optimization method for the determination of the smoothing parameter that lies at the heart of the method and using the Jaccard/Tanimoto coefficient in the kernel function that is used to compute the similarity between a test set molecule and the members of the training set.
478-486
Chen, Beining
84b464ee-4c98-429d-a92d-a2c3549fd00d
Harrison, Robert F.
c3ce2e0f-5408-4db9-90bd-a3149a932a72
Pasupa, Kitsuchart
952ededb-8c97-41b7-a65b-6aba31de2669
Willett, Peter
1e5de175-dff4-4449-8e65-98263351544b
Wilton, David J.
827a0ec9-83e0-4da0-88d5-5ef039445bf2
Wood, David J.
5a02ca09-d82e-4b2d-9638-0750fa4ca7d7
21 February 2006
Chen, Beining
84b464ee-4c98-429d-a92d-a2c3549fd00d
Harrison, Robert F.
c3ce2e0f-5408-4db9-90bd-a3149a932a72
Pasupa, Kitsuchart
952ededb-8c97-41b7-a65b-6aba31de2669
Willett, Peter
1e5de175-dff4-4449-8e65-98263351544b
Wilton, David J.
827a0ec9-83e0-4da0-88d5-5ef039445bf2
Wood, David J.
5a02ca09-d82e-4b2d-9638-0750fa4ca7d7
Chen, Beining, Harrison, Robert F., Pasupa, Kitsuchart, Willett, Peter, Wilton, David J. and Wood, David J.
(2006)
Virtual Screening Using Binary Kernel Discrimination: Effect of Noisy Training Data and the Optimization of Performance.
Journal of Chemical Information and Modeling, 46 (2), .
(doi:10.1021/ci0505426).
Abstract
Binary kernel discrimination (BKD) uses a training set of compounds, for which structural and qualitative activity data are available, to produce a model that can then be applied to the structures of other compounds in order to predict their likely activity. Experiments with the MDL Drug Data Report database show that the optimal value of the smoothing parameter, and hence the predictive power of BKD, is crucially dependent on the number of false positives in the training set. It is also shown that the best results for BKD are achieved using one particular optimization method for the determination of the smoothing parameter that lies at the heart of the method and using the Jaccard/Tanimoto coefficient in the kernel function that is used to compute the similarity between a test set molecule and the members of the training set.
This record has no associated files available for download.
More information
Published date: 21 February 2006
Organisations:
Electronics & Computer Science
Identifiers
Local EPrints ID: 266574
URI: http://eprints.soton.ac.uk/id/eprint/266574
ISSN: 1549-9596
PURE UUID: 4c65343a-3d75-4160-b34f-872763231efd
Catalogue record
Date deposited: 14 Aug 2008 12:48
Last modified: 14 Mar 2024 08:29
Export record
Altmetrics
Contributors
Author:
Beining Chen
Author:
Robert F. Harrison
Author:
Kitsuchart Pasupa
Author:
Peter Willett
Author:
David J. Wilton
Author:
David J. Wood
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics