Inductive logic programming for data mining in economics
Inductive logic programming for data mining in economics
This paper addresses the problem of data mining in Inductive Logic Programming (ILP) motivated by its application in the domain of economics. ILP systems have been largely applied to data mining classification tasks with considerable success. The use of ILP systems in regression tasks has been far less successful. Current systems have very limited numerical reasoning capabilities, which limits the application of ILP to discovery of functional relationships of numerical nature. This paper proposes improvements in numerical reasoning capabilities of ILP systems for dealing with regression tasks. It proposes the use of statistical-based techniques like Model Validation and Model Selection to improve noise handling and it introduces a new search stopping criterion inspired in the PAC learning framework. We have found these extensions essential to improve on results over machine learning and statistical-based algorithms used in the empirical evaluation study.
Couto Alves, Alexessander
87b9179e-abde-4ca5-abfc-4b7c5ac8b03b
Camacho, Rui
8ff65c85-a4e7-4278-ad6c-e27a2d469416
Oliveira, Eugene
88b6f1c1-04b4-4e28-a016-87bd26e4ffea
September 2004
Couto Alves, Alexessander
87b9179e-abde-4ca5-abfc-4b7c5ac8b03b
Camacho, Rui
8ff65c85-a4e7-4278-ad6c-e27a2d469416
Oliveira, Eugene
88b6f1c1-04b4-4e28-a016-87bd26e4ffea
Couto Alves, Alexessander, Camacho, Rui and Oliveira, Eugene
(2004)
Inductive logic programming for data mining in economics.
In Proceedings of the 2nd International Workshop on Data Mining and Adaptive Modeling Methods for Economics and Management, Pisa.
15 pp
.
Record type:
Conference or Workshop Item
(Paper)
Abstract
This paper addresses the problem of data mining in Inductive Logic Programming (ILP) motivated by its application in the domain of economics. ILP systems have been largely applied to data mining classification tasks with considerable success. The use of ILP systems in regression tasks has been far less successful. Current systems have very limited numerical reasoning capabilities, which limits the application of ILP to discovery of functional relationships of numerical nature. This paper proposes improvements in numerical reasoning capabilities of ILP systems for dealing with regression tasks. It proposes the use of statistical-based techniques like Model Validation and Model Selection to improve noise handling and it introduces a new search stopping criterion inspired in the PAC learning framework. We have found these extensions essential to improve on results over machine learning and statistical-based algorithms used in the empirical evaluation study.
This record has no associated files available for download.
More information
Published date: September 2004
Identifiers
Local EPrints ID: 494906
URI: http://eprints.soton.ac.uk/id/eprint/494906
PURE UUID: 879c67df-e218-4952-b8c9-aa470e7d827a
Catalogue record
Date deposited: 22 Oct 2024 16:47
Last modified: 23 Oct 2024 02:10
Export record
Contributors
Author:
Alexessander Couto Alves
Author:
Rui Camacho
Author:
Eugene Oliveira
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics