The application and effectiveness of a multi-objective metaheuristic algorithm for partial classification

B. de la Iglesia, G. Richards, M. S. Philpott, V. J. Rayward-Smith

Research output: Contribution to journalArticle

29 Citations (Scopus)

Abstract

In this paper, we present an application of multi-objective metaheuristics to the field of data mining. We introduce the data mining task of nugget discovery (also known as partial classification) and show how the multi-objective metaheuristic algorithm NSGA II can be modified to solve this problem. We also present an alternative algorithm for the same task, the ARAC algorithm, which can find all rules that are best according to some measures of interest subject to certain constraints. The ARAC algorithm provides an excellent basis for comparison with the results of the multi-objective metaheuristic algorithm as it can deliver the Pareto optimal front consisting of all partial classification rules that lie in the upper confidence/coverage border, for databases of limited size. We present the results of experiments with various well-known databases for both algorithms. We also discuss how the two methods can be used complementarily for large databases to deliver a set of best rules according to some predefined criteria, providing a powerful tool for knowledge discovery in databases.
Original languageEnglish
Pages (from-to)898-917
Number of pages20
JournalEuropean Journal of Operational Research
Volume169
Issue number3
DOIs
Publication statusPublished - 16 Mar 2006

Cite this