Artificial Intelligence Research Group

Some Past and Current Projects
Under Development

15. CUPID: An Iterative Knowledge Discovery Framework
Investigator(s): Mr Jason Mallen and Professor Max Bramer

Knowledge Discovery from Databases (KDD) is concerned with utilising techniques borrowed from fields such as machine learning, statistics and databases to search for relationships and global patterns that may exist in large databases but are hidden among the vast amounts of data. The discovered knowledge can be helpful for building knowledge based systems and data analysis. The underlying principle behind CUPID is the use of a quantitative measure for the interest of a hypothesis. This measure provides a method of ranking competing hypotheses and thus allows the system to store the best or most interesting rules describing a database. CUPID is based on the ITRule algorithm of Smyth and Goodman (1992) and extends that algorithm with added functionality. CUPID provides four fundamental features. One, background knowledge in the form of attribute value generalisation hierarchies may be utilised. Two, prior domain knowledge which may be incorrect and incomplete may be provided by a domain expert. Three, knowledge may be re-used. Four, noise in the dataset is handled in a well-founded manner.

J.I.Mallen and M.A.Bramer (1994). CUPID: An Iterative Knowledge Discovery Framework. In "Research and Development in Expert Systems XI". SGES Publications, 1994.