Types and Classes of Machine Learning and Data Mining

Lloyd Allison, School of Computer Science & Software Engineering, Monash University, Clayton, Victoria, Australia 3800

Abstract: The notion of a statistical model, as inferred and used in statistics, machine learning and data mining, is examined from a semantic point of view. Data types and type-classes for models are developed that allow models to be manipulated in a type-safe yet flexible way. The programming language Haskell-98, with its system of polymorphic types and type-classes, is used as the meta-language for this exercise, so one of the by-products is a running program.
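
To give a flavour of what a type-class for models might look like, here is a minimal Haskell-98 sketch. It is an illustration only and is not taken from the paper: the names Model, ModelType, MPr, pr, nlPr, bivariate, nlPrAll, Coin and fairCoin are all assumptions chosen for this example. It shows a class of models that assign probabilities to data, one instance, and a combinator that builds a model of pairs from two independent models.

```haskell
module Main where

-- NOTE: every name below is hypothetical, chosen for illustration;
-- these are not the definitions used in the paper.

-- A class of models over a data space ds.
class Model m where
  pr   :: m ds -> ds -> Double        -- probability of a datum
  nlPr :: m ds -> ds -> Double        -- negative log probability, in nits
  nlPr m x = negate (log (pr m x))    -- default, derived from pr

-- A simple model representation: just a probability function.
newtype ModelType ds = MPr (ds -> Double)

instance Model ModelType where
  pr (MPr f) = f

-- An example data space and a model over it.
data Coin = Heads | Tails deriving (Eq, Show)

fairCoin :: ModelType Coin
fairCoin = MPr (const 0.5)

-- Two independent models combine into a model of pairs.
bivariate :: (Model m1, Model m2) => m1 a -> m2 b -> ModelType (a, b)
bivariate m1 m2 = MPr (\(x, y) -> pr m1 x * pr m2 y)

-- Total negative log probability of a data set under a model.
nlPrAll :: Model m => m ds -> [ds] -> Double
nlPrAll m = sum . map (nlPr m)

main :: IO ()
main = do
  let twoCoins = bivariate fairCoin fairCoin
  print (pr twoCoins (Heads, Tails))              -- 0.5 * 0.5
  print (nlPrAll fairCoin [Heads, Tails, Heads])  -- three times log 2
```

In this sketch the type checker rejects an expression such as pr fairCoin 'x', because a model of Coin cannot be applied to a Char, while bivariate remains polymorphic in both component data spaces. That is the sense of "type-safe yet flexible" the example is meant to suggest.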

Keywords: Classes, data mining, data types, functional programming, inductive inference, machine learning.

© 2003, Australian Computer Society, Inc. This paper appeared at the Twenty-Sixth Australasian Computer Science Conference (ACSC2003) pp.207-215, Adelaide, Australia, 4-7 February 2003. Conferences in Research and Practice in Information Technology, Vol.16. Michael Oudshoorn, Ed. Reproduction for academic, not-for-profit purposes permitted provided this text is included.

Full paper: pdf@[acm.org]['22] and locally [paper.pdf].

Also see [JFP].