TY  - RPRT
ID  - jf:TUD-KE-2007-06
T1  - From Local Patterns to Global Models: The {LeGo} Approach to Data Mining
A1  - Crémilleux, Bruno
A1  - Fürnkranz, Johannes
A1  - Knobbe, Arno J.
A1  - Scholz, Martin
Y1  - 2007
IS  - TUD-KE-2007-06
T2  - TU Darmstadt, Knowledge Engineering Group
UR  - /publications/reports/tud-ke-2007-06.pdf
N2  - In this paper we present LeGo, a generic framework that utilizes existing local pattern mining
techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well
known KDD process models, our work identifies different phases within the data mining step,
each of which is formulated in terms of different formal constraints. It starts with a phase of
mining patterns that are individually promising. Later phases establish the context given by the
global data mining task by selecting groups of diverse and highly informative patterns, which are
finally combined to one or more global models that address the overall data mining task(s). The
paper discusses the connection to various learning techniques, and illustrates that our framework
is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams,
multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves
as a proof-of-concept of its high potential for practical data mining applications. Finally, we
point out several challenging open research questions that naturally emerge in a constraint-based
local-to-global pattern mining, selection, and combination framework.
ER  -