| Relational Modeling | Fundamentals of Induction | Meta-Analysis of Techniques | Hybrid Models |
The first field that addressed the task of learning classification models from such complex representation was Inductive Logic Programming (ILP) combining the expressive power of First Order Logic and induction methods for model estimation.
More recently the machine learning community proposed upgrades of existing propositional (feature vector based) models drawing from ILP and statistical methods.
The most common upgrade approach is the transform the multi-relational domain into a feauture vector representation. This process involves two steps, the identification of related entities (based on idetifiers and keys) and the aggregation of sets of related entities into atomic feature-values.
The focus of my work has been on the aggregation of entities with high-dimensional categorical attributes from 1-n relationships. My theoretical work (see for instance KDD 2003) includes a taxonomy of relational concepts that can be expressed given a set of assumptions for aggregation.
For my dissertation I have developed a set of aggregation operators that are based on class-conditional density estimates and vector distances that miniumize the degree of information loss during aggregation and preserve task-specific predictive information. This methodology has outperformed alternative approaches (including ILP and existing upgrades) on a number of domains.