JSAI Technical Report, Type 2 SIG
Online ISSN : 2436-5556
Mining Discriminative Subgraphs with Pruning based on Upper-bound of Ingormation Gain
Masahiro HARAKiyoto TAKABAYASHIKouzou OHARAHiroshi MOTODATakashi WASHIO
Author information
RESEARCH REPORT / TECHNICAL REPORT FREE ACCESS

2007 Volume 2007 Issue DMSM-A603 Pages 13-

Details
Abstract

A graph mining technique called Chunkingless Graph-Based Induction (Cl-GBI) can extract discriminative subgraphs from graph structured data by the operation called chunkingless pairwise expansion which constructs pseudo-nodes from selected pairs of nodes in the data. Because of the time and space complexities, it happens that Cl-GBI cannot extract good enough to describe characteristics of data. Thus, to improve its efficiency, we propose a pruning method based on the upper-bound of information gain. Information gain is used as a criterion of discriminativity in Cl-GBI and the upper-bound of information gain of a subgraph is the maximal one that its super graph can achieve. The proposed method allows Cl-GBI to exclude from its search space unfruitful subgraphs that cannot yield the most discriminative one, by comparing the upper-bound of information gain of each subgraph at hand with the best information gain at the moment. Furthermore, in this paper, we experimentally show the usefulness of the proposed pruning method by applying CI-GBI adopting it to both a real world dataset and an artificial datasets.

Content from these authors
© 2007 Authors
Previous article Next article
feedback
Top