A Clustering based Prediction Scheme for High Utility Itemsets

Thumbnail Image
Supplemental Files
Date
2017-01-01
Authors
Lakhawat, Piyush
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

We strongly believe that the current Utility Itemset Mining (UIM) problem model can be extended with a key modeling capability of predicting future itemsets based on prior knowledge of clusters in the dataset. Information in transactions fairly representative of a cluster type is more a characteristic of the cluster type than the the entire data. Subjecting such transactions to the common threshold in the UIM problem leads to information loss. We identify that an implicit use of the cluster structure of data in the UIM problem model will address this limitation. We achieve this by introducing a new clustering based utility in the definition of the UIM problem model and modifying the definitions of absolute utilities based on it. This enhances the UIM model by including a predictive aspect to it, thereby enabling the cluster specific patterns to emerge while still mining the inter-cluster patterns. By performing experiments on two real data sets we are able to verify that our proposed predictive UIM problem model extracts more useful information than the current UIM model with high accuracy.

Series Number
Journal Issue
Is Version Of
Versions
Series
Academic or Administrative Unit
Type
article
Comments

This proceeding is published as Piyush Lakhawat and Arun K. Somani, “A Clustering based Prediction Scheme for High Utility Itemsets.” In: Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, 123-134, 2017, Funchal, Madeira, Portugal. DOI: 10.5220/0006590001230134. Published in SCITEPRESS Digital Library. Posted with permission.

Rights Statement
Copyright
Sun Jan 01 00:00:00 UTC 2017
Funding
DOI
Supplemental Resources