A Clustering based Prediction Scheme for High Utility Itemsets

Lakhawat, Piyush; Somani, Arun

A Clustering based Prediction Scheme for High Utility Itemsets

File

2017_SomaniArun_ClusteringBased.pdf (968.31 KB)

Date

2017-01-01

Authors

Lakhawat, Piyush

Somani, Arun

Abstract

We strongly believe that the current Utility Itemset Mining (UIM) problem model can be extended with a key modeling capability of predicting future itemsets based on prior knowledge of clusters in the dataset. Information in transactions fairly representative of a cluster type is more a characteristic of the cluster type than the the entire data. Subjecting such transactions to the common threshold in the UIM problem leads to information loss. We identify that an implicit use of the cluster structure of data in the UIM problem model will address this limitation. We achieve this by introducing a new clustering based utility in the definition of the UIM problem model and modifying the definitions of absolute utilities based on it. This enhances the UIM model by including a predictive aspect to it, thereby enabling the cluster specific patterns to emerge while still mining the inter-cluster patterns. By performing experiments on two real data sets we are able to verify that our proposed predictive UIM problem model extracts more useful information than the current UIM model with high accuracy.

Academic or Administrative Unit

Department of Electrical and Computer Engineering

Type

article

Comments

This proceeding is published as Piyush Lakhawat and Arun K. Somani, “A Clustering based Prediction Scheme for High Utility Itemsets.” In: Proceedings of the 9th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KDIR, 123-134, 2017, Funchal, Madeira, Portugal. DOI: 10.5220/0006590001230134. Published in SCITEPRESS Digital Library. Posted with permission.

Copyright

Sun Jan 01 00:00:00 UTC 2017