Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data

Thumbnail Image
Date
2023
Authors
Major Professor
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Authors
Person
Park, Young-Woong
Assistant Professor
Person
Zhu, Dan
Professor
Research Projects
Organizational Units
Organizational Unit
Information Systems and Business Analytics
In today’s business landscape, information systems and business analytics are pivotal elements that drive success. Information systems form the digital foundation of modern enterprises, while business analytics involves the strategic analysis of data to extract meaningful insights. Information systems have the power to create and restructure industries, empower individuals and firms, and dramatically reduce costs. Business analytics empowers organizations to make precise, data-driven decisions that optimize operations, enhance strategies, and fuel overall growth. Explore these essential fields to understand how data and technology come together, providing the knowledge needed to make informed decisions and achieve remarkable outcomes.
Journal Issue
Is Version Of
Versions
Series
Abstract
Ratings are frequently used to evaluate and compare subjects in various applications, from education to healthcare, because ratings provide succinct yet credible measures for comparing subjects. However, when multiple rating lists are combined or considered together, subjects often have missing ratings, because most rating lists do not rate every subject in the combined list. In this study, we propose analyses on missing value patterns using six real-world data sets in various applications, as well as the conditions for applicability of imputation algorithms. Based on the special structures and properties derived from the analyses, we propose optimization models and algorithms that minimize the total rating discordance across rating providers to impute missing ratings in the combined rating lists, using only the known rating information. The total rating discordance is defined as the sum of the pairwise discordance metric, which can be written as a quadratic function. Computational experiments based on real-world and synthetic rating data sets show that the proposed methods outperform the state-of-the-art general imputation methods in the literature in terms of imputation accuracy.
Comments
This preprint is cited as Park, Y.W., Kim, J., Zhu, D., Discordance Minimization-based Imputation Algorithms for Missing Values in Rating Data. arXiv, 2023; https://doi.org/10.48550/arXiv.2311.04035. Posted with permission.
https://creativecommons.org/licenses/by/4.0/
Description
Keywords
Citation
DOI
Copyright
Collections