Statistical inference using Regularized M-estimation in the reproducing kernel Hilbert space for handling missing data

dc.contributor.author Wang, Hengfang
dc.contributor.author Kim, Jae Kwang
dc.contributor.department Statistics (LAS)
dc.date.accessioned 2022-05-26T14:13:48Z
dc.date.available 2022-05-26T14:13:48Z
dc.date.issued 2021
dc.description.abstract Imputation and propensity score weighting are two popular techniques for handling missing data. We address these problems using the regularized M-estimation techniques in the reproducing kernel Hilbert space. Specifically, we first use the kernel ridge regression to develop imputation for handling item nonresponse. While this nonparametric approach is potentially promising for imputation, its statistical properties are not investigated in the literature. Under some conditions on the order of the tuning parameter, we first establish the root-n consistency of the kernel ridge regression imputation estimator and show that it achieves the lower bound of the semiparametric asymptotic variance. A nonparametric propensity score estimator using the reproducing kernel Hilbert space is also developed by a novel application of the maximum entropy method for the density ratio function estimation. We show that the resulting propensity score estimator is asymptotically equivalent to the kernel ridge regression imputation estimator. Results from a limited simulation study are also presented to confirm our theory. The proposed method is applied to analyze the air pollution data measured in Beijing, China.
dc.description.comments This is a pre-print of the article Wang, Hengfang, and Jae Kwang Kim. "Statistical inference using Regularized M-estimation in the reproducing kernel Hilbert space for handling missing data." arXiv preprint arXiv:2107.07371 (2021). DOI: 10.48550/arXiv.2107.07371. Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0). Copyright 2021 The Authors. Posted with permission.
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/1wgePpAr
dc.language.iso en
dc.publisher arXiv
dc.source.uri https://doi.org/10.48550/arXiv.2107.07371 *
dc.subject.keywords Imputation
dc.subject.keywords Kernel ridge regression;
dc.subject.keywords Missing at random;
dc.subject.keywords Propensity score
dc.title Statistical inference using Regularized M-estimation in the reproducing kernel Hilbert space for handling missing data
dc.type Preprint
dspace.entity.type Publication
relation.isAuthorOfPublication fdf914ae-e48d-4f4e-bfa2-df7a755320f4
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
2022-KimJaeKwang-StatisticalInference.pdf
Size:
433.08 KB
Format:
Adobe Portable Document Format
Description:
Collections