Statistical Data Integration in Survey Sampling: A Review

dc.contributor.author Yang, Shu
dc.contributor.author Kim, Jae Kwang
dc.contributor.author Kim, Jae Kwang
dc.contributor.department Statistics
dc.date 2020-10-07T20:51:44.000
dc.date.accessioned 2021-02-26T13:20:30Z
dc.date.available 2021-02-26T13:20:30Z
dc.date.copyright Wed Jan 01 00:00:00 UTC 2020
dc.date.issued 2020-01-01
dc.description.abstract <p>Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration provides a timely solution by leveraging multiple data sources to provide more robust and efficient inference than using any single data source alone. The technique for data integration varies depending on types of samples and available information to be combined. This article provides a systematic review of data integration techniques for combining probability samples, probability and non-probability samples, and probability and big data samples. We discuss a wide range of integration methods such as generalized least squares, calibration weighting, inverse probability weighting, mass imputation and doubly robust methods. Finally, we highlight important questions for future research.</p>
dc.description.comments <p>This preprint is made available through arXiv: <a href="https://arxiv.org/abs/2001.03259" target="_blank">https://arxiv.org/abs/2001.03259</a>.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/stat_las_pubs/310/
dc.identifier.articleid 1308
dc.identifier.contextkey 19705725
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath stat_las_pubs/310
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/98886
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/stat_las_pubs/310/2020_Kim_StatisticalDataPreprint.pdf|||Fri Jan 14 23:31:16 UTC 2022
dc.subject.disciplines Categorical Data Analysis
dc.subject.disciplines Design of Experiments and Sample Surveys
dc.subject.disciplines Statistical Methodology
dc.subject.keywords Data fusion
dc.subject.keywords Generalizability
dc.subject.keywords Meta analysis
dc.subject.keywords Missingness at random
dc.subject.keywords Transportability
dc.title Statistical Data Integration in Survey Sampling: A Review
dc.type article
dc.type.genre article
dspace.entity.type Publication
relation.isAuthorOfPublication fdf914ae-e48d-4f4e-bfa2-df7a755320f4
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
File
Original bundle
Now showing 1 - 1 of 1
Name:
2020_Kim_StatisticalDataPreprint.pdf
Size:
360.2 KB
Format:
Adobe Portable Document Format
Description:
Collections