Detecting Rater Effects Using Many-Facet Rasch Models and Bootstrap Techniques

dc.contributor.author Zhou, Ziwei
dc.contributor.department Statistics (LAS)
dc.contributor.majorProfessor Amy Froelich
dc.date 2021-01-08T15:02:40.000
dc.date.accessioned 2021-02-25T00:04:22Z
dc.date.available 2021-02-25T00:04:22Z
dc.date.copyright Wed Jan 01 00:00:00 UTC 2020
dc.date.embargo 2020-10-03
dc.date.issued 2020-01-01
dc.description.abstract <p>The quality of ratings provided by expert raters in evaluating language learners’ constructed responses in performance assessment is typically investigated by means of statistical modeling. Several rater effects, including severity/leniency, central tendency, and randomness, have been well documented in the psychometrics literature (Myford & Wolfe, 2003). This study applies the Many-Facets Rasch Models to detect these rater effects for an in-house speaking assessment for international teaching assistants (ITAs) in a US university. The goal of this study is to evaluate the extent to which the models, estimation procedures, and statistics/numerical indices that are adopted in this study would work as intended in this context. Two simulation studies are conducted where different model parameters are simulated from different distributions, and a parametric bootstrap procedure is applied to attest to the statistical properties (i.e., consistency, variability, and mean squared error) of the parameter estimates and fit statistics. Then, the model parameters are estimated from the actual data, and the estimates are compared using different estimation procedures (Joint Maximum Likelihood (JML) vs. Marginal Maximum Likelihood (MML)) and different computational implementations (R vs. Facets). The parametric bootstrap procedure is also applied to provide an estimate of the sampling distributions of the parameters and fit statistics through replications. Finally, the indices for rater effects detection are compared using both numerical summaries and plotting techniques.</p> <p>Results indicated that, when the model parameters and rater effects were simulated, the estimated severity parameters and the fit statistics were sensitive in detecting the intended effects. In comparison, MML estimation method showed certain superiority, in terms of statistical consistency and variability, over JML estimation method. But neither estimation method was free of bias. This was also true when the actual data were analyzed. Moreover, in terms of detecting the centrality or randomness effects in the actual data, evidence from the fit statistics could be used in conjunction with other indices from Facets and visualization techniques. However, the bootstrap results for the fit statistics indicated that, when the empirical distributions of the fit statistics were considered, disagreements between MML and JML were relatively large and the rule-of-thumb critical ranges of the fit statistic may be questionable.</p>
dc.format.mimetype Word
dc.identifier archive/lib.dr.iastate.edu/creativecomponents/692/
dc.identifier.articleid 1681
dc.identifier.contextkey 19652634
dc.identifier.doi https://doi.org/10.31274/cc-20240624-516
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath creativecomponents/692
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/93812
dc.source.bitstream archive/lib.dr.iastate.edu/creativecomponents/692/Zhou_CC_2020.docx|||Sat Jan 15 01:30:57 UTC 2022
dc.source.bitstream archive/lib.dr.iastate.edu/creativecomponents/692/auto_convert.pdf|||Sat Jan 15 01:30:59 UTC 2022
dc.subject.disciplines Social Statistics
dc.subject.keywords speaking assessment
dc.subject.keywords rater effects
dc.subject.keywords Many-Facets Rasch measurement
dc.subject.keywords Monte Carlo
dc.subject.keywords bootstrap
dc.title Detecting Rater Effects Using Many-Facet Rasch Models and Bootstrap Techniques
dc.type creative component
dc.type.genre creative component
dspace.entity.type Publication
relation.isOrgUnitOfPublication 264904d9-9e66-4169-8e11-034e537ddbca
thesis.degree.discipline Statistics
thesis.degree.level creativecomponent
File
Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
auto_convert.pdf
Size:
3.88 MB
Format:
Adobe Portable Document Format
Description:
No Thumbnail Available
Name:
Zhou_CC_2020.docx
Size:
5.4 MB
Format:
Microsoft Word XML
Description: