Handwriting identification using random forests and score-based likelihood ratios

dc.contributor.author Johnson, Madeline Quinn
dc.contributor.author Ommen, Danica M.
dc.contributor.department Center for Statistics and Applications in Forensic Evidence
dc.date.accessioned 2022-02-04T16:46:29Z
dc.date.available 2022-02-04T16:46:29Z
dc.date.issued 2021
dc.description.abstract Handwriting analysis is conducted by forensic document examiners who are able to visually recognize characteristics of writing to evaluate the evidence of writership. Recently, there have been incentives to investigate how to quantify the similarity between two written documents to support the conclusions drawn by experts. We use an automatic algorithm within the “handwriter” package in R, to decompose a handwritten sample into small graphical units of writing. These graphs are sorted into 40 exemplar groups or clusters. We hypothesize that the frequency with which a person contributes graphs to each cluster is characteristic of their handwriting. Given two questioned handwritten documents, we can then use the vectors of cluster frequencies to quantify the similarity between the two documents. We extract features from the difference between the vectors and combine them using a random forest. The output from the random forest is used as the similarity score to compare documents. We estimate the distributions of the similarity scores computed from multiple pairs of documents known to have been written by the same and by different persons, and use these estimated densities to obtain score-based likelihood ratios (SLRs) that rely on different assumptions. We find that the SLRs are able to indicate whether the similarity observed between two documents is more or less likely depending on writership.
dc.description.comments The following is published as Johnson, Madeline Quinn, and Danica M. Ommen. "Handwriting identification using random forests and score‐based likelihood ratios." Statistical Analysis and Data Mining: The ASA Data Science Journal (2021). Posted with permission of CSAFE. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided theoriginal work is properly cited
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/qzoDXD2w
dc.language.iso en_US
dc.publisher © 2021 The Authors.Statistical Analysis and Data Miningpublished by Wiley Periodicals LLC
dc.source.uri https://doi.org/10.1002/sam.11566 *
dc.subject handwriting analysis
dc.subject machine learning
dc.subject SLR
dc.title Handwriting identification using random forests and score-based likelihood ratios
dc.type Article
dspace.entity.type Publication
relation.isOrgUnitOfPublication d8a3c72b-850f-40f6-87c4-8812547080c7
File
Original bundle
Now showing 1 - 1 of 1
Name:
2022-Johnson-HandwritingIdentification.pdf
Size:
6.14 MB
Format:
Adobe Portable Document Format
Description:
Collections