OneStopEnglish corpus: A new corpus for automatic readability assessment and text simplification
dc.contributor.author | Vajjala, Sowmya | |
dc.contributor.author | Lucic, Ivana | |
dc.contributor.department | Department of English | |
dc.date | 2018-06-15T19:26:24.000 | |
dc.date.accessioned | 2020-06-30T02:19:30Z | |
dc.date.available | 2020-06-30T02:19:30Z | |
dc.date.copyright | Mon Jan 01 00:00:00 UTC 2018 | |
dc.date.embargo | 2018-06-15 | |
dc.date.issued | 2018-01-01 | |
dc.description.abstract | <p>This paper describes the collection and compilation of the OneStopEnglish corpus of texts written at three reading levels, and demonstrates its usefulness for through two applications - automatic readability assessment and automatic text simplification. The corpus consists of 189 texts, each in three versions (567 in total). The corpus is now freely available under a CC by-SA 4.0 license1 and we hope that it would foster further research on the topics of readability assessment and text simplification.</p> | |
dc.description.comments | <p>This proceeding is published as Vajjala, Sowmya, and Ivana Lucic. "OneStopEnglish corpus: A new corpus for automatic readability assessment and text simplification." In <em>Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications </em>(2018): 297-304.</p> | |
dc.format.mimetype | application/pdf | |
dc.identifier | archive/lib.dr.iastate.edu/engl_conf/10/ | |
dc.identifier.articleid | 1009 | |
dc.identifier.contextkey | 12326203 | |
dc.identifier.s3bucket | isulib-bepress-aws-west | |
dc.identifier.submissionpath | engl_conf/10 | |
dc.identifier.uri | https://dr.lib.iastate.edu/handle/20.500.12876/23394 | |
dc.language.iso | en | |
dc.source.bitstream | archive/lib.dr.iastate.edu/engl_conf/10/2018_Vajjala_OneStopEnglish.pdf|||Fri Jan 14 18:10:24 UTC 2022 | |
dc.subject.disciplines | Computational Linguistics | |
dc.subject.disciplines | English Language and Literature | |
dc.subject.disciplines | Linguistics | |
dc.title | OneStopEnglish corpus: A new corpus for automatic readability assessment and text simplification | |
dc.type | article | |
dc.type.genre | conference | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | da901803-53ba-4a27-ab39-b049f6d505b6 | |
relation.isOrgUnitOfPublication | a7f2ac65-89b1-4c12-b0c2-b9bb01dd641b |
File
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- 2018_Vajjala_OneStopEnglish.pdf
- Size:
- 116.39 KB
- Format:
- Adobe Portable Document Format
- Description: