Evaluating Cloud-Based Speech-to-Text Services

File
ben_simon_creative_component_2021_07_22_final_submitted.pdf (252.57 KB)

File Embargoed Until: (2023-07-21)
Date
2021-01-01
Authors
Simon, Benjamin
Major Professor
Alexander Stoytchev
Advisor
Committee Member
Journal Title
Journal ISSN
Volume Title
Publisher
Altmetrics
Authors
Research Projects
Organizational Units
Journal Issue
Series
Department
Electrical and Computer Engineering
Abstract

This study evaluated the accuracy of cloud-based speech recognition systems developed by Apple, Google, Amazon, and Microsoft. Testing was performed using the TIDIGITS data set, the most popular benchmark for small-vocabulary speech recognition. The results demonstrated that most cloud-based systems had surprisingly low accuracy, with the exception of Microsoft. The low recognition accuracy of these systems suggests that the rapid transition to large-vocabulary speech recognition services may have come at the cost of sacrificing performance on small-vocabulary tasks.

Comments
Description
Keywords
Citation
DOI
Source