Understanding and improving numerical stability of deep learning algorithms

dc.contributor.advisor Le, Wei
dc.contributor.advisor Gao, Hongyang
dc.contributor.advisor Liu, Hailiang
dc.contributor.advisor Quinn, Christopher J
dc.contributor.advisor Wang, Zhengdao
dc.contributor.author Kloberdanz, Eliska
dc.contributor.department Department of Computer Science
dc.date.accessioned 2023-06-20T22:18:57Z
dc.date.available 2023-06-20T22:18:57Z
dc.date.embargo 2023-12-20T00:00:00Z
dc.date.issued 2023-05
dc.date.updated 2023-06-20T22:18:57Z
dc.description.abstract Deep learning (DL) has become an integral part of solutions to various important problems, which is why ensuring the quality of DL systems is essential. One of the challenges of achieving reliability and robustness of DL software is to ensure that algorithm implementations are numerically stable. Numerical stability is a property of numerical algorithms, which governs how changes or errors introduced through inputs or during execution affect the accuracy of algorithm outputs. In numerically unstable algorithms, those errors are magnified and adversely affect the fidelity of algorithm’s outputs via incorrect or inaccurate results. In this thesis we analyze the numerical stability of DL algorithms to better understand and improve numerical stability of DL algorithms. First, we identify and analyze unstable numerical methods and their solutions in DL. Second, we learn assertions on inputs into DL functions that ensure their numerical stability. Third, we focus on neural network quantization, which we found to cause numerical stability issues. Specifically, we propose a new quantization algorithm that optimizes the trade-off between low-bit representation and loss of precision. Next, we focus on analyzing the numerical stability of residual networks by leveraging their dynamic systems interpretation. In particular, we propose that residual networks behave as stiff numerically unstable ordinary differential equations. Finally, we introduce a novel numerically stable solver for neural ordinary differential equations.
dc.format.mimetype PDF
dc.identifier.orcid 0000-0001-7159-2937
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/kv7kJRpv
dc.language.iso en
dc.language.rfc3066 en
dc.subject.disciplines Computer science en_US
dc.subject.keywords artificial intelligence en_US
dc.subject.keywords deep learning en_US
dc.subject.keywords neural networks en_US
dc.subject.keywords numerical analysis en_US
dc.subject.keywords numerical stability en_US
dc.subject.keywords robustness en_US
dc.title Understanding and improving numerical stability of deep learning algorithms
dc.type dissertation en_US
dc.type.genre dissertation en_US
dspace.entity.type Publication
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
thesis.degree.discipline Computer science en_US
thesis.degree.grantor Iowa State University en_US
thesis.degree.level dissertation $
thesis.degree.name Doctor of Philosophy en_US
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Kloberdanz_iastate_0097E_20799.pdf
Size:
2 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: