Understanding and improving numerical stability of deep learning algorithms
dc.contributor.advisor | Le, Wei | |
dc.contributor.advisor | Gao, Hongyang | |
dc.contributor.advisor | Liu, Hailiang | |
dc.contributor.advisor | Quinn, Christopher J | |
dc.contributor.advisor | Wang, Zhengdao | |
dc.contributor.author | Kloberdanz, Eliska | |
dc.contributor.department | Department of Computer Science | |
dc.date.accessioned | 2023-06-20T22:18:57Z | |
dc.date.available | 2023-06-20T22:18:57Z | |
dc.date.embargo | 2023-12-20T00:00:00Z | |
dc.date.issued | 2023-05 | |
dc.date.updated | 2023-06-20T22:18:57Z | |
dc.description.abstract | Deep learning (DL) has become an integral part of solutions to various important problems, which is why ensuring the quality of DL systems is essential. One of the challenges of achieving reliability and robustness of DL software is to ensure that algorithm implementations are numerically stable. Numerical stability is a property of numerical algorithms, which governs how changes or errors introduced through inputs or during execution affect the accuracy of algorithm outputs. In numerically unstable algorithms, those errors are magnified and adversely affect the fidelity of algorithm’s outputs via incorrect or inaccurate results. In this thesis we analyze the numerical stability of DL algorithms to better understand and improve numerical stability of DL algorithms. First, we identify and analyze unstable numerical methods and their solutions in DL. Second, we learn assertions on inputs into DL functions that ensure their numerical stability. Third, we focus on neural network quantization, which we found to cause numerical stability issues. Specifically, we propose a new quantization algorithm that optimizes the trade-off between low-bit representation and loss of precision. Next, we focus on analyzing the numerical stability of residual networks by leveraging their dynamic systems interpretation. In particular, we propose that residual networks behave as stiff numerically unstable ordinary differential equations. Finally, we introduce a novel numerically stable solver for neural ordinary differential equations. | |
dc.format.mimetype | ||
dc.identifier.orcid | 0000-0001-7159-2937 | |
dc.identifier.uri | https://dr.lib.iastate.edu/handle/20.500.12876/kv7kJRpv | |
dc.language.iso | en | |
dc.language.rfc3066 | en | |
dc.subject.disciplines | Computer science | en_US |
dc.subject.keywords | artificial intelligence | en_US |
dc.subject.keywords | deep learning | en_US |
dc.subject.keywords | neural networks | en_US |
dc.subject.keywords | numerical analysis | en_US |
dc.subject.keywords | numerical stability | en_US |
dc.subject.keywords | robustness | en_US |
dc.title | Understanding and improving numerical stability of deep learning algorithms | |
dc.type | dissertation | en_US |
dc.type.genre | dissertation | en_US |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | f7be4eb9-d1d0-4081-859b-b15cee251456 | |
thesis.degree.discipline | Computer science | en_US |
thesis.degree.grantor | Iowa State University | en_US |
thesis.degree.level | dissertation | $ |
thesis.degree.name | Doctor of Philosophy | en_US |
File
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Kloberdanz_iastate_0097E_20799.pdf
- Size:
- 2 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 0 B
- Format:
- Item-specific license agreed upon to submission
- Description: