Leveraging data characteristics for bug localization in deep learning programs

dc.contributor.advisor Rajan, Hridesh
dc.contributor.advisor Prabhu, Gurpur
dc.contributor.advisor Mitra, Simanta
dc.contributor.author Manke, Ruchira
dc.contributor.department Department of Computer Science
dc.date.accessioned 2024-10-15T22:11:54Z
dc.date.available 2024-10-15T22:11:54Z
dc.date.issued 2024-08
dc.date.updated 2024-10-15T22:11:56Z
dc.description.abstract Deep Learning (DL) is a class of machine learning algorithms that are used in a wide variety of applications. Like any software system, DL programs may have bugs. To support bug localization in DL programs, several tools have been proposed in the past. As most of the bugs that occur due to improper model structure known as structural bugs lead to inadequate performance during training, it is challenging for developers to identify the root cause and address these bugs. To support bug detection and localization in DL programs, in this work, we propose Theia, which detects and localizes structural bugs in DL programs. Unlike the previous works, Theia considers the training dataset characteristics to automatically detect bugs in DL programs developed using two deep learning libraries, Keras and PyTorch. Since training the DL models is a time-consuming process, Theia detects these bugs at the beginning of the training process and alerts the developer with informative messages containing the bug’s location and actionable fixes which will help them to improve the structure of the model. We evaluated Theia on a benchmark of 40 real-world buggy DL programs obtained from Stack Overflow. Our results show that Theia successfully localizes 57/75 structural bugs in 40 buggy programs, whereas NeuraLint, a state-of-the-art approach capable of localizing structural bugs before training localizes 17/75 bugs.
dc.format.mimetype PDF
dc.identifier.doi https://doi.org/10.31274/td-20250502-192
dc.identifier.orcid 0009-0007-4729-8421
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/ywAbM1nv
dc.language.iso en
dc.language.rfc3066 en
dc.subject.disciplines Computer science en_US
dc.subject.keywords bug localization en_US
dc.subject.keywords debugging en_US
dc.subject.keywords deep learning en_US
dc.subject.keywords program analysis en_US
dc.title Leveraging data characteristics for bug localization in deep learning programs
dc.type thesis en_US
dc.type.genre thesis en_US
dspace.entity.type Publication
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
thesis.degree.discipline Computer science en_US
thesis.degree.grantor Iowa State University en_US
thesis.degree.level thesis $
thesis.degree.name Master of Science en_US
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Manke_iastate_0097M_21623.pdf
Size:
634.18 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: