Engaging developers in open source software projects: harnessing social and technical data mining to improve software development

dc.contributor.advisor Judy M. Vance
dc.contributor.author Carlson, Patrick
dc.contributor.department Mechanical Engineering
dc.date 2018-08-11T18:11:47.000
dc.date.accessioned 2020-06-30T02:58:17Z
dc.date.available 2020-06-30T02:58:17Z
dc.date.copyright Thu Jan 01 00:00:00 UTC 2015
dc.date.embargo 2001-01-01
dc.date.issued 2015-01-01
dc.description.abstract <p>As software development has evolved, an increasing amount of collaboration and management is done online. Open source software, in particular, has benefited greatly from communication and collaboration on the Internet. As software projects increase in size, the codebase complexity and required communication between developers increases. The barriers of entry for development participation are not only technical in nature but involve understanding the changing dynamics of the community.</p> <p>Social Technical Congruence (STC) attempts to understand and model the synergies between technical development and communication. Motivated by this theory, three algorithms were developed that leverage data from version control history and email mailing list communication to help developers better understand the community. The “code impact” algorithm identifies commits that are impactful to the overall technical structure of the source code. The “developer knowledge” algorithm calculates the knowledge that a developer or developers have for a particular section of code. Lastly, the “communication suggestions” algorithm provides suggestions about who a developer could be increasing communication with on the mailing list based on shared technical dependencies and recent communication. These algorithms were implemented in an online website called “Jamii”. Through an interactive front-end, the website provides relevant development-centric information to the community. The website was evaluated with Mozilla and KDE, two large open source communities.</p> <p>Quantitative and qualitative analysis provide details about the socialization process before the website was developed through an initial survey as well as afterwards. The results from the initial survey paint a primarily positive picture of socialization and inclusion for KDE and a more complex process of socialization and inclusion for Mozilla because of the large size of its community and codebase.</p> <p>In many cases for both communities, the ratings on a number of factors regarding the socialization process and effective coordination were actually higher for the beginner participants as compared to the veterans. This suggests that new participants may have an overly-inflated positive view</p> <p>than veteran members. This work provides fundamental contributions to the field of STC and may generalize to other distributed online collaboration efforts.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/etd/14663/
dc.identifier.articleid 5670
dc.identifier.contextkey 8052026
dc.identifier.doi https://doi.org/10.31274/etd-180810-4215
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath etd/14663
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/28848
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/etd/14663/Carlson_iastate_0097E_15059.pdf|||Fri Jan 14 20:24:17 UTC 2022
dc.subject.disciplines Computer Engineering
dc.subject.disciplines Computer Sciences
dc.subject.disciplines Library and Information Science
dc.subject.keywords Human Computer Interaction
dc.subject.keywords algorithms
dc.subject.keywords data mining
dc.subject.keywords human-computer interaction
dc.subject.keywords open source
dc.subject.keywords social technical congruence
dc.subject.keywords software development
dc.title Engaging developers in open source software projects: harnessing social and technical data mining to improve software development
dc.type dissertation
dc.type.genre dissertation
dspace.entity.type Publication
relation.isOrgUnitOfPublication 6d38ab0f-8cc2-4ad3-90b1-67a60c5a6f59
thesis.degree.discipline Human Computer Interaction
thesis.degree.level dissertation
thesis.degree.name Doctor of Philosophy
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Carlson_iastate_0097E_15059.pdf
Size:
3.91 MB
Format:
Adobe Portable Document Format
Description: