On accelerating ultra-large-scale mining

dc.contributor.author Upadhyaya, Ganesha
dc.contributor.author Rajan, Hridesh
dc.contributor.author Rajan, Hridesh
dc.contributor.department Computer Science
dc.date 2019-09-22T06:22:13.000
dc.date.accessioned 2020-06-30T01:54:27Z
dc.date.available 2020-06-30T01:54:27Z
dc.date.copyright Sun Jan 01 00:00:00 UTC 2017
dc.date.embargo 2017-09-19
dc.date.issued 2017-01-01
dc.description.abstract <p>Ultra-large-scale mining has been shown to be useful for a number of software engineering tasks e.g. mining specifications, defect prediction. We propose a new research direction for accelerating ultra-large-scale mining that goes beyond parallelization. Our key idea is to analyze the interaction pattern between the mining task and the artifact to cluster artifacts such that running the mining task on one candidate artifact from each cluster is sufficient to produce results for other artifacts in the same cluster. Our artifact clustering criteria go beyond syntactic, semantic, and functional similarities to mining-task-specific similarity, where the interaction pattern between the mining task and the artifact is used for clustering. Our preliminary evaluation demonstrates that our technique significantly reduces the overall mining time.</p>
dc.description.comments <p>This is a manuscript of a proceeding published as Upadhyaya, Ganesha, and Hridesh Rajan. "On accelerating ultra-large-scale mining." In <em>Proceedings of the 39th International Conference on Software Engineering: New Ideas and Emerging Results Track</em>, pp. 39-42. IEEE Press, 2017. doi: <a href="https://doi.org/10.1109/ICSE-NIER.2017.11" target="_self">10.1109/ICSE-NIER.2017.11</a>. Posted with permission.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/cs_conf/18/
dc.identifier.articleid 1003
dc.identifier.contextkey 10767807
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath cs_conf/18
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/19823
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/cs_conf/18/2017_Rajan_AcceleratingUltra.pdf|||Fri Jan 14 21:34:23 UTC 2022
dc.source.uri 10.1109/ICSE-NIER.2017.11
dc.subject.disciplines Computer Sciences
dc.subject.disciplines Software Engineering
dc.title On accelerating ultra-large-scale mining
dc.type article
dc.type.genre conference
dspace.entity.type Publication
relation.isAuthorOfPublication 4e3f4631-9a99-4a4d-ab81-491621e94031
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
File
Original bundle
Now showing 1 - 1 of 1
Name:
2017_Rajan_AcceleratingUltra.pdf
Size:
288.13 KB
Format:
Adobe Portable Document Format
Description: