Using the Eclipse IDE to Develop Large-Scale Data Mining Programs in Boa
Boa is a domain-specific computer programming language used to query the Boa infrastructure, a big data infrastructure developed by Dr. Hridesh Rajan and others at Iowa State University. Boa infrastructure contains historic data from around 700,000 open source software projects that makes it possible to analyze software and its evolution. While the Boa language eases these tasks, there does not exist a development environment that eases the writing of Boa programs. Programmers that wish to verify the syntactic correctness of their algorithm must submit the query to the Boa servers and wait for a compile status. Having to do this every time a program is written or a change is made significantly increases the amount of time it takes to develop sound programs. To ease the development of Boa programs, we have created an Eclipse IDE plugin for Boa. Features include, but are not limited to, the ability to locally manage their online profile, view source code, delete jobs, and check compilation/execution status. Additionally, users can get live syntax/semantics checks for their programs prior to sending their job request. A possible future for the project would be to compile Boa programs locally and send that product to the Boa servers.