Software Project, Computer Science

SearchSECO: A Search Engine for the Worldwide Software Ecosystem

In this software project, we created a distributed database system containing abstracted code fragments pulled from code hosting websites such as GitHub, together with a distributed job infrastructure that lets worker nodes connect to the network and pull jobs that add data to the database. Our final product consists of a distributed database in which the code fragments are stored alongside a queue of jobs. The database is accessed through a custom database API. The Controller program handles communication with the user, the database API, and the other parts of the system. Researchers and software engineers can use this system to check code for plagiarism, license issues, and known vulnerabilities.
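
To make the job flow concrete, here is a minimal single-process sketch in Python of a worker pulling jobs from a queue and storing fingerprints of abstracted code fragments. It is an illustration only, not the SearchSECO implementation: every name in it (`FragmentStore`, `abstract_fragment`, `fingerprint`, `run_worker`) is hypothetical, the in-memory store stands in for the distributed database and its API, and the abstraction step (dropping `#` comments and collapsing whitespace) is a deliberately crude assumption about what "abstracted" could mean.

```python
import hashlib
import re
from collections import deque

# All names below are hypothetical; none are taken from the SearchSECO codebase.


def abstract_fragment(code: str) -> str:
    """Crude abstraction (an assumption): drop '#' comments, collapse whitespace.

    Note that this also strips '#' characters inside string literals.
    """
    without_comments = re.sub(r"#[^\n]*", "", code)
    return " ".join(without_comments.split())


def fingerprint(code: str) -> str:
    """Hash the abstracted fragment so equivalent fragments share one key."""
    return hashlib.sha256(abstract_fragment(code).encode("utf-8")).hexdigest()


class FragmentStore:
    """In-memory stand-in for the distributed database and its job queue."""

    def __init__(self):
        self.jobs = deque()   # pending (source_url, code) jobs
        self.fragments = {}   # fingerprint -> set of source URLs

    def add_job(self, source_url, code):
        self.jobs.append((source_url, code))

    def pull_job(self):
        """Return the oldest pending job, or None when the queue is empty."""
        return self.jobs.popleft() if self.jobs else None

    def upload(self, key, source_url):
        self.fragments.setdefault(key, set()).add(source_url)

    def lookup(self, code):
        """Return every known source whose abstraction matches this fragment."""
        return self.fragments.get(fingerprint(code), set())


def run_worker(store):
    """Pull jobs until the queue is empty; return how many were processed."""
    processed = 0
    while (job := store.pull_job()) is not None:
        source_url, code = job
        store.upload(fingerprint(code), source_url)
        processed += 1
    return processed
```

In this sketch, two fragments that differ only in comments and whitespace map to the same fingerprint, so a later `lookup` on either one returns both sources. That matching step is the kind of check a plagiarism, license, or vulnerability search would build on.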