Skip to content

Project Structure

ABusyProgrammer edited this page Nov 28, 2019 · 4 revisions

This project consists of several program files. The following list shows how data flows through the program, starting from the user.

Structure of Code files

Tester Program: Test.java

This class takes in an entire directory of copied files and another directory of original files. It then iterates over each copied file and compares it against all original files. The resulting score is then stored in a CV file for viewing later. The directory can either be specified by the user, or the default one that comes with the project is used.

This class is the driver program for the project.

Whole File Checker WholeFileChecker.java

This class has a single static method checkWholeFile() that is called in every iteration of the Test.java file. It works by splitting the copied file into a list of sentences and comparing each sentence to the original text. The number of words plagiarized is returned for that sentence, and they are summed up for all the sentences. The resulting value is then divided by the total number of words in the copied string. This is the percentage of plagiarized content.

Jaccard Index Implementation Jaccard.java

This class contains an implementation of the Jaccard algorithm. The method that performs the computation accepts the plagiarized string. The original string is expected to be stored in the attribute originalSentence because the WholeFileChecker.java would otherwise have to send the same original text in for all sentences in the plagiarized text.

Tech and Tools Stack

  • This project was developed using Java (version 11).
  • The development environment used was IntelliJ IDEA
  • Git was used for version control
  • GitHub was used to track issues and for its Kanban board.
Clone this wiki locally