-
Notifications
You must be signed in to change notification settings - Fork 2
Project Structure
This project consists of several program files. The following list shows how data flows through the program, starting from the user.
This class takes in an entire directory of copied files and another directory of original files. It then iterates over each copied file and compares it against all original files. The resulting score is then stored in a CV file for viewing later. The directory can either be specified by the user, or the default one that comes with the project is used.
This class is the driver program for the project.
This class has a single static method checkWholeFile()
that is called in every iteration of the Test.java
file. It works by splitting the copied file into a list of sentences and comparing each sentence to the original text. The number of words plagiarized is returned for that sentence, and they are summed up for all the sentences. The resulting value is then divided by the total number of words in the copied string. This is the percentage of plagiarized content.
This class contains an implementation of the Jaccard algorithm. The method that performs the computation accepts the plagiarized string. The original string is expected to be stored in the attribute originalSentence
because the WholeFileChecker.java
would otherwise have to send the same original text in for all sentences in the plagiarized text.