- Getting an input from the user
- Detects language used
- Search google for top 20 most relevant articles
- Scrape url's webpage content for comparison
- Calculate Features using Term Frequency-Inverse Document Frequency Vectorization
- Calculate Cosine Similarity of two feature vectors
- Returning similarity matrix for each url loop
- Calculate for max, average, and min value for the plagiarism rate
You can try it tho..., but you'd need API Key for Google CSE and the CSE ID itself
Try my CSE! it's filtered to only search for academic domains only
For the API key, you can make it yourself via the google cloud console!
CSE_SID = "2214f184193cd4d49"
API_KEY = ""
- Creating Concept Program
- Deciding Better Calculation and Search Method
- Creating Base Program
- Implementing Flask API
- Searching for Better Calculation Method
- Searching for Better Search Method
- Implementing Academic Literature Classification Option