shingling
Here are 12 public repositories matching this topic...
Implementing Locality Sensitive Hashing for DNA Sequences.
-
Updated
Nov 29, 2020 - Python
Code for Shingling
-
Updated
Jun 12, 2024 - Python
Finding Similar Items: Textually Similar Documents
-
Updated
Nov 14, 2018 - TypeScript
A Java program to check Plagiarisms between multiple documents using the method of Shingling, MinHashing and Locality Sensitive Hashing.
-
Updated
Jul 31, 2020 - Java
Implementation of algorithms for big data using python, numpy, pandas.
-
Updated
Apr 27, 2020 - Python
Finding Similar Items: Textually Similar Documents
-
Updated
Sep 14, 2022 - Jupyter Notebook
Data Mining Algorithms
-
Updated
Dec 5, 2018 - Roff
-
Updated
Dec 26, 2019 - Python
Duplicate Detection on Hoaxy Dataset
-
Updated
Dec 4, 2018 - Jupyter Notebook
Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.
-
Updated
Aug 28, 2023 - Python
Improve this page
Add a description, image, and links to the shingling topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the shingling topic, visit your repo's landing page and select "manage topics."