Interact, analyze and structure massive text, image, embedding, audio and video datasets
-
Updated
Dec 13, 2024 - Python
Interact, analyze and structure massive text, image, embedding, audio and video datasets
WinDirStat is a disk usage statistics viewer and cleanup tool for Microsoft Windows
Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)
A plugin that does one thing only: Detect and manage duplicate items in Zotero.
Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.
Filter, Sort & Delete Duplicate Files Recursively
⚡ Check your npm modules for unused and duplicate dependencies fast
Interactive code for image similarity using SIFT algorithm
The Panako acoustic fingerprinting system.
Vidupe is a program that can find duplicate and similar video files. V1.211 released on 2019-09-18, Windows exe here:
CLI utility to find near duplicate images and remove all but the best copy.
Fast Near-Duplicate Image Search and Delete using pHash, t-SNE and KDTree.
Find similar audio files easily
Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.
Easily delete your YouTube Music library (and manage playlists)
CLI tool that fast checks if your bundle contains multiple versions of the same package, only by looking in package.json.
A collection of free-text bug reports for duplicate issue identification
Command-line program for managing a media collection, with focus on Content-Based Image Retrieval (Computer Vision) methods for finding duplicates.
Duplicates finder for various source code formats.
Detecting near-duplicate videos by aggregating features from intermediate CNN layers
Add a description, image, and links to the duplicate-detection topic page so that developers can more easily learn about it.
To associate your repository with the duplicate-detection topic, visit your repo's landing page and select "manage topics."