Skip to content
#

dataset-generator

Here are 47 public repositories matching this topic...

This repository hosts a comprehensive suite for graph-based entity summarization dataset generating from user-selected Wikipedia pages. Utilizing a series of interconnected modules, it leverages Wikidata and Wikipedia dumps to construct a dataset, alongside auto-generated ground truths.

  • Updated Jun 24, 2024
  • Python
ImageFromTextGenerator

IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.

  • Updated Sep 10, 2024
  • Python

Improve this page

Add a description, image, and links to the dataset-generator topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dataset-generator topic, visit your repo's landing page and select "manage topics."

Learn more