Scalable data pre processing and curation toolkit for LLMs
-
Updated
Dec 24, 2024 - Jupyter Notebook
Scalable data pre processing and curation toolkit for LLMs
90+ CLI tools to build, browse, and blend your media library: an index for your archive.
Open source project for data preparation of LLM application builders
Wikidata-based scholarly profiles
An image + data web scraper build to crawl the CarMax website and store relevant information for vehicle identification projects.
Exploration and data curation of a dataset given by a Kaggle competition (https://www.kaggle.com/dansbecker/melbourne-housing-snapshot) related to properties that were sold in Melbourne in 2016 and 2017. The meaning of this project is to prepare a well-structured matrix, so it can be used to run a model in order to estimate their prices.
This is a capstone project for the course of Business Analytics and Business Data Management at IIT Madras. The project involves analyzing sales data of Uttam Supermarket in Indore, which has 5 franchises, collected over a year. The analysis includes store-wise and monthly sales, the effect of holidays on sales, and weekly sales analysis.
PostgreSQL code for archaeological data management
Acest repo conține materiale, seturi de date și soluții care au fost folosite în cadrul Școlii de vară Astra, prima ediție, 2021
Add a description, image, and links to the datacuration topic page so that developers can more easily learn about it.
To associate your repository with the datacuration topic, visit your repo's landing page and select "manage topics."