RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
Updated
Nov 7, 2024 - Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
a delightful machine learning tool that allows you to train, test, and use models without writing code
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
MLBox is a powerful Automated Machine Learning python library.
Automated Time Series Forecasting
NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.
Audio processing by using pytorch 1D convolution network
A Deep Learning Python Toolkit for Healthcare Applications.
Collection of various algorithms implemented in R.
High performance model preprocessing library on PyTorch
✔️Contextual word checker for better suggestions (not actively maintained)
Deal with bad samples in your dataset dynamically, use Transforms as Filters, and more!
A curated list of awesome CAE frameworks, libraries and software.
🎯 Personal data science and machine learning toolbox
A full pipeline AutoML tool for tabular data
Introduction to time series preprocessing and forecasting in Python using AR, MA, ARMA, ARIMA, SARIMA and Prophet model with forecast evaluation.
Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku
Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.
Add a description, image, and links to the preprocessing topic page so that developers can more easily learn about it.
To associate your repository with the preprocessing topic, visit your repo's landing page and select "manage topics."