Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
-
Updated
Dec 13, 2023 - Python
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
Run greatexpectations.io on ANY SQL Engine using REST API. Supported by FastAPI, Pydantic and SQLAlchemy as best data quality tool
🦆 Blazing Fast and highly customizable Github Action to setup a DuckDb runtime
SQL based data profiling & data quality checks, which will help you to perform data profiling & data quality checks on SQL database at table & database level.
dbt Datasphere Plugin is for integrating multiple open-source data quality frameworks into your dbt projects. It unifies Soda SQL, Great Expectations, Datafold, providing a single interface to configure and run data quality checks.
This project involves a comprehensive analysis to determine the top YouTubers in the UK for 2024, Using Excel, SQL and Power BI.
This project extracts and cleans raw YouTube data from excel-csv (Kaggle) through SQL and identifies the top-performing UK-based Influencers. Data Stack: Excel | Microsoft SQL Server | Power BI
Ramblings of a curious mind
KPMG Data Analytics Consulting Virtual Internship
An Apache Airflow data pipeline is designed to perform ELT operations, utilizing Amazon S3 and Amazon Redshift Serverless.
Add a description, image, and links to the dataqualitycheck topic page so that developers can more easily learn about it.
To associate your repository with the dataqualitycheck topic, visit your repo's landing page and select "manage topics."