Ploomber is the fastest way to build data pipelines ⚡️. Use your favorite editor (Jupyter, VSCode, PyCharm) to develop interactively and deploy ☁️ without code changes (Kubernetes, Airflow, AWS Batch, and SLURM). Do you have legacy notebooks? Refactor them into modular pipelines with a single command.
Compatible with Python 3.7 and higher.
Install with pip:
pip install ploomber
Or with conda:
conda install ploomber -c conda-forge
A simple YAML API to get started quickly, a powerful Python API for total flexibility.
Automatically cache your pipeline’s previous results and only re-compute tasks that have changed since your last execution.
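To illustrate the idea behind incremental builds, here is a minimal sketch in plain Python. It is not Ploomber's implementation or API: the `IncrementalRunner` class, the task tuples, and the in-memory hash store are all hypothetical, and the task code is only hashed, never executed. It assumes a task is outdated when its source changed since the last run or when any of its upstream tasks was just re-run.

```python
import hashlib


class IncrementalRunner:
    """Toy runner (hypothetical): re-runs a task only if its code or an upstream task changed."""

    def __init__(self):
        # task name -> hash of the task's source code from the last run
        self.stored_hashes = {}

    def build(self, tasks):
        """tasks: ordered list of (name, code, upstream_names). Returns the names executed."""
        executed = []
        for name, code, upstream in tasks:
            digest = hashlib.sha256(code.encode()).hexdigest()
            outdated = (
                self.stored_hashes.get(name) != digest
                or any(up in executed for up in upstream)
            )
            if outdated:
                # a real pipeline would run the task and save its product here
                self.stored_hashes[name] = digest
                executed.append(name)
        return executed


runner = IncrementalRunner()
pipeline = [
    ("load", "df = read('raw.csv')", []),
    ("clean", "df = df.dropna()", ["load"]),
    ("plot", "df.plot()", ["clean"]),
]

first = runner.build(pipeline)   # nothing cached yet, so every task runs
second = runner.build(pipeline)  # nothing changed, so nothing runs

# editing "clean" invalidates it and everything downstream of it
pipeline[1] = ("clean", "df = df.fillna(0)", ["load"])
third = runner.build(pipeline)
```

In this sketch, `load` is skipped on the third build because neither its code nor anything upstream of it changed, which is where the shorter iteration cycles come from.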
Run as a shell script on a single machine, or distribute execution across Kubernetes, Airflow, AWS Batch, or SLURM.
Bring your old monolithic notebooks, and we’ll automatically convert them into maintainable, modular pipelines.
I want to migrate my notebook.
- Documentation
- PyData Chicago talk (covers motivation and demo)
- Develop and deploy an ML pipeline in 30 minutes (EuroPython 2021)
- Guest blog post on the official Jupyter blog
- Examples (Machine Learning pipeline, ETL, among others)
- Blog
- Comparison with other tools
- More videos
Ploomber is a large community of data enthusiasts pushing the boundaries of Data Science and Machine Learning tooling.
Whatever your skillset is, you can contribute to our mission. So whether you're a beginner or an experienced professional, you're welcome to join us on this journey!