A framework for building the home of your microservices.
Updated Aug 27, 2020 - Java
CoCo: Coordinated Container Scheduling with Last-Level Cache and Memory Bandwidth Partitioning
This GitHub repository contains a comprehensive tutorial on Site Reliability Engineering (SRE), covering topics such as SLAs, SLOs, SLIs, Chaos Engineering, monitoring, alerting, and much more. It also includes bonus content on SRE best practices. Follow along with the #100daysofSRE challenge and improve your reliability engineering skills.
SLOpPy - an SLO demonstration on a simple Python API that designs pizza toppings based on Wikipedia articles
Calculate the tolerable downtime of your service
Calculate SLI/SLO metrics from ZMON's timeseries data
Calculate how much downtime should be permitted in your Service Level Agreement or Objective
SLOs, error windows, and alerts are complicated. Here is an attempt to make them easy.
Slo-exporter computes standardized SLI and SLO metrics based on events coming from various data sources.
Easily manage an application's SLIs and SLOs alongside the application lifecycle inside a Kubernetes cluster
A curated list of Site Reliability and Production Engineering Tools
🦥 Easy and simple Prometheus SLO (service level objectives) generator
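
Several of the projects above calculate the tolerable downtime for a given availability target. As a rough illustration of that arithmetic, here is a minimal sketch that is not taken from any of the listed repositories. The function name `allowed_downtime` and its parameters are hypothetical, and it assumes the SLO is a simple availability percentage measured over a fixed window.

```python
from datetime import timedelta


def allowed_downtime(slo_percent: float, window: timedelta) -> timedelta:
    """Return the downtime budget for an availability target over a window.

    Hypothetical helper: assumes the SLO is a plain availability
    percentage, so the budget is the remaining fraction of the window.
    """
    if not 0 <= slo_percent <= 100:
        raise ValueError("slo_percent must be between 0 and 100")
    # The error budget is the share of the window that may be unavailable.
    return window * (1 - slo_percent / 100)


if __name__ == "__main__":
    # Print the budget for a few common targets over a 30-day window.
    for target in (99.0, 99.9, 99.99):
        budget = allowed_downtime(target, timedelta(days=30))
        minutes = budget.total_seconds() / 60
        print(f"{target}% over 30 days allows about {minutes:.1f} minutes of downtime")
```

For a 99.9% target over 30 days, this works out to roughly 43 minutes of permitted downtime. Real tools typically also account for request-based SLIs and rolling windows, which this sketch leaves out.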