Migrating Critical Traffic At Scale with No Downtime
Part 1: https://netflixtechblog.com/migrating-critical-traffic-at-scale-with-no-downtime-part-1-ba1c7a1c7835
Part 2: https://netflixtechblog.medium.com/migrating-critical-traffic-at-scale-with-no-downtime-part-2-4b1c8c7155c1
Top metrics for Elasticsearch monitoring with Prometheus
https://sysdig.com/blog/elasticsearch-monitoring
CPU requests and limits in Kubernetes
In Kubernetes, what should I use as CPU requests and limits?
https://community.ops.io/danielepolencic/cpu-requests-and-limits-in-kubernetes-ock
BACK TO TERRAFORM
In this blog post, we’ll discuss 3 different IaC tools: Terraform, Pulumi and Terragrunt. We’ll discuss two real-world cases where Pulumi and Terragrunt were replaced by Terraform. We’ll explain why they weren’t the correct fit and what lessons we learned from using them.
https://ordina-jworks.github.io/cloud/2023/06/05/back-to-terraform.html
VictoriaLogs
VictoriaLogs is a log management and log analytics system from VictoriaMetrics.
https://github.com/VictoriaMetrics/VictoriaMetrics/releases/tag/v0.1.0-victorialogs
Infrastructure as Code (IaC) — keeping Terraform configuration DRY with Terragrunt
https://medium.com/otto-tech/infrastructure-as-code-iac-keeping-terraform-configuration-dry-with-terragrunt-bdb33bdac907
openobserve
OpenObserve is a cloud native observability platform built specifically for logs, metrics, traces and analytics, designed to work at petabyte scale.
https://github.com/openobserve/openobserve
An Alerting strategy for the cloud
There aren't many articles out there on alerting strategies. I found that out when I was developing one myself to implement a robust alerting system. It's been a couple of years since then and not much has changed. Some gems of knowledge on alerting remain in books but are not widely published on the internet. This article is an attempt to address that gap.
https://abstraction.blog/2023/06/13/cloud-alerting-strategy
Replace a Dockerfile with Go (or Python, or Node.js)
https://docs.dagger.io/205271/replace-dockerfile
Manage Redis on AWS from Kubernetes
Using AWS Controllers for Kubernetes and CDK for Kubernetes
https://itnext.io/manage-redis-on-aws-from-kubernetes-eeadba7eb889
K8s Workflow Management for Software Developers Using Argo Workflows
https://medium.com/riskified-technology/k8s-workflow-management-for-software-developers-using-argo-workflows-1e5247d2c4a6
Scaling Amazon EKS and Cassandra Beyond 1,000 Nodes
https://aws.amazon.com/blogs/containers/scaling-amazon-eks-and-cassandra-beyond-1000-nodes
Helm Release Time-To-Live(TTL)⏳💀 for Temporary Environments
https://dev.to/rtpro/helm-release-time-to-livettl-for-temporary-environments-1239
Running Kubernetes jobs with sidecar containers
https://medium.com/@abhinav.ittekot/running-kubernetes-jobs-with-sidecar-containers-8c034b020993
Getting data into and scaling for Billions of records with ClickHouse
https://medium.com/@shamsul.arefin/evaluating-the-performance-of-clickhouse-with-amplab-big-data-benchmark-dataset-on-kubernetes-b36e860ba027
Restricting cluster-admin permissions
https://marcusnoble.co.uk/2022-01-20-restricting-cluster-admin-permissions
How to use Kubernetes ephemeral volumes & storage
https://www.airplane.dev/blog/kubernetes-ephemeral-storage
Istio service mesh, a start to finish tutorial with Side Car architecture and an analysis + comparison of the Ambient mesh architecture
https://natarajsundar.medium.com/istio-service-mesh-a-start-to-finish-tutorial-with-side-car-architecture-and-an-analysis-d70a255ea41d
krateo
Krateo Platformops is an open source tool platform that gives users the capability to create any desired resource on basically any infrastructure they'd like. Be it a K8s cluster, microservice, application, pipeline, database or anything else, Krateo has got your back.
https://github.com/krateoplatformops/krateo