DevOps & SRE notes
11.8K subscribers
27 photos
19 files
2.41K links
Helpfull articles and tools for DevOps&SRE

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
๐Ÿคฃ9๐Ÿ”ฅ5
Author dotdc presents Terraflow, a CI/CD orchestrator designed to scale Terraform operations effectively. This report outlines the creation of the tool and how it helps manage complex infrastructure deployments.
https://medium.com/@dotdc/creating-terraflow-a-ci-cd-orchestrator-to-scale-terraform-3965b3f8931f
๐Ÿ‘1
This analysis provides a deep dive into writing policies for Kubernetes clusters using OPA Gatekeeper. The Permify Tech Blog explains how to enforce custom rules and maintain security and compliance in a cloud-native environment.
https://medium.com/permify-tech-blog/opa-gatekeeper-how-to-write-policies-for-kubernetes-clusters-bb660666eb19
โค1๐Ÿ‘1
AWS just released their postmortem (link in comment) for the October DynamoDB outage. It's thorough, technically detailed, and explains exactly what broke and how they'll "prevent" it from happening again. But this PR-approved, sanitized narrative tells us only what happened to the technology, nothing else.

https://aws.amazon.com/message/101925/
โค2๐Ÿ‘2
Marc Christian P. Gregorio offers a practical commentary on automating centralized NAT Gateways in AWS across multiple VPCs and regions using Terraform. The solution aims to optimize costs and simplify network management for large-scale deployments.
https://medium.com/@marcchristianp.gregorio/automating-centralized-nat-gateways-in-aws-vpcs-and-region-with-terraform-69a6f90d60da
๐Ÿ‘3โค1
Elliot Graebert proposes an impact-based leveling system for engineering organizations as an alternative to traditional career ladders. This treatise discusses how focusing on impact can foster a more motivated and effective engineering culture.
https://medium.com/@elliotgraebert/an-impact-based-level-system-for-engineering-organizations-2e0f9bee20e6
๐Ÿ‘2โค1
This article from JP Gouin provides a deep dive into implementing GitOps at scale, with a specific focus on the cluster bootstrapping process. It covers the challenges and solutions for managing numerous Kubernetes clusters efficiently and declaratively.
https://medium.com/@jp-gouin/gitops-at-scale-clusters-bootstrapping-f36695d4340d
โค2
This edition of the Scalable Thread newsletter breaks down effective strategies for handling sudden and unexpected bursts of traffic to your systems. It explores architectural patterns and techniques to ensure reliability and prevent service degradation during traffic spikes.
https://newsletter.scalablethread.com/p/how-to-handle-sudden-bursts-of-traffic
๐Ÿ‘3
This write-up from Prezi Engineering explains how multi-AZ deployments can lead to surprisingly high data transfer costs. It documents their journey of migrating from a costly self-hosted Prometheus setup to a more efficient monitoring solution to save on their cloud budget.
https://engineering.prezi.com/how-using-availability-zones-can-eat-up-your-budget-our-journey-from-prometheus-to-be8a816f7efe
๐Ÿ‘1
Author Yasin Taha Erol provides a practical guide on migrating from Kubernetes' native Horizontal Pod Autoscaler (HPA) to KEDA. The text highlights the benefits of KEDA's event-driven scaling and walks through the steps for a smooth transition.
https://yasintahaerol.medium.com/migrating-hpa-to-keda-13e946ee29ee
๐Ÿ‘2
This tutorial offers an interesting approach to container image distribution by using S3 as a private container registry. The author demonstrates how to set up and use an S3 bucket for storing and pulling images, providing a simple alternative to dedicated registry services.
https://ochagavia.nl/blog/using-s3-as-a-container-registry/
๐Ÿ‘1