DevOps & SRE notes
11.7K subscribers
27 photos
19 files
2.4K links
Helpfull articles and tools for DevOps&SRE

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
This article from JP Gouin provides a deep dive into implementing GitOps at scale, with a specific focus on the cluster bootstrapping process. It covers the challenges and solutions for managing numerous Kubernetes clusters efficiently and declaratively.
https://medium.com/@jp-gouin/gitops-at-scale-clusters-bootstrapping-f36695d4340d
โค2
This edition of the Scalable Thread newsletter breaks down effective strategies for handling sudden and unexpected bursts of traffic to your systems. It explores architectural patterns and techniques to ensure reliability and prevent service degradation during traffic spikes.
https://newsletter.scalablethread.com/p/how-to-handle-sudden-bursts-of-traffic
๐Ÿ‘3
This write-up from Prezi Engineering explains how multi-AZ deployments can lead to surprisingly high data transfer costs. It documents their journey of migrating from a costly self-hosted Prometheus setup to a more efficient monitoring solution to save on their cloud budget.
https://engineering.prezi.com/how-using-availability-zones-can-eat-up-your-budget-our-journey-from-prometheus-to-be8a816f7efe
๐Ÿ‘1
Author Yasin Taha Erol provides a practical guide on migrating from Kubernetes' native Horizontal Pod Autoscaler (HPA) to KEDA. The text highlights the benefits of KEDA's event-driven scaling and walks through the steps for a smooth transition.
https://yasintahaerol.medium.com/migrating-hpa-to-keda-13e946ee29ee
๐Ÿ‘2
This tutorial offers an interesting approach to container image distribution by using S3 as a private container registry. The author demonstrates how to set up and use an S3 bucket for storing and pulling images, providing a simple alternative to dedicated registry services.
https://ochagavia.nl/blog/using-s3-as-a-container-registry/
๐Ÿ‘1
This technical report from Datadog offers a deep dive into managing storage for etcd, the key-value store at the heart of Kubernetes. It explains the causes of database growth and provides strategies for monitoring, defragmenting, and purging old data to maintain a healthy cluster.
https://www.datadoghq.com/blog/managing-etcd-storage/
๐Ÿ‘1
In this story from the Betterstack newsletter, learn how Dropbox managed to save millions of dollars by optimizing its object storage architecture. The piece delves into the technical decisions and engineering efforts behind their impressive cost-reduction initiative.
https://newsletter.betterstack.com/p/how-dropbox-saved-millions-of-dollars
๐Ÿ”ฅ1
This extensive handbook serves as a go-to resource for troubleshooting common and complex issues within Kubernetes. It's packed with practical advice, commands, and methodologies to help engineers diagnose and resolve problems in their clusters.
https://itnext.io/the-kubernetes-troubleshooting-handbook-7596a1fdf2ff
๐Ÿ‘2๐Ÿ”ฅ1
This commentary by Techielass provides a step-by-step walkthrough of building a CI/CD pipeline for Terraform using GitHub Actions. It demonstrates how to automate infrastructure deployments safely and efficiently, incorporating best practices like planning and approval steps.
https://www.techielass.com/terraform-with-github-actions-ci-cd-pipeline/
๐Ÿ‘3๐Ÿ‘Ž1๐Ÿคฃ1
Forwarded from DevOps & SRE notes (tutunak)
Looking for a hosting platform to practice with Linux, Kubernetes, etc.? Register using my referral link on DigitalOcean and get $200 in credit for 60 days. By registering through my referral link, you also support this Telegram channel.

๐Ÿ‘‰ Register
โค3
This post from Chainguard Unchained introduces the concept of audited least privilege as a critical security measure for the software supply chain. It explains how this principle helps verify that components only have the permissions they strictly need to function.
https://www.chainguard.dev/unchained/audited-least-privilege
๐Ÿ‘1๐Ÿ’ฏ1
In this unique piece, author explores the interesting and often overlooked capabilities of GitRepo volumes in Kubernetes. The content details some fun experiments and practical applications for dynamically providing content to pods directly from a Git repository.
https://raesene.github.io/blog/2024/07/10/Fun-With-GitRepo-Volumes/
๐Ÿ‘1
This opinionated report argues that Large Language Models (LLMs) are not the ultimate solution for complex socio-technical problems in the SRE and operations space. It cautions against over-reliance on AI, emphasizing the continued need for human expertise and critical thinking.
https://blog.relyabilit.ie/llms-wont-save-us/
๐Ÿ‘1
Martin Atkins's latest study presents a clever technique for handling "ephemeral values" in Terraform, which are values needed during a plan but should not be stored in the state. The method helps manage dynamic or sensitive data that is only relevant for a single operation.
https://log.martinatkins.me/2024/05/22/terraform-ephemeral-values/
๐Ÿ‘1