DevOps & SRE notes
11.8K subscribers
27 photos
19 files
2.42K links
Helpfull articles and tools for DevOps&SRE

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
This technical report from Datadog offers a deep dive into managing storage for etcd, the key-value store at the heart of Kubernetes. It explains the causes of database growth and provides strategies for monitoring, defragmenting, and purging old data to maintain a healthy cluster.
https://www.datadoghq.com/blog/managing-etcd-storage/
πŸ‘1
In this story from the Betterstack newsletter, learn how Dropbox managed to save millions of dollars by optimizing its object storage architecture. The piece delves into the technical decisions and engineering efforts behind their impressive cost-reduction initiative.
https://newsletter.betterstack.com/p/how-dropbox-saved-millions-of-dollars
πŸ”₯1
This extensive handbook serves as a go-to resource for troubleshooting common and complex issues within Kubernetes. It's packed with practical advice, commands, and methodologies to help engineers diagnose and resolve problems in their clusters.
https://itnext.io/the-kubernetes-troubleshooting-handbook-7596a1fdf2ff
πŸ‘2πŸ”₯1
This commentary by Techielass provides a step-by-step walkthrough of building a CI/CD pipeline for Terraform using GitHub Actions. It demonstrates how to automate infrastructure deployments safely and efficiently, incorporating best practices like planning and approval steps.
https://www.techielass.com/terraform-with-github-actions-ci-cd-pipeline/
πŸ‘3πŸ‘Ž1🀣1
Forwarded from DevOps & SRE notes (tutunak)
Looking for a hosting platform to practice with Linux, Kubernetes, etc.? Register using my referral link on DigitalOcean and get $200 in credit for 60 days. By registering through my referral link, you also support this Telegram channel.

πŸ‘‰ Register
❀3
This post from Chainguard Unchained introduces the concept of audited least privilege as a critical security measure for the software supply chain. It explains how this principle helps verify that components only have the permissions they strictly need to function.
https://www.chainguard.dev/unchained/audited-least-privilege
πŸ‘1πŸ’―1
In this unique piece, author explores the interesting and often overlooked capabilities of GitRepo volumes in Kubernetes. The content details some fun experiments and practical applications for dynamically providing content to pods directly from a Git repository.
https://raesene.github.io/blog/2024/07/10/Fun-With-GitRepo-Volumes/
πŸ‘1
This opinionated report argues that Large Language Models (LLMs) are not the ultimate solution for complex socio-technical problems in the SRE and operations space. It cautions against over-reliance on AI, emphasizing the continued need for human expertise and critical thinking.
https://blog.relyabilit.ie/llms-wont-save-us/
πŸ‘1
Martin Atkins's latest study presents a clever technique for handling "ephemeral values" in Terraform, which are values needed during a plan but should not be stored in the state. The method helps manage dynamic or sensitive data that is only relevant for a single operation.
https://log.martinatkins.me/2024/05/22/terraform-ephemeral-values/
πŸ‘1
A terminal-based LDAP server explorer built with Go and BubbleTea, providing an interactive interface for browsing LDAP directory trees, viewing records, and executing custom queries.

https://github.com/ericschmar/moribito
πŸ‘2
This in-depth article by Henrik Gerdes benchmarks various container runtime interfaces (CRIs) for Kubernetes. It provides a detailed comparison of runc, crun, gvisor, and youki, focusing on performance and memory consumption.
https://henrikgerdes.me/blog/2024-07-kubernetes-cri-bench/
πŸ‘2
kubectl-validate is a SIG-CLI subproject to support the local validation of resources for native Kubernetes types and CRDs.

https://github.com/kubernetes-sigs/kubectl-validate
❀‍πŸ”₯1
This write-up from incident.io introduces the "Incident Maturity Model," a framework for evaluating and improving an organization's incident management processes. The model outlines three stages: Centralized, Distributed, and Democratized, offering a roadmap for growth.
https://incident.io/blog/the-incident-maturity-model
πŸ‘1