DevOps&SRE Library
17.8K subscribers
459 photos
4 videos
2 files
4.75K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://knd.gov.ru/license?id=67704b536aa9672b963777b3&registryType=bloggersPermission
Download Telegram
Why Our Pods Were Breaking Bad (and How We Fixed Them)

In this article, we’ll walk through the process of diagnosing a memory leak, analyzing the root cause, and implementing effective solutions to mitigate its impact. We’ll explore practical steps that any application, regardless of the underlying stack or architecture, can follow to troubleshoot and optimize performance.


https://kshitij-nawandar.medium.com/why-our-pods-were-breaking-bad-and-how-we-fixed-them-b3c3e9e8003b
Operational Considerations for Managing Stateful Workloads

When managing stateful workloads, whether in Kubernetes or traditional infrastructure, operational concerns like isolation, lifecycle management, security, disaster recovery, scalability, and observability take center stage. While the examples focus on AWS, PostgreSQL, and Kubernetes, the principles and best practices discussed here are broadly applicable to any environment. This article approaches these topics from an operations perspective, prioritizing reliability, maintainability, and resilience. The goal is not just to run a database, but to ensure it operates efficiently, scales properly, and remains secure in real-world conditions. We’ll explore key aspects of running stateful workloads, from managing failure domains to ensuring observability, and how these impact both operations teams and developers. Whether you’re running a database in a cloud-native setup or on bare metal, these strategies will help you build a robust, well-managed system.


https://dev.to/pampatzoglou/operational-considerations-for-managing-stateful-workloads-20c3
Can Configuration Languages (config DSLs) solve configuration complexity?

https://itnext.io/can-configuration-languages-dsls-solve-configuration-complexity-eee8f124e13a
Why Every Platform Engineer Should Care About Kubernetes Operators

https://www.pulumi.com/blog/why-every-platform-engineer-should-care-about-kubernetes-operators
kpatch

kpatch is a Linux dynamic kernel patching infrastructure which allows you to patch a running kernel without rebooting or restarting any processes. It enables sysadmins to apply critical security patches to the kernel immediately, without having to wait for long-running tasks to complete, for users to log off, or for scheduled reboot windows. It gives more control over uptime without sacrificing security or stability.


https://github.com/dynup/kpatch
Understanding the Circuit Breaker: A Key Design Pattern for Resilient Systems

The Circuit Breaker Pattern is a key design pattern for building resilient systems by preventing cascading failures and ensuring graceful degradation.


https://dzone.com/articles/circuit-breaker-pattern-resilient-systems
Load Testing with Impulse at Airbnb

Comprehensive Load Testing with Load Generator, Dependency Mocker, Traffic Collector, and More


https://medium.com/airbnb-engineering/load-testing-with-impulse-at-airbnb-f466874d03d2
unregistry

Unregistry is a lightweight container image registry that stores and serves images directly from your Docker daemon's storage.


https://github.com/psviderski/unregistry
terrafetch

Terrafetch is the Neofetch of Terraform—because your infrastructure deserves a little flair. It scans your Terraform repository and displays key statistics like the number of variables, resources, modules, outputs, and more—all in a stylish, terminal-friendly format. Perfect for CLI screenshots, repo intros, or just flexing your infra hygiene.


https://github.com/RoseSecurity/terrafetch
How Kubernetes Runs Containers : A Practical Deep Dive

Taking a deep dive into how Kubernetes runs containers as Linux processes


https://blog.esc.sh/kubernetes-containers-linux-processes
tigrisfs

We're proud to announce the immediate availability of tigrisfs, the native filesystem interface for Tigris. This lets you mount Tigris buckets to your laptops, desktops, and servers so you can use data in your buckets as if it was local. This bridges the gap between the cloud and your machine.


https://www.tigrisdata.com/blog/tigrisfs
octelium

Octelium is a free and open source, self-hosted, unified platform for zero trust resource access that is primarily meant to be a modern alternative to remote access VPNs and similar tools.


https://github.com/octelium/octelium