DevOps&SRE Library
17.8K subscribers
462 photos
4 videos
2 files
4.75K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://knd.gov.ru/license?id=67704b536aa9672b963777b3&registryType=bloggersPermission
Download Telegram
PgBouncer on Kubernetes and how to achieve minimal latency

Experiments with connection poolers on Kubernetes for Postgres Operator

https://engineering.zalando.com/posts/2020/06/postgresql-connection-poolers.html
Unthrottled: Fixing CPU Limits in the Cloud

This year, my teammates and I solved a CPU throttling issue that affects nearly every container orchestrator with hard limits, including Kubernetes, Docker, and Mesos. In doing so, we lowered worst-case response latency in one of Indeed’s applications from over two seconds to 30 milliseconds. In this two-part series, I’ll explain our journey to find the root cause and how we ultimately arrived at the solution.

Part 1: https://medium.com/indeed-engineering/unthrottled-fixing-cpu-limits-in-the-cloud-a0995ede8e89

Part 2: https://medium.com/indeed-engineering/unthrottled-how-a-valid-fix-becomes-a-regression-f61eabb2fbd9
KubeDB by AppsCode

KubeDB by AppsCode is a production-grade cloud-native database management solution for Kubernetes. KubeDB simplifies and automates routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair for various popular databases on private and public clouds. It frees you to focus on your applications so you can give them the fast performance, high availability, security and compatibility they need.

https://github.com/kubedb/operator
How SLIs Help You Understand Users' Needs

https://www.blameless.com/blog/slis-understand-users-needs
Code review checklist for distributed systems

- Define a path for error handling
- Have a plan for recovery
- Always set timeouts on remote system calls
- Retry on timeout
- Use circuit breaker
- Don't handle timeouts like a failure
- Don't invoke remote systems inside transactions
- Use smart batching
- All APIs MUST be idempotent
- Define response time and throughput SLAs explicitly and code to adhere to them
- Define and limit batch APIs
- Think about Observability up-front
- Cache aggressively
- Consider unit of failure
- Isolate external domain objects at the edge of the system
- Sanitize input at every edge
- Never commit credentials

https://www.kislayverma.com/post/code-review-checklist-for-distributed-systems
Vector

A lightweight and ultra-fast tool for building observability pipelines

https://github.com/timberio/vector
ConfigMaps in Kubernetes: how they work and what you should remember

https://medium.com/flant-com/configmaps-in-kubernetes-f9f6d0081dcb
I Found A Painless Way To Manage Secrets In Google Kubernetes Engine

https://hackernoon.com/i-found-a-painless-way-to-manage-secrets-in-google-kubernetes-engine-cs3d3uuz
Presslabs is the First Managed WordPress Hosting Platform running on Kubernetes

https://www.presslabs.com/blog/presslabs-is-the-first-managed-wordpress-hosting-platform-running-on-kubernetes
Verify your Kubernetes Cluster Network Policies: From Faith to Proof

https://blog.nody.cc/posts/2020-06-kubernetes-network-policy-verification
Install a Kubernetes load balancer on your Raspberry Pi homelab with MetalLB

https://opensource.com/article/20/7/homelab-metallb
Introducing Frigate

A documentation generation tool for Kubernetes Helm Charts

https://medium.com/rapids-ai/introducing-frigate-a-documentation-generation-tool-for-kubernetes-1791854031a1
Minimum Viable Kubernetes

So just for fun, let's see what the absolute bare minimum "Kubernetes cluster" actually looks like.

https://eevans.co/blog/minimum-viable-kubernetes