DevOps&SRE Library
17.8K subscribers
458 photos
4 videos
2 files
4.75K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://knd.gov.ru/license?id=67704b536aa9672b963777b3&registryType=bloggersPermission
Download Telegram
Syncthing

Syncthing is a continuous file synchronization program. It synchronizes files between two or more computers in real time, safely protected from prying eyes. Your data is your data alone and you deserve to choose where it is stored, whether it is shared with some third party, and how it's transmitted over the internet.

https://syncthing.net
Twitter’s Reliability Journey

Twitter’s SRE team is one of the most advanced in the industry, managing the services that capture the pulse of the world every single day and throughout the moments that connect us all. We had the privilege of interviewing Brian Brophy, Sr. Staff SRE, Carrie Fernandez, Head of Site Reliability Engineering, JP Doherty, Engineering Manager, and Zac Kiehl, Sr. Staff SRE to learn about how SRE is practiced at Twitter.

Part 1: https://www.blameless.com/blog/twitters-reliability-journey

Part 2: https://www.blameless.com/blog/slo-adoption-twitter
PgBouncer on Kubernetes and how to achieve minimal latency

Experiments with connection poolers on Kubernetes for Postgres Operator

https://engineering.zalando.com/posts/2020/06/postgresql-connection-poolers.html
Unthrottled: Fixing CPU Limits in the Cloud

This year, my teammates and I solved a CPU throttling issue that affects nearly every container orchestrator with hard limits, including Kubernetes, Docker, and Mesos. In doing so, we lowered worst-case response latency in one of Indeed’s applications from over two seconds to 30 milliseconds. In this two-part series, I’ll explain our journey to find the root cause and how we ultimately arrived at the solution.

Part 1: https://medium.com/indeed-engineering/unthrottled-fixing-cpu-limits-in-the-cloud-a0995ede8e89

Part 2: https://medium.com/indeed-engineering/unthrottled-how-a-valid-fix-becomes-a-regression-f61eabb2fbd9
KubeDB by AppsCode

KubeDB by AppsCode is a production-grade cloud-native database management solution for Kubernetes. KubeDB simplifies and automates routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair for various popular databases on private and public clouds. It frees you to focus on your applications so you can give them the fast performance, high availability, security and compatibility they need.

https://github.com/kubedb/operator
How SLIs Help You Understand Users' Needs

https://www.blameless.com/blog/slis-understand-users-needs
Code review checklist for distributed systems

- Define a path for error handling
- Have a plan for recovery
- Always set timeouts on remote system calls
- Retry on timeout
- Use circuit breaker
- Don't handle timeouts like a failure
- Don't invoke remote systems inside transactions
- Use smart batching
- All APIs MUST be idempotent
- Define response time and throughput SLAs explicitly and code to adhere to them
- Define and limit batch APIs
- Think about Observability up-front
- Cache aggressively
- Consider unit of failure
- Isolate external domain objects at the edge of the system
- Sanitize input at every edge
- Never commit credentials

https://www.kislayverma.com/post/code-review-checklist-for-distributed-systems
Vector

A lightweight and ultra-fast tool for building observability pipelines

https://github.com/timberio/vector
ConfigMaps in Kubernetes: how they work and what you should remember

https://medium.com/flant-com/configmaps-in-kubernetes-f9f6d0081dcb
I Found A Painless Way To Manage Secrets In Google Kubernetes Engine

https://hackernoon.com/i-found-a-painless-way-to-manage-secrets-in-google-kubernetes-engine-cs3d3uuz
Presslabs is the First Managed WordPress Hosting Platform running on Kubernetes

https://www.presslabs.com/blog/presslabs-is-the-first-managed-wordpress-hosting-platform-running-on-kubernetes
Verify your Kubernetes Cluster Network Policies: From Faith to Proof

https://blog.nody.cc/posts/2020-06-kubernetes-network-policy-verification
Install a Kubernetes load balancer on your Raspberry Pi homelab with MetalLB

https://opensource.com/article/20/7/homelab-metallb
Introducing Frigate

A documentation generation tool for Kubernetes Helm Charts

https://medium.com/rapids-ai/introducing-frigate-a-documentation-generation-tool-for-kubernetes-1791854031a1