awsls
A list command for AWS resources. It supports listing of over 200 types of resources across 76 different AWS services.https://github.com/jckuester/awsls
The goal is to support every AWS resource that is also covered by Terraform (currently over 500) without adding much code but rather generating it.
How to Be a Financially Conscious Site Reliability Engineer
https://www.cloudzero.com/blog/how-to-be-a-financially-conscious-site-reliability-engineer
https://www.cloudzero.com/blog/how-to-be-a-financially-conscious-site-reliability-engineer
Kubernetes static code analysis with Checkov
https://bridgecrew.io/blog/kubernetes-static-code-analysis-with-checkov
https://bridgecrew.io/blog/kubernetes-static-code-analysis-with-checkov
Syncthing
Syncthing is a continuous file synchronization program. It synchronizes files between two or more computers in real time, safely protected from prying eyes. Your data is your data alone and you deserve to choose where it is stored, whether it is shared with some third party, and how it's transmitted over the internet.https://syncthing.net
Twitter’s Reliability Journey
Part 2: https://www.blameless.com/blog/slo-adoption-twitter
Twitter’s SRE team is one of the most advanced in the industry, managing the services that capture the pulse of the world every single day and throughout the moments that connect us all. We had the privilege of interviewing Brian Brophy, Sr. Staff SRE, Carrie Fernandez, Head of Site Reliability Engineering, JP Doherty, Engineering Manager, and Zac Kiehl, Sr. Staff SRE to learn about how SRE is practiced at Twitter.Part 1: https://www.blameless.com/blog/twitters-reliability-journey
Part 2: https://www.blameless.com/blog/slo-adoption-twitter
PgBouncer on Kubernetes and how to achieve minimal latency
Experiments with connection poolers on Kubernetes for Postgres Operatorhttps://engineering.zalando.com/posts/2020/06/postgresql-connection-poolers.html
Unthrottled: Fixing CPU Limits in the Cloud
Part 2: https://medium.com/indeed-engineering/unthrottled-how-a-valid-fix-becomes-a-regression-f61eabb2fbd9
This year, my teammates and I solved a CPU throttling issue that affects nearly every container orchestrator with hard limits, including Kubernetes, Docker, and Mesos. In doing so, we lowered worst-case response latency in one of Indeed’s applications from over two seconds to 30 milliseconds. In this two-part series, I’ll explain our journey to find the root cause and how we ultimately arrived at the solution.Part 1: https://medium.com/indeed-engineering/unthrottled-fixing-cpu-limits-in-the-cloud-a0995ede8e89
Part 2: https://medium.com/indeed-engineering/unthrottled-how-a-valid-fix-becomes-a-regression-f61eabb2fbd9
KubeDB by AppsCode
KubeDB by AppsCode is a production-grade cloud-native database management solution for Kubernetes. KubeDB simplifies and automates routine database tasks such as provisioning, patching, backup, recovery, failure detection, and repair for various popular databases on private and public clouds. It frees you to focus on your applications so you can give them the fast performance, high availability, security and compatibility they need.https://github.com/kubedb/operator
NAMING APPLICATIONS AND MICROSERVICES
https://srcco.de/posts/naming-applications-components-microservices.html
https://srcco.de/posts/naming-applications-components-microservices.html
Code review checklist for distributed systems
- Define a path for error handlinghttps://www.kislayverma.com/post/code-review-checklist-for-distributed-systems
- Have a plan for recovery
- Always set timeouts on remote system calls
- Retry on timeout
- Use circuit breaker
- Don't handle timeouts like a failure
- Don't invoke remote systems inside transactions
- Use smart batching
- All APIs MUST be idempotent
- Define response time and throughput SLAs explicitly and code to adhere to them
- Define and limit batch APIs
- Think about Observability up-front
- Cache aggressively
- Consider unit of failure
- Isolate external domain objects at the edge of the system
- Sanitize input at every edge
- Never commit credentials
Vector
A lightweight and ultra-fast tool for building observability pipelineshttps://github.com/timberio/vector
ConfigMaps in Kubernetes: how they work and what you should remember
https://medium.com/flant-com/configmaps-in-kubernetes-f9f6d0081dcb
https://medium.com/flant-com/configmaps-in-kubernetes-f9f6d0081dcb
I Found A Painless Way To Manage Secrets In Google Kubernetes Engine
https://hackernoon.com/i-found-a-painless-way-to-manage-secrets-in-google-kubernetes-engine-cs3d3uuz
https://hackernoon.com/i-found-a-painless-way-to-manage-secrets-in-google-kubernetes-engine-cs3d3uuz
How to measure Linux Performance Avoiding Most Typical Mistakes
CPU: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-cpu
Disk: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-disk-storage
Memory: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-memory
Network: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-network
CPU: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-cpu
Disk: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-disk-storage
Memory: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-memory
Network: https://ma.ttias.be/how-to-measure-linux-performance-avoiding-most-typical-mistakes-network
Design review checklist for Distributed Systems
https://www.kislayverma.com/post/design-review-checklist-for-distributed-systems
https://www.kislayverma.com/post/design-review-checklist-for-distributed-systems
Docker and Kubernetes — root vs. privileged
https://itnext.io/docker-and-kubernetes-root-vs-privileged-9d2a37453dec
https://itnext.io/docker-and-kubernetes-root-vs-privileged-9d2a37453dec
Presslabs is the First Managed WordPress Hosting Platform running on Kubernetes
https://www.presslabs.com/blog/presslabs-is-the-first-managed-wordpress-hosting-platform-running-on-kubernetes
https://www.presslabs.com/blog/presslabs-is-the-first-managed-wordpress-hosting-platform-running-on-kubernetes
Verify your Kubernetes Cluster Network Policies: From Faith to Proof
https://blog.nody.cc/posts/2020-06-kubernetes-network-policy-verification
https://blog.nody.cc/posts/2020-06-kubernetes-network-policy-verification