Improving the Resiliency of Our Infrastructure DNS Zone
https://blog.cloudflare.com/improving-the-resiliency-of-our-infrastructure-dns-zone
https://blog.cloudflare.com/improving-the-resiliency-of-our-infrastructure-dns-zone
Root Cause Analysis For Reliability: A Case Study
https://medium.com/last9/root-cause-analysis-for-reliability-a-case-study-8a987ed3a31c
https://medium.com/last9/root-cause-analysis-for-reliability-a-case-study-8a987ed3a31c
10 Best Tools to Monitor SSL Certificate Expiry, Validity & Change
https://sematext.com/blog/ssl-certificate-monitoring
https://sematext.com/blog/ssl-certificate-monitoring
Pulumi Import: Generate IaC for Existing Cloud Resources
https://www.pulumi.com/blog/pulumi-import-generate-iac-for-existing-cloud-resources
https://www.pulumi.com/blog/pulumi-import-generate-iac-for-existing-cloud-resources
Generic mitigations
A philosophy of duct-tape outage resolution
https://www.oreilly.com/content/generic-mitigationsEverything You Know About MongoDB is Wrong!
Myth 0: MongoDB is Web Scalehttps://developer.mongodb.com/article/everything-you-know-is-wrong
Myth 1: MongoDB is on v3.2
Myth 2: MongoDB is a JSON Database
Myth 3: MongoDB Doesn't Support Transactions
Myth 4: MongoDB Doesn't Support Relationships
Myth 5: MongoDB is All About Sharding
Myth 6: MongoDB is Insecure
Myth 7: MongoDB Loses Data
Myth 8: MongoDB is Easy
ethr
Ethr is a cross platform network performance measurement tool written in golang. The goal of this project is to provide a native tool for comprehensive network performance measurements of bandwidth, connections/s, packets/s, latency, loss & jitter, across multiple protocols such as TCP, UDP, HTTP, HTTPS, and across multiple platforms such as Windows, Linux and other Unix systems.https://github.com/microsoft/ethr
Headlamp
Headlamp is an easy-to-use and extensible Kubernetes web UI.https://github.com/kinvolk/headlamp
The Boring Option
Migrating Segment Efforts Storage at Strava.https://medium.com/strava-engineering/the-boring-option-4a7c6ad16ab8
cloudquery
cloudquery exposes your cloud configuration and metadata as sql tables, providing powerful analysis and monitoring without writing code.https://github.com/cloudquery/cloudquery
How to monitor multi-cloud Kubernetes with Prometheus and Grafana
https://inlets.dev/blog/2020/12/15/multi-cluster-monitoring.html
https://inlets.dev/blog/2020/12/15/multi-cluster-monitoring.html
Qovery Engine
Qovery Engine is an open-source abstraction layer library that turns easy apps deployment on AWS, GCP, Azure, and other Cloud providers in just a few minutes. The Qovery Engine is written in Rust and takes advantage of Terraform, Helm, Kubectl, and Docker to manage resources.https://github.com/Qovery/engine
rga
rga is a line-oriented search tool that allows you to look for a regex in a multitude of file types. rga wraps the awesome ripgrep and enables it to search in pdf, docx, sqlite, jpg, movie subtitles (mkv, mp4), etc.https://github.com/phiresky/ripgrep-all
Linux Hardening Guide
Linux is not a secure operating system. However, there are steps you can take to improve it. This guide aims to explain how to harden Linux as much as possible for security and privacy. This guide attempts to be distribution-agnostic and is not tied to any specific one.https://madaidans-insecurities.github.io/guides/linux-hardening.html
Active-Active PostgreSQL Federation on Kubernetes
What if I told you that you can create an out-of-the-box active-active, federated PostgreSQL cluster on Kubernetes?https://info.crunchydata.com/blog/active-active-postgres-federation-on-kubernetes
How to Trace Linux System Calls in Production with Minimal Impact on Performance
https://pingcap.com/blog/how-to-trace-linux-system-calls-in-production-with-minimal-impact-on-performance
https://pingcap.com/blog/how-to-trace-linux-system-calls-in-production-with-minimal-impact-on-performance
Tips for On Call Engineers During the Holidays
1. Share the love (or spread the pain) when organizing on call shifts, and incentivize communal behavior.https://www.transposit.com/blog/2019.12.23-tips-for-on-call-engineers-during-the-holidays
2. Communicate early and often, with and without runbooks.
3. Plan around potential travel problems
4. Let friendly allies help you manage the social side of the situation
5. Pat yourself and your team on the back