DevOps&SRE Library
17.8K subscribers
460 photos
4 videos
2 files
4.75K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://knd.gov.ru/license?id=67704b536aa9672b963777b3&registryType=bloggersPermission
Download Telegram
ioping

A tool to monitor I/O latency in real time. It shows disk latency in the same way as ping shows network latency.

https://github.com/koct9i/ioping
Managing my personal server in 2020

This document is going to describe how I manage my personal server in 2020. It will talk about

- Management of secrets with SOPS and a GPG key
- Automatic management of DNS record
- Configuration of Debian and installation of Kubernetes k3s
- Setup Nginx ingress with let's encrypt for automatic TLS certificate
- Deployment of postfix + dovecot for deploying an email server
- Install Nextcloud to get your personal cloud in the sky
- Putting backup in place
- Using Wireguard to create a private network and WsTunnel to bypass firewalls
- Adding a Raspberry Pi to the K3s cluster

https://github.com/erebe/personal-server
Why Linkerd doesn't use Envoy

In short: Linkerd doesn't use Envoy because using Envoy wouldn't allow us to build the lightest, simplest, and most secure Kubernetes service mesh in the world.

https://linkerd.io/2020/12/03/why-linkerd-doesnt-use-envoy
Introducing another free CA as an alternative to Let's Encrypt

ZeroSSL.com is now joining the (sadly) very small group of awesome CAs giving away free, 90-day certs via ACME.

https://scotthelme.co.uk/introducing-another-free-ca-as-an-alternative-to-lets-encrypt
What I Wish I Knew About Incident Management

In this post, I would like to share the incident management practices I have picked up over the years as an SRE at Linkedin that help me keep calm under pressure and effectively drive incidents to resolution.

https://ronaknathani.com/blog/2020/11/what-i-wish-i-knew-about-incident-management
Mitigate Connection Leaks in Production via Proxies

https://reliability.substack.com/p/mitigate-connection-leaks-in-production
Systems Design Fundamentals

The most important ingredient to success in systems design interviews is having extensive knowledge in the areas of distributed computing, reliability engineering, information storage, and systems architecture.

Our design fundamentals course is intelligently organized into 25 modules, each covering a key concept and building upon the previous one. The result is a guided, comprehensive education that equips you with all the tools you need to successfully navigate—and ace—any systems design interview.

https://www.algoexpert.io/systems/product
Cloudflare’s privacy-first Web Analytics is now available for everyone

https://blog.cloudflare.com/privacy-first-web-analytics
How to Make Your Code Reviewer Fall in Love with You

1. Review your own code first
2. Write a clear changelist description
3. Automate the easy stuff
4. Answer questions with the code itself
5. Narrowly scope changes
6. Separate functional and non-functional changes
7. Break up large changelists
8. Respond graciously to critiques
9. Be patient when your reviewer is wrong
10. Communicate your responses explicitly
11. Artfully solicit missing information
12. Award all ties to your reviewer
13. Minimize lag between rounds of review

https://mtlynch.io/code-review-love
How to Do Code Reviews Like a Human

- Let computers do the boring parts
- Settle style arguments with a style guide
- Start reviewing immediately
- Start high level and work your way down
- Be generous with code examples
- Never say “you”
- Frame feedback as requests, not commands
- Tie notes to principles, not opinions
- Aim to bring the code up a letter grade or two
- Limit feedback on repeated patterns
- Respect the scope of the review
- Look for opportunities to split up large reviews
- Offer sincere praise
- Grant approval when remaining fixes are trivial
- Handle stalemates proactively

https://mtlynch.io/human-code-reviews-1
https://mtlynch.io/human-code-reviews-2
School of SRE

In this course, we are focusing on building strong foundational skills. The course is structured in a way to provide more real life examples and how learning each of these topics can play an important role in day to day SRE life.

https://linkedin.github.io/school-of-sre
Everything You Always Wanted To Know About GitHub (But Were Afraid To Ask)

We prepared a dataset from the GH Archive that contains all the events in all GitHub repositories since 2011 in structured format. The dataset was uploaded into ClickHouse, where it contains 3.1 billion records. We redistribute it for research purposes and it can be downloaded at this direct link. This dataset can help answer almost any question about GitHub that you can imagine.

This article shows the usage of the dataset and offers insights into the GitHub ecosystem. It emphasizes how easy it is to play with data using modern tools.

https://gh.clickhouse.tech/explorer/
New from Universe 2020: Dark mode, GitHub Sponsors for companies, and more

With all the news coming out of GitHub Universe today we wanted to give you a quick summary of all the announcements and timelines for the features being shown off this week.

https://github.blog/2020-12-08-new-from-universe-2020-dark-mode-github-sponsors-for-companies-and-more
Command Line Interface Guidelines

An open-source guide to help you write better command-line programs, taking traditional UNIX principles and updating them for the modern day.

https://clig.dev
The DevOps Reading List: Choosing your next DevOps book

https://octopus.com/blog/devops-reading-list
k6

k6 is a modern load testing tool, building on Load Impact's years of experience in the load and performance testing industry. It provides a clean, approachable scripting API, local and cloud execution, and flexible configuration.

https://github.com/loadimpact/k6