DevOps&SRE Library
17.8K subscribers
462 photos
4 videos
2 files
4.75K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://knd.gov.ru/license?id=67704b536aa9672b963777b3&registryType=bloggersPermission
Download Telegram
How we upgraded PostgreSQL at GitLab.com

We explain the precise maintenance process to execute a major version upgrade of PostgreSQL.

https://about.gitlab.com/blog/2020/09/11/gitlab-pg-upgrade
Kubernetes YAML Generator

https://k8syaml.com
IPVS Node Controller

ipvs-node-controller is the kubernetes controller that solves External-IP (Load Balancer IP) issue with IPVS proxy mode. IPVS proxy mode has various problems, and one of them is that the External-IP assigned through the LoadBalancer type service with externalTrafficPolicy=Local option cannot access inside the cluster.

https://github.com/kakao/ipvs-node-controller
Inside the Hidden World of Legacy IT Systems

How and why we spend trillions to keep old software going

https://spectrum.ieee.org/computing/it/inside-hidden-world-legacy-it-systems
A simple web UI for managing Kubernetes deployments

Kubeonoff is a small web UI that allows to quickly stop/start/restart pods.

https://github.com/GambitResearch/kubeonoff
infracost

Infracost shows hourly and monthly cost estimates for a Terraform project. This helps developers, DevOps et al. quickly see the cost breakdown and compare different deployment options upfront.

https://github.com/infracost/infracost
Istio ingress controller as an API gateway

https://banzaicloud.com/blog/backyards-api-gateway
Managed Kubernetes Services Compared: GKE vs. EKS vs. AKS

Comparing the three most popular managed Kubernetes platforms in features and overall experience

https://medium.com/better-programming/managed-kubernetes-services-compared-gke-vs-eks-vs-aks-df1ecb22bba0
The Ten Commandments of Container Security

1. Source base image from trusted repositories
2. Install verified packages
3. Minimize attack surface in the Image
4. Do not bake secrets in the image
5. Use of Secure Private or Public Registries
6. Do not use privileged or root user to run the application in a container
7. Implement image vulnerability scanning in CI/CD
8. Enable kernel security profiles like AppArmor
9. Secure centralized and remote logging
10. Deploy runtime security monitoring

https://www.infracloud.io/blogs/top-10-things-for-container-security
Database migrations lessons learned

- Keep your migration scripts away from your production code.
- Keep it low-tech, don’t deserialize.
- Write tests to exercise each migration script individually.
- Consider running long migrations online.
- Consider versioning your documents.

https://octopus.com/blog/database-migrations-lessons-learned
Helm 3 Umbrella Charts & Standalone Chart Image Tags — An Alternative Approach

https://itnext.io/helm-3-umbrella-charts-standalone-chart-image-tags-an-alternative-approach-78a218d74e2d
Replacing ngrok with ktunnel

Ktunnel is a CLI tool that establishes a reverse tunnel between a kubernetes cluster and your local machine. It lets you expose your machine as a service in the cluster or expose it to a specific deployment.

https://mbuffett.com/posts/ktunnel-ngrok-replace
Capsule

This project provides a custom operator for implementing a strong multi-tenant environment in Kubernetes. Capsule is not intended to be yet another PaaS, instead, it has been designed as a lightweight tool with a minimalist approach leveraging only the standard features of upstream Kubernetes.

https://github.com/clastix/capsule
NS Killer

A Kubernetes project to kill all namespace living over X times. Quite useful when auto-generated development environments on the fly and give them a lifecycle out-of-the-box from Kubernetes or even Helm.

https://github.com/germainlefebvre4/ns-killer
ckad-prep-notes

List of resources and notes for passing the Certified Kubernetes Application Developer (CKAD) exam.

https://github.com/twajr/ckad-prep-notes
Manage all your Kubernetes clusters with Anthos attached clusters

https://cloud.google.com/blog/topics/anthos/getting-to-know-anthos-attached-clusters
SRE Classroom: Distributed PubSub

SRE Classroom: Distributed PubSub is a workshop developed by Google’s Site Reliability Engineering group. The goals of this workshop are to (1) introduce participants to the principles of non-abstract large systems design (NALSD), and (2) provide hands-on experiences with applying these principles to the design and evaluation of these systems. We consider NALSD a concept fundamental to SRE, and understanding its principles provides a basis for having meaningful conversations about the design and operation of large software systems.

In the first theoretical part of the workshop, participants learn about some foundational large system design principles and concepts. Topics include correctness, reliability, performance, different inter-system communication styles, and more. We introduce the problem requirements in detail and walk through the first parts of an example solution.

The practical part of this workshop asks participants to apply the principles they have learned to develop a Publish-Subscribe system that meets certain performance and correctness requirements and Service Level Objectives (SLOs).

The workshop concludes with a detailed example solution, as well as a discussion of the system’s inputs and SLOs.

https://landing.google.com/sre/resources/practicesandprocesses/sre-classroom
awstaghelper

awstaghelper allow tagging hundreds of AWS resources in few commands

https://github.com/mpostument/awstaghelper