DevOps&SRE Library

How we upgraded PostgreSQL at GitLab.com

We explain the precise maintenance process to execute a major version upgrade of PostgreSQL.

https://about.gitlab.com/blog/2020/09/11/gitlab-pg-upgrade

3.49K views16:00

DevOps&SRE Library

Kubernetes YAML Generator

https://k8syaml.com

4.06K views07:00

DevOps&SRE Library

IPVS Node Controller

ipvs-node-controller is the kubernetes controller that solves External-IP (Load Balancer IP) issue with IPVS proxy mode. IPVS proxy mode has various problems, and one of them is that the External-IP assigned through the LoadBalancer type service with externalTrafficPolicy=Local option cannot access inside the cluster.

https://github.com/kakao/ipvs-node-controller

3.32K views16:00

DevOps&SRE Library

Inside the Hidden World of Legacy IT Systems

How and why we spend trillions to keep old software going

https://spectrum.ieee.org/computing/it/inside-hidden-world-legacy-it-systems

3.22K views07:00

DevOps&SRE Library

A simple web UI for managing Kubernetes deployments

Kubeonoff is a small web UI that allows to quickly stop/start/restart pods.

https://github.com/GambitResearch/kubeonoff

3.3K views16:00

DevOps&SRE Library

infracost

Infracost shows hourly and monthly cost estimates for a Terraform project. This helps developers, DevOps et al. quickly see the cost breakdown and compare different deployment options upfront.

https://github.com/infracost/infracost

3.32K views07:00

DevOps&SRE Library

Istio ingress controller as an API gateway

https://banzaicloud.com/blog/backyards-api-gateway

3.26K views16:00

DevOps&SRE Library

Managed Kubernetes Services Compared: GKE vs. EKS vs. AKS

Comparing the three most popular managed Kubernetes platforms in features and overall experience

https://medium.com/better-programming/managed-kubernetes-services-compared-gke-vs-eks-vs-aks-df1ecb22bba0

3.25K views07:00

DevOps&SRE Library

The Ten Commandments of Container Security

1. Source base image from trusted repositories
2. Install verified packages
3. Minimize attack surface in the Image
4. Do not bake secrets in the image
5. Use of Secure Private or Public Registries
6. Do not use privileged or root user to run the application in a container
7. Implement image vulnerability scanning in CI/CD
8. Enable kernel security profiles like AppArmor
9. Secure centralized and remote logging
10. Deploy runtime security monitoring

https://www.infracloud.io/blogs/top-10-things-for-container-security

3.67K views16:00

DevOps&SRE Library

Database migrations lessons learned

- Keep your migration scripts away from your production code.
- Keep it low-tech, don’t deserialize.
- Write tests to exercise each migration script individually.
- Consider running long migrations online.
- Consider versioning your documents.

https://octopus.com/blog/database-migrations-lessons-learned

4.05K views07:00

DevOps&SRE Library

Helm 3 Umbrella Charts & Standalone Chart Image Tags — An Alternative Approach

https://itnext.io/helm-3-umbrella-charts-standalone-chart-image-tags-an-alternative-approach-78a218d74e2d

3.44K views16:00

DevOps&SRE Library

Replacing ngrok with ktunnel

Ktunnel is a CLI tool that establishes a reverse tunnel between a kubernetes cluster and your local machine. It lets you expose your machine as a service in the cluster or expose it to a specific deployment.

https://mbuffett.com/posts/ktunnel-ngrok-replace

3.26K views07:00

DevOps&SRE Library

Capsule

This project provides a custom operator for implementing a strong multi-tenant environment in Kubernetes. Capsule is not intended to be yet another PaaS, instead, it has been designed as a lightweight tool with a minimalist approach leveraging only the standard features of upstream Kubernetes.

https://github.com/clastix/capsule

3.2K views16:00

DevOps&SRE Library

GitHub CLI 1.0 is now available

https://github.blog/2020-09-17-github-cli-1-0-is-now-available

3.55K views06:00

DevOps&SRE Library

NS Killer

A Kubernetes project to kill all namespace living over X times. Quite useful when auto-generated development environments on the fly and give them a lifecycle out-of-the-box from Kubernetes or even Helm.

https://github.com/germainlefebvre4/ns-killer

3.56K views07:00

DevOps&SRE Library

Velero backup/restore for K8s Stateful Applications managed by Operators

https://medium.com/@Sandeepkallazhi/velero-backup-restore-for-k8s-stateful-applications-managed-by-operators-8fd9c732ffcc

3.3K views16:00

DevOps&SRE Library

A Picture of Java in 2020

https://blog.jetbrains.com/idea/2020/09/a-picture-of-java-in-2020

3.43K views06:00

DevOps&SRE Library

ckad-prep-notes

List of resources and notes for passing the Certified Kubernetes Application Developer (CKAD) exam.

https://github.com/twajr/ckad-prep-notes

3.7K views07:00

DevOps&SRE Library

Manage all your Kubernetes clusters with Anthos attached clusters

https://cloud.google.com/blog/topics/anthos/getting-to-know-anthos-attached-clusters

3.14K views16:00

DevOps&SRE Library

SRE Classroom: Distributed PubSub

SRE Classroom: Distributed PubSub is a workshop developed by Google’s Site Reliability Engineering group. The goals of this workshop are to (1) introduce participants to the principles of non-abstract large systems design (NALSD), and (2) provide hands-on experiences with applying these principles to the design and evaluation of these systems. We consider NALSD a concept fundamental to SRE, and understanding its principles provides a basis for having meaningful conversations about the design and operation of large software systems.

In the first theoretical part of the workshop, participants learn about some foundational large system design principles and concepts. Topics include correctness, reliability, performance, different inter-system communication styles, and more. We introduce the problem requirements in detail and walk through the first parts of an example solution.

The practical part of this workshop asks participants to apply the principles they have learned to develop a Publish-Subscribe system that meets certain performance and correctness requirements and Service Level Objectives (SLOs).

The workshop concludes with a detailed example solution, as well as a discussion of the system’s inputs and SLOs.

https://landing.google.com/sre/resources/practicesandprocesses/sre-classroom

3.44K views06:00

DevOps&SRE Library

awstaghelper

awstaghelper allow tagging hundreds of AWS resources in few commands

https://github.com/mpostument/awstaghelper

3.14K views07:00

About

Blog

Apps

Platform