DevOps & SRE notes – Telegram

DevOps & SRE notes

@devops_sre_notes

11.8K subscribers

27 photos

19 files

2.41K links

Helpfull articles and tools for DevOps&SRE

For paid consultation (RU/EN), contact: @tutunak

All ways to support https://telegra.ph/How-support-the-channel-02-19

Download Telegram

About

Blog

Apps

Platform

DevOps & SRE notes

11.8K subscribers

DevOps & SRE notes

Ably's post explains their "four pillars" engineering principle, which is designed to ensure their systems have no ceiling on scale. This philosophy guides their architecture to handle massive, unpredictable, and complex realtime workloads.
https://ably.com/blog/ablys-four-pillars-no-scale-ceiling

Ably's four pillars: no scale ceiling

1.28K viewstutunak, 08:03

DevOps & SRE notes

A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet.

https://github.com/shshemi/tabiew

GitHub - shshemi/tabiew: A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet.

A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet. - shshemi/tabiew

👍1

1.3K viewstutunak, 15:04

DevOps & SRE notes

KubeBuddy - A PowerShell tool for monitoring and managing Kubernetes clusters. Perform health checks, resource usage insights, and configuration audits with ease. Supports AKS best practices, snapshot-based monitoring, and security checks tailored for Kubernetes environments. Available on the PowerShell Gallery.

https://github.com/KubeDeckio/KubeBuddy

GitHub - KubeDeckio/KubeBuddy: KubeBuddy - A PowerShell tool for monitoring and managing Kubernetes clusters. Perform health checks…

KubeBuddy - A PowerShell tool for monitoring and managing Kubernetes clusters. Perform health checks, resource usage insights, and configuration audits with ease. Supports AKS best practices, snaps...

👍3❤1

1.31K viewstutunak, 08:03

DevOps & SRE notes

This piece from Airbnb Engineering details their journey of building a centralized user signals platform. It explores the motivations, challenges, and architectural decisions behind creating a system to capture user interactions at scale.
https://medium.com/airbnb-engineering/building-a-user-signals-platform-at-airbnb-b236078ec82b

Building a User Signals Platform at Airbnb

How Airbnb built a stream processing platform to power user personalization.

👍1

1.52K viewstutunak, 15:00

DevOps & SRE notes

In this insightful study, the Games24x7 Tech team shares their experience of migrating Node.js services to Kubernetes. They discuss the strategies and tools used to achieve a seamless and efficient transition with minimal downtime.
https://medium.com/@Games24x7Tech/how-we-seamlessly-transitioned-our-node-services-to-k8s-7e2e6067daa0

👍1

1.42K viewstutunak, 08:04

DevOps & SRE notes

TUI dashboard builder that lets you visualize data from scripts and APIs right in your console

https://github.com/rasjonell/dashbrew

GitHub - rasjonell/dashbrew: TUI dashboard builder that lets you visualize data from scripts and APIs right in your console

TUI dashboard builder that lets you visualize data from scripts and APIs right in your console - rasjonell/dashbrew

1.59K viewstutunak, 15:02

DevOps & SRE notes

Forwarded from Make. Build. Break. Reflect.

🤣9🔥5

1.46K viewstutunak, 16:57

DevOps & SRE notes

A tool for exploring each layer in a docker image

https://github.com/wagoodman/dive

GitHub - wagoodman/dive: A tool for exploring each layer in a docker image

A tool for exploring each layer in a docker image. Contribute to wagoodman/dive development by creating an account on GitHub.

❤1

1.41K viewstutunak, 08:00

DevOps & SRE notes

Author dotdc presents Terraflow, a CI/CD orchestrator designed to scale Terraform operations effectively. This report outlines the creation of the tool and how it helps manage complex infrastructure deployments.
https://medium.com/@dotdc/creating-terraflow-a-ci-cd-orchestrator-to-scale-terraform-3965b3f8931f

Creating Terraflow, a CI/CD orchestrator to scale Terraform

👍1

1.36K viewstutunak, 15:01

DevOps & SRE notes

This analysis provides a deep dive into writing policies for Kubernetes clusters using OPA Gatekeeper. The Permify Tech Blog explains how to enforce custom rules and maintain security and compliance in a cloud-native environment.
https://medium.com/permify-tech-blog/opa-gatekeeper-how-to-write-policies-for-kubernetes-clusters-bb660666eb19

Opa Gatekeeper: How To Write Policies For Kubernetes Clusters

Learn how to leverage OPA Gatekeeper to write and enforce policies in Kubernetes clusters, ensuring security and efficient resource…

❤1👍1

1.36K viewstutunak, 08:00

DevOps & SRE notes

🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).

https://github.com/openobserve/openobserve

GitHub - openobserve/openobserve: Modern observability platform: 10x easier, 140x lower storage cost, petabyte scale. Open-source…

Modern observability platform: 10x easier, 140x lower storage cost, petabyte scale. Open-source alternative to Elasticsearch/Splunk/Datadog for logs, metrics, traces, RUM, and more. - openobserve/o...

🤣3

1.39K viewstutunak, 15:01

DevOps & SRE notes

Checkmate is an open-source, self-hosted tool designed to track and monitor server hardware, uptime, response times, and incidents in real-time with beautiful visualizations

https://github.com/bluewave-labs/Checkmate

GitHub - bluewave-labs/Checkmate: Checkmate is an open-source, self-hosted tool designed to track and monitor server hardware,…

Checkmate is an open-source, self-hosted tool designed to track and monitor server hardware, uptime, response times, and incidents in real-time with beautiful visualizations. Don't be shy, ...

👍2

1.45K viewstutunak, 08:04

DevOps & SRE notes

AWS just released their postmortem (link in comment) for the October DynamoDB outage. It's thorough, technically detailed, and explains exactly what broke and how they'll "prevent" it from happening again. But this PR-approved, sanitized narrative tells us only what happened to the technology, nothing else.

https://aws.amazon.com/message/101925/

❤2👍2

1.33K viewstutunak, 09:57

DevOps & SRE notes

Marc Christian P. Gregorio offers a practical commentary on automating centralized NAT Gateways in AWS across multiple VPCs and regions using Terraform. The solution aims to optimize costs and simplify network management for large-scale deployments.
https://medium.com/@marcchristianp.gregorio/automating-centralized-nat-gateways-in-aws-vpcs-and-region-with-terraform-69a6f90d60da

Automating Centralized NAT Gateways in AWS VPCs and Region with Terraform

When managing a large-scale AWS environment with multiple accounts, deploying multiple NAT gateways across various VPCs can become very…

👍3❤1

1.37K viewstutunak, 15:01

DevOps & SRE notes

Elliot Graebert proposes an impact-based leveling system for engineering organizations as an alternative to traditional career ladders. This treatise discusses how focusing on impact can foster a more motivated and effective engineering culture.
https://medium.com/@elliotgraebert/an-impact-based-level-system-for-engineering-organizations-2e0f9bee20e6

An impact-based level system for engineering organizations

Defining L1-L6 for individual contributors and leads

👍2❤1

1.48K viewstutunak, 08:01

DevOps & SRE notes

https://github.com/cozystack/cozypkg

Cozy wrapper around Helm and Flux CD for local development

GitHub - cozystack/cozypkg: Cozy wrapper around Helm and Flux CD for local development

Cozy wrapper around Helm and Flux CD for local development - cozystack/cozypkg

👍3

1.33K viewstutunak, 15:05

DevOps & SRE notes

A human-friendly alternative to netstat for socket and port monitoring on Linux and macOS.

https://github.com/theopfr/somo

GitHub - theopfr/somo: A human-friendly alternative to netstat for socket and port monitoring on Linux and macOS.

A human-friendly alternative to netstat for socket and port monitoring on Linux and macOS. - theopfr/somo

👍2

1.28K viewstutunak, 08:05

DevOps & SRE notes

This article from JP Gouin provides a deep dive into implementing GitOps at scale, with a specific focus on the cluster bootstrapping process. It covers the challenges and solutions for managing numerous Kubernetes clusters efficiently and declaratively.
https://medium.com/@jp-gouin/gitops-at-scale-clusters-bootstrapping-f36695d4340d

GitOps at scale — Clusters bootstrapping

Explore one approach to help infrastructure team managing their multiple environments, variants and all required applications

❤2

1.34K viewstutunak, 15:02

DevOps & SRE notes

This edition of the Scalable Thread newsletter breaks down effective strategies for handling sudden and unexpected bursts of traffic to your systems. It explores architectural patterns and techniques to ensure reliability and prevent service degradation during traffic spikes.
https://newsletter.scalablethread.com/p/how-to-handle-sudden-bursts-of-traffic

How to Handle Sudden Bursts of Traffic or "Thundering Herd Problem"?

Techniques to Avoid Potential Failures Caused by Sudden Traffic Spikes

👍3

1.33K viewstutunak, 09:01

DevOps & SRE notes

Enable dynamic and seamless Kubernetes multi-cluster topologies

https://github.com/liqotech/liqo

GitHub - liqotech/liqo: Enable dynamic and seamless Kubernetes multi-cluster topologies

Enable dynamic and seamless Kubernetes multi-cluster topologies - liqotech/liqo

👍1

1.34K viewstutunak, 16:03

DevOps & SRE notes

Terraform configuration for my entire Mikrotik-powered home network.

https://github.com/mirceanton/mikrotik-terraform/

GitHub - mirceanton/mikrotik-terraform: Terraform configuration for my entire Mikrotik-powered home network.

Terraform configuration for my entire Mikrotik-powered home network. - mirceanton/mikrotik-terraform

❤1👍1

1.35K viewstutunak, 09:02