A post looking at the role of an SRE team in adopting observability tooling. A lot of this depends, in my experience, on the reality on the ground of roles vs the titles.
https://rootly.io/blog/the-role-of-sres-in-observability
#sre #observability
https://rootly.io/blog/the-role-of-sres-in-observability
#sre #observability
Rootly
The Role of SREs in Observability
Although conversation about observability often ignores SREs, SREs have a central role to play in observability success.
Forwarded from oleg_fov (Oleg Kovalov)
YouTube
Как правильно и надежно убить MySQL / Владимир Федорков (ECOMMPAY)
Приглашаем на конференцию Saint HighLoad++ 2025, которая пройдет 23 и 24 июня в Санкт-Петербурге!
Программа, подробности и билеты по ссылке: https://highload.ru/spb/2025
________
HighLoad++ Весна 2021
Крупнейшая профессиональная конференция для разработчиков…
Программа, подробности и билеты по ссылке: https://highload.ru/spb/2025
________
HighLoad++ Весна 2021
Крупнейшая профессиональная конференция для разработчиков…
Let service teams own the service operations instead of the SRE
https://medium.com/@crossbizz/let-service-teams-own-the-service-operations-instead-of-the-sre-4ff7bcbd53e0
#sre #devops
https://medium.com/@crossbizz/let-service-teams-own-the-service-operations-instead-of-the-sre-4ff7bcbd53e0
#sre #devops
Lessons Learned in 10 Years of SRE: Part 1 - Starting SRE
https://www.usenix.org/publications/loginonline/lessons-learned-10-years-sre-part-1-starting-sre
#sre #devops
https://www.usenix.org/publications/loginonline/lessons-learned-10-years-sre-part-1-starting-sre
#sre #devops
Why Your Monitoring Dashboard May Be Feeding You Phantom Metrics
https://hackernoon.com/why-your-monitoring-dashboard-may-be-feeding-you-phantom-metrics
#monitoring #observability #sre #devops
https://hackernoon.com/why-your-monitoring-dashboard-may-be-feeding-you-phantom-metrics
#monitoring #observability #sre #devops
Kubernetes monitoring: why it is difficult and how to improve it
https://youtu.be/R9oV6DE0K10
#kubernetes #k8s #monitoring #observability #sre #victoriametrics
https://youtu.be/R9oV6DE0K10
#kubernetes #k8s #monitoring #observability #sre #victoriametrics
botkube
#kubernetes #devops #chatops #chatbot #sre #monitoring
An app that helps you monitor your Kubernetes cluster, debug critical deployments & gives recommendations for standard practices
https://github.com/kubeshop/botkube#kubernetes #devops #chatops #chatbot #sre #monitoring
awesome platform engineering tools
https://github.com/seifrajhi/awesome-platform-engineering-tools
#tools #devops #sre #development #vcs #testing #monitoring #security
https://github.com/seifrajhi/awesome-platform-engineering-tools
#tools #devops #sre #development #vcs #testing #monitoring #security
How ilert Can Help Enhance Your Monitoring With Its VictoriaMetrics Integration
https://victoriametrics.com/blog/using-victoriametrics-and-ilert
#monitoring #sre #observability #ilert
https://victoriametrics.com/blog/using-victoriametrics-and-ilert
#monitoring #sre #observability #ilert
Grafana dashboards for AWS CloudWatch
https://github.com/monitoringartist/grafana-aws-cloudwatch-dashboards
#aws #grafana #cloudwatch #dashboard #monitoring #devops #sre
https://github.com/monitoringartist/grafana-aws-cloudwatch-dashboards
#aws #grafana #cloudwatch #dashboard #monitoring #devops #sre
Runbook automation platform with deep observability integrations for SRE & On-Call Teams
https://github.com/DrDroidLab/playbooks
#sre #monitoring #logs #metrics #alerts #traces #observability
https://github.com/DrDroidLab/playbooks
#sre #monitoring #logs #metrics #alerts #traces #observability
What is Reliability Engineering?
https://newsletter.pragmaticengineer.com/p/reliability-engineering
#sre
https://newsletter.pragmaticengineer.com/p/reliability-engineering
#sre
🚀 Join Mathias Palmersheim – Solution Engineer at conf42.com! 🎙
🛠 How to Monitor your Monitoring 🖥
📟 If your monitoring system crashes in the middle of the night, does your team get alerted?
💡Hopefully, yes – but if not, this talk will provide simple, cost-effective solutions to get you started with #victoriaMetrics!
🔧 And even if you already have #monitoring for your monitoring, Mathias will share expert tips to help you improve your current setup.
🗓 October 17th – Online
https://www.conf42.com/Incident_Management_2024_Mathias_Palmersheim_27_monitoring_monitoring_how
#monitoring #devops #cloud #sre
🛠 How to Monitor your Monitoring 🖥
📟 If your monitoring system crashes in the middle of the night, does your team get alerted?
💡Hopefully, yes – but if not, this talk will provide simple, cost-effective solutions to get you started with #victoriaMetrics!
🔧 And even if you already have #monitoring for your monitoring, Mathias will share expert tips to help you improve your current setup.
🗓 October 17th – Online
https://www.conf42.com/Incident_Management_2024_Mathias_Palmersheim_27_monitoring_monitoring_how
#monitoring #devops #cloud #sre
SREday 2024 - London
https://www.youtube.com/playlist?list=PL2CAJ_jforK6OBFSKr0ossbkyDfAMz_ix
#sre #devops
(playlist)https://www.youtube.com/playlist?list=PL2CAJ_jforK6OBFSKr0ossbkyDfAMz_ix
#sre #devops
Versus incident
https://github.com/VersusControl/versus-incident
#alerts #monitoring #sre #oncall
An open-source incident management system with multi-channel alerting capabilities
https://github.com/VersusControl/versus-incident
#alerts #monitoring #sre #oncall
Preq
https://github.com/prequel-dev/preq
#monitoring #reliability #sre
preq is the community-driven problem detector for Common Reliability Enumerations (CREs)
https://github.com/prequel-dev/preq
#monitoring #reliability #sre
Alerting best practices
https://victoriametrics.com/blog/alerting-best-practices
#sre #devops #observability #monitoring
https://victoriametrics.com/blog/alerting-best-practices
#sre #devops #observability #monitoring