Мониторим ИТ

One Grafana Dashboard With Multiple Prometheus Datasources

In this article, the following aspects of using Prometheus and Grafana will be demonstrated:

⚡ One Grafana server presenting data from multiple Prometheus datasources.

⚡ Each dashboard shows only selected Prometheus datasources (not every configured datasource is relevant to every dashboard).

⚡ Each dashboard presents only the relevant data from each datasource, according to its content (for example, when a panel shows a storage mount whose mount requirements differ per server).

⚡️ Useful dashboards for your needs:
- Host / VM resources (CPU, RAM, storage and I/O, network).
- Docker containers (resource usage per container).
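
One possible way to limit a dashboard to selected datasources is a datasource-type template variable with a name filter. The sketch below is an assumption, not necessarily the article's approach: the field names follow the Grafana dashboard JSON model as commonly exported, and the variable name and the `prometheus-web` / `prometheus-db` datasource names are hypothetical.

```python
import json


def datasource_variable(name: str, name_regex: str) -> dict:
    """Build a dashboard template variable that lists only Prometheus
    datasources whose names match name_regex (field names are assumed
    from the exported dashboard JSON; check them against your Grafana)."""
    return {
        "name": name,
        "type": "datasource",    # variable type: a list of datasources
        "query": "prometheus",   # datasource plugin type to list
        "regex": name_regex,     # keep only matching datasource names
        "multi": False,
        "includeAll": False,
    }


# Hypothetical datasource names: only these two would be offered.
variable = datasource_variable("datasource", "/^prometheus-(web|db)$/")
dashboard_fragment = {"templating": {"list": [variable]}}
print(json.dumps(dashboard_fragment, indent=2))
```

Panels would then reference the variable instead of a fixed datasource, so the same dashboard can be switched between the selected servers.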

Read more.

Medium

One Grafana Dashboard With Multiple Prometheus Datasources

In this article, the following aspects of using Prometheus and Grafana will be demonstrated:

2.4K views, edited 06:28

Мониторим ИТ

Calculating percentiles for monitoring high-load systems

Monitoring often calls for percentiles. They show how a system behaves most of the time, unlike averages, which are heavily skewed by outliers. If 9 out of 10 requests complete in 1 second and one takes 10 seconds, the mean is 1.9 seconds, while the 50th percentile is 1 second. This is just one example of why the mean is a poor fit for monitoring. Percentiles have to be calculated, and for that we added a Summary collector to tarantool/metrics. Read more.
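
The arithmetic from the example above can be checked with a few lines of Python. This is only an illustration of mean versus median on ten sample values; it is not how the Summary collector in tarantool/metrics works.

```python
import statistics

# Nine requests take 1 second each, one request takes 10 seconds.
latencies = [1.0] * 9 + [10.0]

mean_latency = statistics.mean(latencies)      # pulled up by the outlier
median_latency = statistics.median(latencies)  # the 50th percentile

print(f"mean = {mean_latency:.1f} s")
print(f"p50  = {median_latency:.1f} s")
```

The single slow request nearly doubles the mean, while the median stays at the latency that most requests actually see.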

Habr

Calculating percentiles for monitoring high-load systems

Hi, my name is Igor, and I am a Tarantool solutions developer at Mail.ru Group. I work on real-time marketing data marts for MegaFon. Monitoring often calls for...

2.8K views 14:02

Мониторим ИТ

Gals Software and Broadcom invite you to a webinar on the DX Operations Intelligence umbrella monitoring system

The foundation of DX OI is a modern distributed cloud architecture. The solution applies machine learning to all incoming data, both from Broadcom domain solutions and from third-party systems such as Zabbix, SCOM, and other popular tools, via a REST API. The core function of DX OI is building a complete resource-service model (RSM) from configuration items (CIs) that populate the inventory database during integration with third-party systems. An important feature of DX OI is its ability to predict a future CI failure and assess its impact on service availability.

The webinar will take place on Friday, November 27, at 11 a.m. Moscow time on Zoom.

⚡️ Webinar registration

⚡️ Habr article describing the features

3.2K views, edited 08:36

Мониторим ИТ

How we eliminated service outages from ‘certificate expired’ by setting up alerts with Grafana and Prometheus

There’s one thing most of the customers have in common: At one point or another, expired certificates have caused a problem. In theory, they shouldn’t; the exact expiration date is known, and so is the process for updating. But still the problems persist!

In this blog post, we present a simple yet effective solution: Monitor the expiration date of certificates with Prometheus and visualize it with Grafana, using features from the new table visualization in Grafana 7. Read more.
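
The core check behind this kind of alert is simple date arithmetic. Below is a minimal sketch in plain Python rather than the Prometheus setup from the post: the 30-day threshold and both function names are hypothetical, and the `not_after` string is assumed to be in the format returned by `ssl.SSLSocket.getpeercert()`.

```python
import ssl
import time
from typing import Optional

SECONDS_PER_DAY = 86400


def days_until_expiry(not_after: str, now: Optional[float] = None) -> float:
    """Days left before a certificate's notAfter timestamp.

    not_after looks like "Jan  5 09:34:43 2018 GMT".
    now is a Unix timestamp; the current time is used when it is omitted.
    """
    if now is None:
        now = time.time()
    return (ssl.cert_time_to_seconds(not_after) - now) / SECONDS_PER_DAY


def should_alert(not_after: str, threshold_days: float = 30.0,
                 now: Optional[float] = None) -> bool:
    """True when the certificate expires in fewer than threshold_days."""
    return days_until_expiry(not_after, now) < threshold_days


# Example with a fixed "now" placed 10 days before the expiry date.
expiry = "Jan  5 09:34:43 2018 GMT"
fixed_now = ssl.cert_time_to_seconds(expiry) - 10 * SECONDS_PER_DAY
print(days_until_expiry(expiry, fixed_now), should_alert(expiry, now=fixed_now))
```

Passing `now` explicitly keeps the check deterministic, which makes it easy to test the threshold logic without waiting for a real certificate to approach expiry.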

3.1K views 08:00

Мониторим ИТ

Monitoring the Mattermost server with Prometheus and Grafana

We’ve been using Prometheus and Grafana to monitor our cluster for a while now, and you can read this great post where my colleague Stylianos explains how we have them working for our multi-cluster environment. Read more.

Mattermost.com

Monitoring the Mattermost server with Prometheus and Grafana

Lately we've been working on improving different parts of the Mattermost server, including our monitoring and observability capabilities using Prometheus and Grafana.

2.2K views 08:50

Мониторим ИТ

Prometheus and VictoriaMetrics: a fault-tolerant infrastructure for storing metrics

The stack in question: Prometheus, Alertmanager, Pushgateway, Blackbox exporter, Grafana, and VictoriaMetrics. Read more.

2.4K views 15:18

Мониторим ИТ

How to find traces in Tempo with Elasticsearch and Grafana

Grafana Tempo, the recently announced distributed tracing backend, relies on integrations with other data sources for trace discovery. Tempo’s job is to store massive amounts of traces, place them in object storage, and retrieve them by ID. Logs and other data sources allow users to quickly and more powerfully jump directly to traces than ever before. Read more.

Grafana Labs

How to find traces in Tempo with Elasticsearch and Grafana | Grafana Labs

Here's how to use Elasticsearch for trace discovery in Tempo, a fantastic new tool for mass trace ingestion.

2.3K views 18:42

Мониторим ИТ

/proc/meminfo + gawk = convenient JSON for metric discovery in Zabbix

While working on a task, I needed to add all the memory counters from /proc/meminfo on several Linux hosts to monitoring, in order to track memory state over time. Read more.
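
The same idea can be sketched in Python instead of gawk (a swapped-in language, not the article's script). The `{#MEMINFO}` macro name is hypothetical, and the `{"data": [...]}` wrapper is an assumption about the low-level discovery format your Zabbix version expects, so check it against your server.

```python
import json
import re

# A /proc/meminfo line looks like "MemTotal:       16316412 kB";
# some counters (for example HugePages_Total) have no unit.
LINE_RE = re.compile(
    r"^(?P<name>[^:\s]+):\s+(?P<value>\d+)(?:\s+(?P<unit>\w+))?\s*$"
)


def parse_meminfo(text: str) -> dict:
    """Map counter names to integer values, converting kB to bytes."""
    counters = {}
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if not match:
            continue  # skip anything that is not "Name: value [unit]"
        value = int(match.group("value"))
        if match.group("unit") == "kB":
            value *= 1024
        counters[match.group("name")] = value
    return counters


def discovery_json(counters: dict) -> str:
    """Render counter names as discovery JSON (hypothetical macro name)."""
    return json.dumps({"data": [{"{#MEMINFO}": name} for name in counters]})


sample = (
    "MemTotal:       16316412 kB\n"
    "MemFree:         1234567 kB\n"
    "HugePages_Total:       0\n"
)
counters = parse_meminfo(sample)
print(discovery_json(counters))
```

On a real host the text would come from reading `/proc/meminfo`; the sample string keeps the sketch runnable anywhere.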

2.6K views 08:00

Мониторим ИТ

A Habr article for those interested in Ceph, a software-defined distributed file system.

2.8K views 12:30

Мониторим ИТ

An entertaining Habr article on why Apache Kafka is so fast and so popular. If you work with the technology, we recommend getting to know what Kafka has under the hood. It explains a lot. For example, you can read about record batching, batch compression, buffered operations, zero-copy, and other tricks.

Habr

Why Kafka is so fast

Software architecture has changed enormously over the past few years. The idea of a single monolithic application, or even several large services,...

2.7K views 12:30

Мониторим ИТ

Monitoring multithreaded Node.js applications

In this article we look at the specifics of monitoring a multithreaded Node.js application, using as an example our collector for a service that monitors and analyzes PostgreSQL server logs. Read more.

Habr

Monitoring multithreaded Node.js applications

In this article we look at the specifics of monitoring a multithreaded Node.js application, using as an example our collector for a service that monitors and analyzes the logs of servers...

4.1K views 08:00

Мониторим ИТ

How we eliminated service outages from ‘certificate expired’ by setting up alerts with Grafana and Prometheus

In this blog post, we present a simple yet effective solution: Monitor the expiration date of certificates with Prometheus and visualize it with Grafana, using features from the new table visualization in Grafana 7. Read more.

Grafana Labs

How we eliminated service outages from ‘certificate expired’ by setting up alerts with Grafana and Prometheus | Grafana Labs

In this guest blog, get the step-by-step instructions to set up monitoring for the expiration date of certificates.

2.4K views 08:00

Мониторим ИТ

How histograms changed the game for monitoring time series with Prometheus

Watch the recording from FOSDEM 2020.

Grafana Labs

How histograms changed the game for monitoring time series with Prometheus | Grafana Labs

At FOSDEM 2020, I did a deep dive into the secret history of histograms in Prometheus.

3.4K views 10:00

Мониторим ИТ

IoT monitoring with Grafana: How Eurac observes climate change in the Alps

The Eurac Research team uses Grafana for several different purposes. In addition to infrastructure monitoring and maintenance, we use Grafana as a data analysis tool for identifying trends and patterns and publicly share our dashboards to make our data more accessible. Read more.

Grafana Labs

IoT monitoring with Grafana: How Eurac observes climate change in the Alps

Using Grafana, Eurac Research tracks and analyzes data from 24 micro-climatic stations in the Italian Alps.

2.7K views 08:00

Мониторим ИТ

How to migrate your configuration database

By default, Grafana uses the SQLite database bundled with the distribution, but another database can be used instead. This article covers how to move to MySQL. Read more.

Grafana Labs

How to migrate your configuration database | Grafana Labs

Grafana uses sqlite3 as the default configuration database. Here’s a look at how to migrate your configuration to a different database if you need to.

3.2K views 10:00

Мониторим ИТ

After Elon Musk's well-known tweet recommending Signal, the messenger saw a sharp surge in new users. Riding that wave of popularity, the Zabbix blog published this article on integrating with the messenger.

2.2K views 14:38

Мониторим ИТ

Register for the Grafana Tempo webinar. It will be hosted by Joe Elliott, the creator of Tempo and a long-time Jaeger maintainer. The webinar will take place on February 4 at 17:30 UTC.

⚡Getting started setting up Tempo

⚡️ Why Tempo?

⚡️ How to discover traces without native search (Exemplars/Loki 2.0)

⚡️ Upcoming Grafana exemplar support

⚡️ Upcoming Prometheus exemplar support

2.2K views 08:00

Registration

Мониторим ИТ

A look at what takes up space in the Zabbix database

Zabbix Blog

What takes disk space - Zabbix Blog

In today’s class let’s talk about where the disk space goes. Which items and hosts objects consume the disk…

2.4K views 16:00

Мониторим ИТ

Has anyone heard of Performance Co-Pilot? It even has a Grafana integration.

You can also read more about PCP on the RHEL blog.

👍: heard of it and use it or have used it

👎: heard of it, but have not used it

🖕: I'm a mainstream devotee

2.3K views 18:21

👍 3 👎 10 🖕 13

Мониторим ИТ

Today Zabbix held a meetup where Alexei Vladishev (the founder of Zabbix) talked about major changes in version 5.4 (which, by the way, is not an LTS release). A new syntax is being introduced for trigger expressions, calculated items, and aggregate checks.

Before: {host:key.func(params)}=0

After: func(/host/key, params)
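
To get a feel for the change, here is a rough Python sketch that rewrites simple expressions from the old form into the new one. It is an illustration only: the regular expression and the `convert` helper are hypothetical, they handle just the plain `{host:key.func(params)}` shape, and real expressions may need parameter changes that this does not attempt.

```python
import re

# Old form: {host:key.func(params)}. The key may contain dots and
# brackets, so the function name is taken from the last ".name(...)".
OLD_RE = re.compile(
    r"\{(?P<host>[^:{}]+):(?P<key>[^{}]+)\.(?P<func>\w+)\((?P<params>[^()]*)\)\}"
)


def convert(expression: str) -> str:
    """Rewrite every old-style function call in a trigger expression."""
    def rewrite(match: re.Match) -> str:
        args = f"/{match['host']}/{match['key']}"
        if match["params"].strip():
            args += f", {match['params']}"
        return f"{match['func']}({args})"

    return OLD_RE.sub(rewrite, expression)


print(convert("{host:key.func(params)}=0"))
print(convert("{server:system.cpu.load[all,avg1].last()}>5"))
```

Anything outside the braces, such as the comparison at the end, is left untouched.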

Incidentally, starting with version 5.4 Zabbix will no longer support the old syntax, so it is worth preparing for. Below are a few screenshots from the presentation for context.

2.3K views, edited 14:26
