Data Engineering / Инженерия данных / Data Engineer / DWH
1.94K subscribers
49 photos
7 videos
52 files
350 links
Data Engineering: ETL / DWH / Data Pipelines based on Open-Source software. Инженерия данных.

DWH / SQL
Python / ETL / ELT / dbt / Spark
Apache Airflow

Рекламу не размещаю
Вопросы: @iv_shamaev | datatalks.ru
Download Telegram
Apache_Hive_Essentials_Essential_techniques_to_help_you_process.pdf
3.9 MB
Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data

What you will learn
▫️Create and set up the Hive environment
▫️Discover how to use Hive's definition language to describe data
▫️Discover interesting data by joining and filtering datasets in Hive
▫️Transform data by using Hive sorting, ordering, and functions
▫️Aggregate and sample data in different ways
▫️Boost Hive query performance and enhance data security in Hive
▫️Customize Hive to your needs by using user-defined functions and integrate it with other tools
Practical_Real_time_Data_Processing_and_Analytics_Shilpi_Saxena.pdf
13.4 MB
Practical Real-time Data Processing and Analytics: Distributed Computing and Event Processing using Apache Spark, Flink, Storm, and Kafka

What You Will Learn
▫️Get an introduction to the established real-time stack
▫️Understand the key integration of all the components
▫️Get a thorough understanding of the basic building blocks for real-time solution designing
▫️Garnish the search and visualization aspects for your real-time solution
▫️Get conceptually and practically acquainted with real-time analytics
▫️Be well equipped to apply the knowledge and create your own solutions
Forwarded from Data-comics
Читала отчёт по DevOps Setups benchmarking 2022 от Luca G и humanitec.

В целом, есть интересные моменты про разные типы команд разработчиков, ребята провели большую работу.

Но результаты преподнесли немного дезинформирующе.
Пример - на приложенной картинке.

Что не так? 😁

Ссылка на отчёт тут: https://humanitec.com/whitepapers/2021-devops-setups-benchmarking-report
Файл, кому интересно, приложу в комменты.
TelegramOperator — apache-airflow-providers-telegram Documentation

Оператор Airflow для отправки уведомлений в Telegram

https://airflow.apache.org/docs/apache-airflow-providers-telegram/stable/operators.html
Play with Docker

▫️Docker 101 Tutorial - Self-paced tutorials to increase your Docker knowledge.
▫️Lab Environment - Complete a workshop without installing anything using this Docker playground.
▫️Community Training - Free and paid learning materials from Docker Captains.

https://www.docker.com/play-with-docker/
👍1