Python 🐍 Work With Data
1.6K subscribers
76 photos
13 videos
136 files
441 links
A collection of books and articles on Python and various data manipulation tools. Overview of architecture of business intelligence systems, design and development of BI Reports, data processing in Python Pandas.
Download Telegram
Mining the Social Web, 3rd Edition (Jupyter Examples)

The official code repository for Mining the Social Web, 3rd Edition. The book is available from Amazon and Safari Books Online.

The notebooks folder of this repository contains the latest bug-fixed sample code used in the book chapters.

▫️ Chapter 0 - Preface.ipynb
▫️ Chapter 1 - Mining Twitter.ipynb
▫️ Chapter 2 - Mining Facebook.ipynb
▫️ Chapter 3 - Mining Instagram.ipynb
▫️ Chapter 4 - Mining LinkedIn.ipynb
▫️ Chapter 5 - Mining Text Files.ipynb
▫️ Chapter 6 - Mining Web Pages.ipynb
▫️ Chapter 7 - Mining Mailboxes.ipynb
▫️ Chapter 8 - Mining GitHub.ipynb
▫️ Chapter 9 - Twitter Cookbook.ipynb
▫️ _Appendix A - Virtual Machine Experience.ipynb
▫️ _Appendix B - OAuth Primer.ipynb
▫️ _Appendix C - Python & Jupyter Notebook Tips.ipynb

👉 @python_powerbi

https://github.com/mikhailklassen/Mining-the-Social-Web-3rd-Edition
data_culture_survey_23_report.pdf
3.1 MB
BARC DATA CULTURE SURVEY 23

How to liberalize data access to empower data users
Обзор форматов данных и способы оптимизации при работе с Pandas

Подборка из трех статей:

🔸 The Best Format to Save Pandas Data
🔸 Optimize Storing in Pandas: 98% Faster Disk Reads and 72% Less Space
🔸 Stop persisting pandas data frames in CSVs

〰️〰️〰️
👉 @python_powerbi
👍1
Data Analytics Methods and Algorithms
🔥1
Waterfall vs. Agile
👍21
Forwarded from Data Coffee
В свежем новостном выпуске Data Coffee:
— Что происходит в Яндексе
— Судьба арбузов и почерневших ягод
— Diagram as a Code и свежие интерфейсы для доступа к данным
— Ну и, конечно, Doom и VR!

#datacoffee #podcast #data

Где слушать🎧:
Anchor.FM
YouTube
Бот (последний эпизод)
Остальные площадки
ClickHouse Academy https://clickhouse.com/learn/
Become a ClickHouse expert with our free official ClickHouse training

Например, блок ClickHouse Developer Courses содержит следующие курсы:
▫️ 101 - Getting Started (duration: 25 minutes)
▫️ 102 - Creating Databases and Tables (duration: 25 minutes)
▫️ 103 - ClickHouse Architecture (duration: 25 minutes)
▫️ 104 - Data Ingestion (duration: 20 minutes)
▫️ Deduplication (duration: 25 minutes)
▫️ Materialized Views (duration: 25 minutes)
▫️ Projections (duration: 15 minutes)

👉 @python_powerbi
3👍1🔥1
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

MIT License

Mermaid is a JavaScript-based diagramming and charting tool that uses Markdown-inspired text definitions and a renderer to create and modify complex diagrams. The main purpose of Mermaid is to help documentation catch up with development.

https://github.com/mermaid-js/mermaid
👍1
Знаете ли Вы что-нибудь про Open-Source Data Catalogs: DataHub or Open MetaData? Если знаете, то какой на ваш взгляд более удобный в использовании?
Anonymous Poll
15%
DataHub
2%
Open Metadata
83%
Нет опыта с этими системами
Juypter Notebooks for ClickHouse

This directory contains sample notebooks showing how to connect Python to ClickHouse.

Notebook code has been tested on Ubuntu using Anaconda and Python 3.7. They should work from Python 3.5 onwards.

▫️ EX-1.0-Getting-to-Know-the-Clickhouse-driver-Client.ipynb
▫️ EX-1.1-Something-You-Should-Not-Do.ipynb
▫️ EX-2-ClickHouse-SQL-Alchemy.ipynb
▫️ EX-3-Sql-Magic-Functions.ipynb
▫️ EX-4-Pivot-Using-SQL-And-Pandas.ipynb
▫️ EX-5-Airline-OnTime-Data.ipynb


-------------------------------------
👉 @python_powerbi
🔥3
This media is not supported in your browser
VIEW IN TELEGRAM
GitHub - approximatelabs/sketch

AI code-writing assistant that understands data content

https://github.com/approximatelabs/sketch
🔥1