PythonHub – Telegram

PythonHub

2.45K subscribers

2.35K photos

49.5K links

News & links about Python programming.
https://pythonhub.dev/

LLM-Deflate: Extracting LLMs Into Datasets

LLM-Deflate is a technique for systematically extracting structured datasets from trained large language models by probing their internal knowledge with hierarchical topic exploration and prompt engineering. This reverse-compression process enables model analysis, knowledge transfer, training data augmentation, and debugging, potentially making knowledge extraction a standard tool as inf...

https://www.scalarlm.com/blog/llm-deflate-extracting-llms-into-datasets

Large Language Models compress massive amounts of training data into their parameters. This compression is lossy but highly effective—billions of parameters can encode the essential patterns from terabytes of text. However, what’s less obvious is that this…

157 views, 11:15
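
The "hierarchical topic exploration" mentioned in the summary above can be pictured as a breadth-first walk over a topic tree. The sketch below is only an illustration under assumptions: `list_subtopics` and `make_example` are hypothetical callables that would wrap prompts to a model, and the fake stand-ins exist only so the example runs offline. The linked post's actual pipeline may differ.

```python
from collections import deque

# Hypothetical stand-in data; a real run would ask a model for subtopics.
FAKE_TREE = {"science": ["physics", "biology"], "physics": ["optics"]}


def fake_subtopics(topic):
    """Stand-in for a prompt such as 'list the subtopics of <topic>'."""
    return FAKE_TREE.get(topic, [])


def fake_example(topic):
    """Stand-in for a prompt that asks the model for a record about a topic."""
    return f"Q/A pair about {topic}"


def explore_topics(root, list_subtopics, make_example, max_depth=2):
    """Walk the topic tree breadth-first and collect one record per topic."""
    dataset = []
    queue = deque([(root, 0)])
    seen = {root}
    while queue:
        topic, depth = queue.popleft()
        dataset.append({"topic": topic, "depth": depth, "text": make_example(topic)})
        if depth >= max_depth:
            continue  # do not expand below the depth limit
        for sub in list_subtopics(topic):
            if sub not in seen:  # skip topics that were already queued
                seen.add(sub)
                queue.append((sub, depth + 1))
    return dataset
```

Calling `explore_topics("science", fake_subtopics, fake_example)` returns one record per visited topic; the depth limit and the `seen` set keep the walk finite even if subtopics repeat.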

The Kaggle Grandmasters Playbook: 7 Battle-Tested Modeling Techniques for Tabular Data

The Kaggle Grandmasters Playbook presents seven proven techniques for tabular data modeling, emphasizing fast experimentation and careful validation powered by GPU acceleration to handle large-scale data effectively. Key strategies include advanced exploratory data analysis, building diverse baselines, extensive feature engineering, ensembling with hill climbing and stacking, pseudo-labe...

https://developer.nvidia.com/blog/the-kaggle-grandmasters-playbook-7-battle-tested-modeling-techniques-for-tabular-data/

NVIDIA Technical Blog

Over hundreds of Kaggle competitions, we’ve refined a playbook that consistently lands us near the top of the leaderboard—no matter if we’re working with millions of rows, missing values…

164 views, 17:15
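
As a rough illustration of "ensembling with hill climbing" from the summary above: one common reading is greedy forward selection, where models are added to an averaged blend (with replacement) only while the validation score improves. The sketch below assumes that reading, uses mean squared error as the metric, and is written in pure Python rather than the GPU tooling the article discusses. The function names and sample data are hypothetical, not taken from the article.

```python
def mse(pred, target):
    """Mean squared error between two equal-length lists."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)


def blend(members, preds):
    """Average the predictions of the chosen members (repeats act as weights)."""
    n = len(members)
    size = len(preds[members[0]])
    return [sum(preds[m][i] for m in members) / n for i in range(size)]


def hill_climb(preds, target, max_members=10):
    """Greedily grow an averaged ensemble while validation MSE improves."""
    # Start from the single best model on the validation set.
    best = min(preds, key=lambda name: mse(preds[name], target))
    members = [best]
    best_score = mse(preds[best], target)
    while len(members) < max_members:
        # Try adding each model (with replacement) and keep the best addition.
        candidate = min(
            preds, key=lambda name: mse(blend(members + [name], preds), target)
        )
        score = mse(blend(members + [candidate], preds), target)
        if score >= best_score:
            break  # no addition improves the validation score
        members.append(candidate)
        best_score = score
    return members, best_score
```

For example, given validation predictions `{"a": [1.5, 2.5, 3.5], "b": [0.5, 1.5, 2.5], "c": [3.0, 3.0, 3.0]}` and targets `[1.0, 2.0, 3.0]`, the search keeps `a` and `b`, whose errors cancel, and leaves out `c`.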

How to Build Advanced AI Agents – Course for Beginners (LiveKit, Exa, LangChain)

The video teaches beginners how to build advanced AI agents, such as voice sales agents, research assistants, and multi-agent workflows, using LiveKit, Exa, LangChain, and Cerebras. It provides step-by-step guidance, hands-on code, and free API credits to help developers quickly create real-world AI applications.

https://www.youtube.com/watch?v=B0TJC4lmzEM

Learn how to build real-world AI apps in this 3-part workshop series. You'll learn to build voice agents, deep research tools, multi-agent workflows, and more.…

171 views, 23:15

Python Singleton Pattern: Smarter Than You Think?

This video analyzes the strengths and weaknesses of the singleton pattern in Python, explaining why global state is risky but controlled instantiation can be valuable in certain cases. It recommends module-level singletons and thread safety measures, while cautioning against tight coupling and testing pitfalls with traditional singleton implementations.

https://www.youtube.com/watch?v=p_UQ7tzUFLo

The Real Reason the Singleton Pattern Exists

💡 Learn how to design great software in 7 steps: https://arjan.codes/designguide.

Singletons are often criticized for introducing global state and making code harder to test—but there’s more to the story. In this video, we explore the real problems with…

166 views, 05:15
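
To make the "controlled instantiation" and "thread safety measures" from the summary above concrete, here is a minimal sketch of a lock-guarded singleton accessor. It is not the code from the video; the `Settings` class, its `values` field, and the `collect_instances` helper are hypothetical names chosen for illustration.

```python
import threading


class Settings:
    """Hypothetical shared object; the name and fields are illustrative."""

    _instance = None
    _lock = threading.Lock()

    def __init__(self):
        self.values = {}

    @classmethod
    def instance(cls):
        # Double-checked locking: only take the lock while no instance exists.
        if cls._instance is None:
            with cls._lock:
                if cls._instance is None:
                    cls._instance = cls()
        return cls._instance


def collect_instances(n_threads=8):
    """Call Settings.instance() from several threads and return the results."""
    seen = []

    def worker():
        seen.append(Settings.instance())

    threads = [threading.Thread(target=worker) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return seen
```

Every thread receives the same object. The module-level alternative mentioned in the summary is simpler still: create one instance at the top level of a module and import it wherever needed, since Python caches imported modules.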

LLMs from Scratch – Practical Engineering from Base Model to PPO RLHF

This video provides a hands-on guide to building a large language model entirely from scratch in PyTorch, covering every step from core transformer design to advanced alignment with RLHF. By the end, viewers gain practical experience in implementing, training, scaling, and aligning their own custom LLMs.

https://www.youtube.com/watch?v=p3sij8QzONQ

Learn to build a complete large language model from scratch using only pure PyTorch. This course takes you through the entire lifecycle, from foundational concepts to advanced alignment techniques. By the end, you'll have the deep, hands-on experience needed…

189 views, 11:15

Unlocking Performance in Python's Free-Threaded Future: GC Optimizations

A description of the performance optimizations made to the free-threaded garbage collector for Python 3.14.

https://labs.quansight.org/blog/free-threaded-gc-3-14

198 views, 17:15

Air

The new web framework that breathes fresh air into Python web development. Built with FastAPI, Starlette, and Pydantic.

https://github.com/feldroy/air

GitHub - feldroy/air: The new Python web framework by the authors of Two Scoops of Django

204 views, 23:15

Python Hub Weekly Digest for 2025-10-05

https://pythonhub.dev/digest/2025-10-05/

Popular articles, projects, and Reddit threads from the world of Python programming.

179 views, 18:15

onyx-dot-app / onyx

Open Source AI Platform - AI Chat with advanced features that works with every LLM

https://github.com/onyx-dot-app/onyx

179 views, 20:15

Helium

Private, fast, and honest web browser.

https://github.com/imputnet/helium

164 views, 23:15

Pyscn – Python code quality analyzer for vibe coders

https://github.com/ludo-technologies/pyscn

GitHub - ludo-technologies/pyscn: An Intelligent Python Code Quality Analyzer

165 views, 03:15

memvid

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

https://github.com/Olow304/memvid

162 views, 07:15

DuckDB vs Polars. Wait. DuckDB and Polars.

The article emphasizes that DuckDB and Polars are not direct competitors but complementary tools in the Modern Data Stack, with each excelling in different contexts: DuckDB is best for SQL-heavy analytics and embedding as a query engine, while Polars suits end-to-end ETL pipelines and DataFrame-centric workflows. The choice depends on your problem context, team comfort, and use case rath...

https://www.confessionsofadataguy.com/duckdb-vs-polars-wait-duckdb-and-polars/

Confessions of a Data Guy

So, the classic newbie question. DuckDB vs Polars, which one should you pick? This is an interesting question, and actually drives a lot of search traffic to this website on which you find yourself wasting time. I thank you for that. This is probably the…

158 views, 11:15

PyOCI – Publish and install private Python packages using OCI/Docker registries

https://github.com/AllexVeldman/pyoci

146 views, 15:15

search_evals

Batteries-included eval framework for search APIs.

https://github.com/perplexityai/search_evals

148 views, 19:15

Simplifying Resource Management in mssql-python through Context Manager

The article introduces context manager support in the mssql-python driver, allowing Python applications to manage SQL Server and Azure SQL resources more safely and efficiently using Python's "with" statement. This feature automates opening and closing of connections and cursors, as well as commit and rollback of transactions, reducing boilerplate code, preventing resource leaks, and ens...

https://devblogs.microsoft.com/python/simplifying-resource-management-in-mssql-python-through-context-manager/

Uncover the advantages of using the Python driver for SQL Server in your projects, ensuring clean and efficient database access.

144 views, 23:15
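
The pattern described in the summary above (a `with` block that commits on success, rolls back on error, and always closes the connection) can be sketched with the standard library. This example deliberately uses `sqlite3` as a stand-in and does not show the mssql-python API; the `transaction` helper, the `demo` function, and the table name are hypothetical.

```python
import os
import sqlite3
import tempfile
from contextlib import contextmanager


@contextmanager
def transaction(database):
    """Open a connection, commit on success, roll back on error, always close."""
    conn = sqlite3.connect(database)
    try:
        yield conn
        conn.commit()
    except Exception:
        conn.rollback()
        raise
    finally:
        conn.close()


def demo():
    """Insert one row, fail on a second insert, and return what was kept."""
    path = os.path.join(tempfile.mkdtemp(), "demo.db")
    with transaction(path) as conn:
        conn.execute("CREATE TABLE IF NOT EXISTS t (x INTEGER)")
        conn.execute("INSERT INTO t VALUES (1)")
    try:
        with transaction(path) as conn:
            conn.execute("INSERT INTO t VALUES (2)")
            raise RuntimeError("simulated failure")  # triggers the rollback
    except RuntimeError:
        pass
    with transaction(path) as conn:
        return [row[0] for row in conn.execute("SELECT x FROM t ORDER BY x")]
```

The second insert is discarded because the block raised before reaching the commit, so only the first row survives. No explicit `close()` or `rollback()` call appears at the call sites, which is the boilerplate reduction the article describes.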

Why Is Python So Popular in 2025? – The PyCharm Blog

https://blog.jetbrains.com/pycharm/2025/09/why-is-python-so-popular/

The JetBrains Blog

From powering AI and data science to driving web development and automation, Python continues to dominate in 2025. Discover why in our blog post.

150 views, 03:15

bytedance / Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

https://github.com/bytedance/Dolphin

155 views, 07:15

I built a full programming language interpreter in Python based on a meme

https://www.reddit.com/r/Python/comments/1nmta0f/i_built_a_full_programming_language_interpreter/

176 views, 11:15

Effective context engineering for AI agents

Context is a critical but finite resource for AI agents. In this post, we explore strategies for effectively curating and managing the context that powers them.

https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

204 views, 15:15