PythonHub
2.32K subscribers
2.35K photos
49K links
News & links about Python programming.
https://pythonhub.dev/
Download Telegram
Lessons learnt building a real-time audio application in Python

https://www.vangemert.dev/#/blog/lessons-learnt-backlooper
Classifying all of the pdfs on the internet

The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accurac...

https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html
Using GPT-4o for web scraping

The article discusses using GPT-4 with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4 effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy...

https://blancas.io/blog/ai-web-scraper/
Integrating Stripe Into A One-Product Django Python Shop

In the first part of this series, we created a Django online shop with htmx. In this second part, we'll handle orders using Stripe.

https://blog.appsignal.com/2024/09/04/integrating-stripe-into-a-one-product-django-python-shop.html
My Favorite Error Handling Technique

This video presents a surprising “Let it burn” approach to error handling, demonstrating how allowing code to fail fast can result in simpler, clearer, and more robust software. Discover the benefits of this method and its impact on improving overall code quality.

https://www.youtube.com/watch?v=YA0Wq1rcs6U
Pure Python: Build a full stack ChatGPT-like UI. Reflex, Neon Postgres. Deploy with Docker to a VM

This video tutorial demonstrates how to build a full-stack ChatGPT-like UI using Reflex, a Python framework for web development, integrating it with Neon Postgres database and OpenAI. It covers the entire process from setting up the development environment to deploying the application using Docker, GitHub Actions, and Ansible on a virtual machine.

https://www.youtube.com/watch?v=NuNaI__4xiU
nlp-zero-to-hero

A comprehensive resource for learning Natural Language Processing (NLP) from the basics to advanced topics. It contains Jupyter notebooks covering various NLP concepts, techniques, and implementations, making it a valuable guide for beginners and intermediate learners in the field of NLP.

https://github.com/JUSTSUJAY/nlp-zero-to-hero
PyPI Proxying for Docker Builds

I wanted to improve our CI system by caching PyPI data locally. I saw that there’s a project to do this, but I didn’t see any good examples actually using it.

https://www.robopenguins.com/pypi-proxy/