Lessons learnt building a real-time audio application in Python
https://www.vangemert.dev/#/blog/lessons-learnt-backlooper
Classifying all of the pdfs on the internet
The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accurac...
https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html
Mini-Omni
Mini-Omni is an open-source multimodal large language model that can hear and talk while thinking. It features real-time, end-to-end speech input and streaming audio output for conversation.
https://github.com/gpt-omni/mini-omni
How to Create a Pre-Commit Hook
A step-by-step guide to developing your own pre-commit hook.
https://stefaniemolin.com/articles/devx/pre-commit/hook-creation-guide/
Stefanie Molin
Pre-commit hooks are a great way to help maintain code quality. However, some of your code quality standards may be specific to your project, and therefore, not covered by existing code linting and formatting tools. In this article, I will show you how to…
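As a rough illustration of what a project-specific hook can look like (this is not the code from the linked guide): pre-commit passes the staged filenames to the hook as command-line arguments and treats a non-zero exit status as a failure. The rule checked here (a leftover `TODO` marker) and the names `MARKER`, `find_violations` and `main` are hypothetical, chosen only for the sketch.

```python
import argparse
from pathlib import Path
from typing import Optional, Sequence

MARKER = "TODO"  # hypothetical project-specific rule: flag leftover TODO markers


def find_violations(path: Path) -> list[int]:
    """Return the 1-based line numbers in `path` that contain MARKER."""
    text = path.read_text(encoding="utf-8", errors="replace")
    return [n for n, line in enumerate(text.splitlines(), start=1) if MARKER in line]


def main(argv: Optional[Sequence[str]] = None) -> int:
    """Check every file given on the command line; return 1 if any fails."""
    parser = argparse.ArgumentParser(description="Hypothetical TODO check")
    parser.add_argument("filenames", nargs="*", help="files passed in by pre-commit")
    args = parser.parse_args(argv)

    failed = False
    for name in args.filenames:
        for lineno in find_violations(Path(name)):
            print(f"{name}:{lineno}: found {MARKER}")
            failed = True
    # A non-zero return value becomes the exit status that blocks the commit.
    return 1 if failed else 0
```

In a real hook, `main` would typically be exposed as a console-script entry point whose return value is used as the process exit status.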
pipefunc
Lightweight function pipeline (DAG) creation in pure Python for scientific workflows.
https://github.com/pipefunc/pipefunc
Lesser known parts of Python standard library – Trickster Dev
https://www.trickster.dev/post/lesser-known-parts-of-python-standard-library/
Code level discussion of web scraping, gray hat automation, growth hacking and bounty hunting
Using GPT-4o for web scraping
The article discusses using GPT-4o with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4o effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy...
https://blancas.io/blog/ai-web-scraper/
Eduardo Blancas
Integrating Stripe Into A One-Product Django Python Shop
In the first part of this series, we created a Django online shop with htmx. In this second part, we'll handle orders using Stripe.
https://blog.appsignal.com/2024/09/04/integrating-stripe-into-a-one-product-django-python-shop.html
My Favorite Error Handling Technique
This video presents a surprising “Let it burn” approach to error handling, demonstrating how allowing code to fail fast can result in simpler, clearer, and more robust software. Discover the benefits of this method and its impact on improving overall code quality.
https://www.youtube.com/watch?v=YA0Wq1rcs6U
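To make the fail-fast idea concrete, here is a small sketch (not taken from the video) that contrasts a defensive function, which swallows every error and returns a default, with a version that simply lets bad input raise. The function names, the `port` key and the 8080 default are all hypothetical.

```python
import json


def load_port_defensive(raw: str) -> int:
    """Swallow every error and fall back to a default, hiding bad input."""
    try:
        return int(json.loads(raw)["port"])
    except Exception:
        return 8080  # the caller never learns that the config was broken


def load_port_fail_fast(raw: str) -> int:
    """Let malformed input raise, so the failure surfaces at its source."""
    return int(json.loads(raw)["port"])
```

With a misspelled key, the defensive version silently returns the default, while the fail-fast version raises a `KeyError` that points straight at the broken input.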
Pure Python: Build a full stack ChatGPT-like UI. Reflex, Neon Postgres. Deploy with Docker to a VM
This video tutorial demonstrates how to build a full-stack ChatGPT-like UI using Reflex, a Python framework for web development, integrating it with Neon Postgres database and OpenAI. It covers the entire process from setting up the development environment to deploying the application using Docker, GitHub Actions, and Ansible on a virtual machine.
https://www.youtube.com/watch?v=NuNaI__4xiU
Topics:
✅ Full Stack Web Development in Pure Python with Reflex
✅ Integrate Neon Database with Reflex
✅ Seamless integration with frontend and backend
✅ Using SQLModel and Reflex's built-in support for…
kazam
Linux Screen Recorder, Broadcaster, Capture and OCR with AI in mind.
https://github.com/henrywoo/kazam
cookiecutter-uv
A modern cookiecutter template for Python projects that use uv for dependency management.
https://github.com/fpgmaas/cookiecutter-uv
Shades of testing HTTP requests in Python
The post discusses various approaches to testing HTTP requests in Python applications, focusing on mocking external API calls during unit and integration testing.
https://rednafi.com/python/testing_http_requests/
Redowan's Reflections
Here’s a Python snippet that makes an HTTP POST request:
    # script.py
    import httpx
    from typing import Any


    async def make_request(url: str) -> dict[str, Any]:
        headers = {"Content-Type": "application/json"}
        async with httpx.AsyncClient(headers=headers)…
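One of the simplest ways to test such a request is to patch the call that touches the network. The sketch below is not the post's code: to stay dependency-free it swaps httpx for the synchronous `urllib.request` from the standard library and uses `unittest.mock.patch`. The `fetch_json` helper and the `example.invalid` URL are hypothetical.

```python
import json
import urllib.request
from typing import Any
from unittest.mock import MagicMock, patch


def fetch_json(url: str) -> dict[str, Any]:
    """POST an empty JSON body to `url` and decode the JSON response."""
    request = urllib.request.Request(
        url,
        data=b"{}",
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())


def test_fetch_json_without_network() -> None:
    # urlopen() returns a context manager, so the fake response must act as one.
    fake_response = MagicMock()
    fake_response.read.return_value = b'{"ok": true}'
    fake_response.__enter__.return_value = fake_response

    with patch("urllib.request.urlopen", return_value=fake_response) as mocked:
        assert fetch_json("https://example.invalid/api") == {"ok": True}

    # Nothing left the process; inspect the request that would have been sent.
    sent = mocked.call_args.args[0]
    assert sent.get_method() == "POST"
    assert sent.get_header("Content-type") == "application/json"
```

Patching at this level keeps the test fast and offline, at the cost of coupling it to the HTTP library the code happens to use.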
nlp-zero-to-hero
A comprehensive resource for learning Natural Language Processing (NLP) from the basics to advanced topics. It contains Jupyter notebooks covering various NLP concepts, techniques, and implementations, making it a valuable guide for beginners and intermediate learners in the field of NLP.
https://github.com/JUSTSUJAY/nlp-zero-to-hero
PyPI Proxying for Docker Builds
I wanted to improve our CI system by caching PyPI data locally. I saw that there’s a project to do this, but I didn’t see any good examples actually using it.
https://www.robopenguins.com/pypi-proxy/
Multiversion Python Thoughts
A braindump on how to make multi version in Python work.
https://lucumr.pocoo.org/2024/9/9/multiversion-python/
Armin Ronacher's Thoughts and Writings