Why I Still Use Python Virtual Environments in Docker
The article argues for using Python virtual environments in Docker containers, citing benefits like predictability, standardization, and easier debugging. The author contends that virtual environments provide a consistent, well-understood structure for Python applications, making communication and deployment across teams more straightforward, while also simplifying Python's import behavior.
https://hynek.me/articles/docker-virtualenv/
The article argues for using Python virtual environments in Docker containers, citing benefits like predictability, standardization, and easier debugging. The author contends that virtual environments provide a consistent, well-understood structure for Python applications, making communication and deployment across teams more straightforward, while also simplifying Python's import behavior.
https://hynek.me/articles/docker-virtualenv/
Hynek Schlawack
Why I Still Use Python Virtual Environments in Docker
Whenever I publish something about my Python Docker workflows, I invariably get challenged about whether it makes sense to use virtual environments in Docker containers. As always, it’s a trade-off, and I err on the side of standards and predictability.
Maximizing Python Code Efficiency: Strategies to Overcome Common Performance Hurdles
This article talks about performance issues caused by nested loops and memory allocation issues. It provides strategies to overcome these issues while improving efficiency.
https://towardsdatascience.com/maximizing-python-code-efficiency-strategies-to-overcome-common-performance-hurdles-c6292610d785
This article talks about performance issues caused by nested loops and memory allocation issues. It provides strategies to overcome these issues while improving efficiency.
https://towardsdatascience.com/maximizing-python-code-efficiency-strategies-to-overcome-common-performance-hurdles-c6292610d785
Towards Data Science
Maximizing Python Code Efficiency: Strategies to Overcome Common Performance Hurdles | Towards Data Science
Navigating Nested Loops and Memory Challenges for Seamless Performance using Python
kotaemon
An open-source RAG-based tool for chatting with your documents.
https://github.com/Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
https://github.com/Cinnamon/kotaemon
GitHub
GitHub - Cinnamon/kotaemon: An open-source RAG-based tool for chatting with your documents.
An open-source RAG-based tool for chatting with your documents. - Cinnamon/kotaemon
Taming the beast that is the Django ORM - An introduction
The Django ORM, how it compares to raw SQL and gotchas that you should be aware of when using it
https://www.davidhang.com/blog/2024-09-01-taming-the-django-orm/
The Django ORM, how it compares to raw SQL and gotchas that you should be aware of when using it
https://www.davidhang.com/blog/2024-09-01-taming-the-django-orm/
Davidhang
Taming the beast that is the Django ORM - An introduction
The Django ORM, how it compares to raw SQL and gotchas that you should be aware of when using it
👌2
Building LLMs from the Ground Up
This tutorial guides coders through the fundamentals of large language models (LLMs), explaining how they work and how to build them from scratch in PyTorch. It covers coding a small GPT-like model, its data pipeline, architecture, pretraining, and fine-tuning using open-source libraries.
https://www.youtube.com/watch?v=quh7z1q7-uc
This tutorial guides coders through the fundamentals of large language models (LLMs), explaining how they work and how to build them from scratch in PyTorch. It covers coding a small GPT-like model, its data pipeline, architecture, pretraining, and fine-tuning using open-source libraries.
https://www.youtube.com/watch?v=quh7z1q7-uc
YouTube
Building LLMs from the Ground Up: A 3-hour Coding Workshop
REFERENCES:
1. Build an LLM from Scratch book: https://amzn.to/4fqvn0D
2. Build an LLM from Scratch repo: https://github.com/rasbt/LLMs-from-scratch
3. GitHub repository with workshop code: https://github.com/rasbt/LLM-workshop-2024
4. Lightning Studio for…
1. Build an LLM from Scratch book: https://amzn.to/4fqvn0D
2. Build an LLM from Scratch repo: https://github.com/rasbt/LLMs-from-scratch
3. GitHub repository with workshop code: https://github.com/rasbt/LLM-workshop-2024
4. Lightning Studio for…
supertree
supertree is a Python package designed to visualize decision trees in an interactive and user-friendly way within Jupyter Notebooks, Jupyter Lab, Google Colab, and any other notebooks that support HTML rendering.
https://github.com/mljar/supertree
supertree is a Python package designed to visualize decision trees in an interactive and user-friendly way within Jupyter Notebooks, Jupyter Lab, Google Colab, and any other notebooks that support HTML rendering.
https://github.com/mljar/supertree
GitHub
GitHub - mljar/supertree: Visualize decision trees in Python
Visualize decision trees in Python. Contribute to mljar/supertree development by creating an account on GitHub.
Used Python to create public-domain US maps that can serve as desktop backgrounds
https://www.reddit.com/r/Python/comments/1f29mo0/used_python_to_create_publicdomain_us_maps_that/
https://www.reddit.com/r/Python/comments/1f29mo0/used_python_to_create_publicdomain_us_maps_that/
Reddit
From the Python community on Reddit: Used Python to create public-domain US maps that can serve as desktop backgrounds
Explore this post and more from the Python community
uvtrick
A fun party trick to run Python code from another venv into this one.
https://github.com/koaning/uvtrick
A fun party trick to run Python code from another venv into this one.
https://github.com/koaning/uvtrick
GitHub
GitHub - koaning/uvtrick: A fun party trick to run Python code from another venv into this one.
A fun party trick to run Python code from another venv into this one. - GitHub - koaning/uvtrick: A fun party trick to run Python code from another venv into this one.
Multimodal Data Analysis with LLMs and Python – Tutorial
The tutorial teaches how to analyze multimodal data using Large Language Models (LLMs) and Python, covering text classification, image-based question answering, audio transcription, and creating a natural language query interface for SQL databases.
https://www.youtube.com/watch?v=3-4qAkFRpAk
The tutorial teaches how to analyze multimodal data using Large Language Models (LLMs) and Python, covering text classification, image-based question answering, audio transcription, and creating a natural language query interface for SQL databases.
https://www.youtube.com/watch?v=3-4qAkFRpAk
YouTube
Multimodal Data Analysis with LLMs and Python – Tutorial
Learn how to analyze multimodal data with LLMs and Python. This tutorial covers:
- Classifying text with LLMs
- Answering questions about images
- Transcribing audio data (speech) to text
- Building a natural language query interface over an SQL database…
- Classifying text with LLMs
- Answering questions about images
- Transcribing audio data (speech) to text
- Building a natural language query interface over an SQL database…
Lessons learnt building a real-time audio application in Python
https://www.vangemert.dev/#/blog/lessons-learnt-backlooper
https://www.vangemert.dev/#/blog/lessons-learnt-backlooper
Classifying all of the pdfs on the internet
The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accurac...
https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html
The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accurac...
https://snats.xyz/pages/articles/classifying_a_bunch_of_pdfs.html
snats.xyz
snats website
Classifying all of the pdfs on the internet
Mini-Omni
Mini-Omni is an open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
https://github.com/gpt-omni/mini-omni
Mini-Omni is an open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
https://github.com/gpt-omni/mini-omni
GitHub
GitHub - gpt-omni/mini-omni: open-source multimodal large language model that can hear, talk while thinking. Featuring real-time…
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities. - GitHub - gpt-o...
How to Create a Pre-Commit Hook
A step-by-step guide to developing your own pre-commit hook.
https://stefaniemolin.com/articles/devx/pre-commit/hook-creation-guide/
A step-by-step guide to developing your own pre-commit hook.
https://stefaniemolin.com/articles/devx/pre-commit/hook-creation-guide/
Stefanie Molin
Pre-Commit Hook Creation Guide | Stefanie Molin
Pre-commit hooks are a great way to help maintain code quality. However, some of your code quality standards may be specific to your project, and therefore, not covered by existing code linting and formatting tools. In this article, I will show you how to…
pipefunc
Lightweight function pipeline (DAG) creation in pure Python for scientific workflows.
https://github.com/pipefunc/pipefunc
Lightweight function pipeline (DAG) creation in pure Python for scientific workflows.
https://github.com/pipefunc/pipefunc
GitHub
GitHub - pipefunc/pipefunc: Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪 - pipefunc/pipefunc
Lesser known parts of Python standard library – Trickster Dev
https://www.trickster.dev/post/lesser-known-parts-of-python-standard-library/
https://www.trickster.dev/post/lesser-known-parts-of-python-standard-library/
www.trickster.dev
Lesser known parts of Python standard library – Trickster Dev
Code level discussion of web scraping, gray hat automation, growth hacking and bounty hunting
Using GPT-4o for web scraping
The article discusses using GPT-4 with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4 effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy...
https://blancas.io/blog/ai-web-scraper/
The article discusses using GPT-4 with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4 effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy...
https://blancas.io/blog/ai-web-scraper/
Eduardo Blancas
Using GPT-4o for web scraping
tl;dr; show me the demo and source code!