microsoft / LLMLingua
To speed up LLM inference and improve the model's perception of key information, LLMLingua compresses the prompt and KV cache, achieving up to 20x compression with minimal performance loss.
https://github.com/microsoft/LLMLingua
GitHub
GitHub - microsoft/LLMLingua: [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress…
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression ...
surya
Accurate line-level text detection and recognition (OCR) in any language.
https://github.com/VikParuchuri/surya
GitHub
GitHub - datalab-to/surya: OCR, layout analysis, reading order, table recognition in 90+ languages
OCR, layout analysis, reading order, table recognition in 90+ languages - datalab-to/surya
Type information for faster Python C extensions
PyPy is an alternative implementation of the Python language. PyPy’s C API compatibility layer has some performance issues. Carl Friedrich Bolz-Tereick and I are working on a way to make PyPy’s C API interactions much faster. It’s looking very promising. Here’s a sketch of how it works.
https://bernsteinbear.com/blog/typed-c-extensions/
Max Bernstein
Update: The paper version of this post is accepted at PLDI SOAP 2024. Take a look at the preprint (PDF).
pathway
Pathway is a high-throughput, low-latency data processing framework that handles live data & streaming for you.
https://github.com/pathwaycom/pathway
GitHub
GitHub - pathwaycom/pathway: Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. - pathwaycom/pathway
Yet Another Password Reset Tutorial in Flask
Implementation of the password reset flow, with a twist.
https://freelancefootprints.substack.com/p/yet-another-password-reset-tutorial
Freelance Footprints
An Intuitive Guide to Self-attention in GPT: the Venetian Masquerade
https://twiecki.io/blog/2024/01/04
twiecki.io
while my_mcmc: gently(samples) - An Intuitive Guide to Self-Attention in GPT: The Venetian Masquerade
1 TB of RAM saved with a single line of code
Anthony Sottile shows off a small change he made at work with a huge impact and explains how it works!
https://www.youtube.com/watch?v=Hgw_RlCaIds
YouTube
ONE TERABYTE of RAM saved with a single line of code (advanced) anthony explains #566
Today I show off a small change I made at work with a huge impact and explain how it works!
- fork vs spawn https://youtu.be/hzBtsDPJ1dM
- python sucks at copy-on-write https://youtu.be/sdmcCQ7Em04
playlist: https://www.youtube.com/playlist?list=PLWBKAf8…
How well-structured should your data code be?
The post explores the tradeoff between development speed and system reliability in data science. It covers the challenges faced by those who prototype ML models: the pressure to move quickly, and the choice between handing work off and taking on the role of a Machine Learning Engineer in production.
https://blog.dagworks.io/p/how-well-structured-should-your-data
blog.dagworks.io
You need more structure than you think but less than you fear.
Lamport's Bakery Algorithm, Demonstrated in Python
https://github.com/Dicklesworthstone/bakery_algorithm
GitHub
GitHub - Dicklesworthstone/bakery_algorithm: Lamport's Bakery Algorithm Demonstrated in Python
Lamport's Bakery Algorithm Demonstrated in Python. Contribute to Dicklesworthstone/bakery_algorithm development by creating an account on GitHub.
Applio
Ultimate voice cloning tool, meticulously optimized for unrivaled power, modularity, and user-friendly experience.
https://github.com/IAHispano/Applio
GitHub
GitHub - IAHispano/Applio: A simple, high-quality voice conversion tool focused on ease of use and performance.
A simple, high-quality voice conversion tool focused on ease of use and performance. - IAHispano/Applio
NVIDIA / GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
https://github.com/NVIDIA/GenerativeAIExamples
GitHub
GitHub - NVIDIA/GenerativeAIExamples: Generative AI reference workflows optimized for accelerated infrastructure and microservice…
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. - NVIDIA/GenerativeAIExamples
Building a priority-expiry LRU cache without heaps or trees in Python
Learn how to implement a least recently used cache with priorities and expiry, using only the Python standard library.
https://death.andgravity.com/lru-cache
death and gravity
This is not interview advice: a priority-expiry LRU cache in Python without heaps or trees
Today we're implementing a least recently used cache with priorities and expiry, using only the Python standard library.
This is a bIG TEch CoDINg InTerVIEW problem, so we'll work hard to stay away from the correct™ data structures, but we'll end up with…
django-webhook
Outgoing Django webhooks triggered on model changes.
https://github.com/danihodovic/django-webhook
GitHub
GitHub - danihodovic/django-webhook: Django webhooks triggered on model changes
Django webhooks triggered on model changes. Contribute to danihodovic/django-webhook development by creating an account on GitHub.
Python Data Analysis and Visualization Course – Astronomical Data
Learn data analysis, data visualizations, and image processing techniques using real-world astronomical data. The course offers a practical, hands-on approach to simplifying complex concepts in data analysis, making it great for beginners.
https://www.youtube.com/watch?v=H9KefzbryEw
YouTube
Code: https:…