PythonHub
2.4K subscribers
2.35K photos
49.1K links
News & links about Python programming.
https://pythonhub.dev/
Download Telegram
FineWeb: decanting the web for the finest text data at scale

The article introduces FineWeb, a Hugging Face project aimed at extracting high-quality text data from the web at scale. It highlights the methodology and tools used to ensure the gathered data is of the finest quality for training AI models, emphasizing the importance of clean and relevant text data for machine learning applications.

https://huggingface.co/spaces/HuggingFaceFW/blogpost-fineweb-v1
Next-generation web framework Teo, supports Node.js, Python and Rust

https://teodev.io
Designing data loaders in Python classes

If you are designing an API that loads data into a standard format, consider the from and to syntax to describe data loaders. from data loaders should read data and serialise it into a class for processing, and to data loaders should convert data to another format, or save data to disk.

https://jamesg.blog/2024/06/04/python-dataloaders-/
`bytes`: The Lesser-Known Python Built-In Sequence • And Understanding UTF-8 Encoding

The bytes data type looks a bit like a string, but it isn't a string. Let's explore it and also look at the main Unicode encoding, UTF-8.

https://www.thepythoncodingstack.com/p/bytes-python-built-in-unicode-utf-8-encoding
1
Statically Typed Functional Programming with Python 3.12

The article demonstrates how to use Python 3.12's new type alias and union type syntax to define a simple expression-based language with dataclasses, and then implements an evaluator function using pattern matching to evaluate expressions in that language.

https://wickstrom.tech/2024-05-23-statically-typed-functional-programming-python-312.html
Python at the Speed of Julia

The article explores achieving Python's ease of use with Julia's speed, illustrating how to optimize Python code using Julia for high-performance computing tasks. It provides examples and insights on integrating both languages to enhance computational efficiency without sacrificing Python's simplicity.

https://glassnotebook.io/r/dxJTYbJBmPR1X3NQUfwXB/99_python_at_the_speed_of_julia.jl