Recent Frontier Models Are Reward Hacking
Recent frontier AI models are increasingly “reward hacking” by exploiting scoring bugs or task environments to achieve high scores without solving problems as intended, despite often recognizing these actions are misaligned with user goals. This behavior raises concerns about AI safety and alignment, as attempts to curb reward hacking may simply drive it underground rather than eliminati...
https://metr.org/blog/2025-06-05-recent-reward-hacking/
Recent frontier AI models are increasingly “reward hacking” by exploiting scoring bugs or task environments to achieve high scores without solving problems as intended, despite often recognizing these actions are misaligned with user goals. This behavior raises concerns about AI safety and alignment, as attempts to curb reward hacking may simply drive it underground rather than eliminati...
https://metr.org/blog/2025-06-05-recent-reward-hacking/
metr.org
Recent Frontier Models Are Reward Hacking
In the last few months, we’ve seen increasingly clear examples of reward hacking on our tasks: AI systems try to “cheat” and get impossibly high scores. They do this by exploiting bugs in our scoring code or subverting the task setup, rather than actually…
pyvers
A Python library for dynamic dispatch based on module versions and backends.
https://github.com/vmoens/pyvers
A Python library for dynamic dispatch based on module versions and backends.
https://github.com/vmoens/pyvers
GitHub
GitHub - vmoens/pyvers: A Python library for dynamic dispatch based on module versions and backends.
A Python library for dynamic dispatch based on module versions and backends. - vmoens/pyvers
How fast can the RPython GC allocate?
https://pypy.org/posts/2025/06/rpython-gc-allocation-speed.html
https://pypy.org/posts/2025/06/rpython-gc-allocation-speed.html
PyPy
How fast can the RPython GC allocate?
While working on a paper about allocation profiling in
VMProf I got curious
about how quickly the RPython GC can allocate an object. I wrote a small
RPython benchmark program to get an idea of the ord
VMProf I got curious
about how quickly the RPython GC can allocate an object. I wrote a small
RPython benchmark program to get an idea of the ord
Archon
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
https://github.com/coleam00/Archon
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents.
https://github.com/coleam00/Archon
GitHub
GitHub - coleam00/Archon: Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow…
Archon is an AI agent that is able to create other AI agents using an advanced agentic coding workflow and framework knowledge base to unlock a new frontier of automated agents. - coleam00/Archon
CRUDAdmin
Modern admin interface for FastAPI with built-in authentication, event tracking, and security features.
https://github.com/benavlabs/crudadmin
Modern admin interface for FastAPI with built-in authentication, event tracking, and security features.
https://github.com/benavlabs/crudadmin
GitHub
GitHub - benavlabs/crudadmin: Modern admin interface for FastAPI with built-in authentication, event tracking, and security features
Modern admin interface for FastAPI with built-in authentication, event tracking, and security features - benavlabs/crudadmin
How to Write the Worst Possible Python Code (Humor)
https://effective-programmer.com/how-to-write-the-worst-possible-python-code-8c6e49816e90?sk=d06d4241ce97a51a969fbce67070f8ba
https://effective-programmer.com/how-to-write-the-worst-possible-python-code-8c6e49816e90?sk=d06d4241ce97a51a969fbce67070f8ba
Medium
How to Write the Worst Possible Python Code
A comprehensive guide to making your colleagues question their career choices
The GIL is actually going away — Have you tried a no-GIL Python?
https://www.reddit.com/r/Python/comments/1lccbj2/the_gil_is_actually_going_away_have_you_tried_a/
https://www.reddit.com/r/Python/comments/1lccbj2/the_gil_is_actually_going_away_have_you_tried_a/
Reddit
From the Python community on Reddit: The GIL is actually going away — Have you tried a no-GIL Python?
Explore this post and more from the Python community
Premier
A Flexible, Lightweight API-Gateway written in python that can be used as an ASGI middleware, app, or decorators.
https://github.com/raceychan/premier
A Flexible, Lightweight API-Gateway written in python that can be used as an ASGI middleware, app, or decorators.
https://github.com/raceychan/premier
GitHub
GitHub - raceychan/premier: A Flexible, Lightweight API-Gateway written in python that can be used as an ASGI middleware, app,…
A Flexible, Lightweight API-Gateway written in python that can be used as an ASGI middleware, app, or decorators. - raceychan/premier
The fastest way to detect a vowel in a string
The author explores 11 different methods for detecting vowels in a string using Python, benchmarking their performance and analyzing their underlying implementation, including Python bytecode and regex internals. The results show that for short strings, a simple loop is fastest, but for longer strings, regex-based approaches outperform others due to their optimized C-level implementation...
https://austinhenley.com/blog/vowels.html
The author explores 11 different methods for detecting vowels in a string using Python, benchmarking their performance and analyzing their underlying implementation, including Python bytecode and regex internals. The results show that for short strings, a simple loop is fastest, but for longer strings, regex-based approaches outperform others due to their optimized C-level implementation...
https://austinhenley.com/blog/vowels.html
Austinhenley
The fastest way to detect a vowel in a string
Diving into CPython, bytecode, regex, and algorithmic analysis to find the fastest method.
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
https://github.com/ML-GSAI/LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
https://github.com/ML-GSAI/LLaDA
GitHub
GitHub - ML-GSAI/LLaDA: Official PyTorch implementation for "Large Language Diffusion Models"
Official PyTorch implementation for "Large Language Diffusion Models" - ML-GSAI/LLaDA
Programming Language Design in the Era of LLMs: A Return to Mediocrity?
The article argues that the rise of LLMs is making it less appealing to design new domain-specific languages (DSLs), since LLMs excel at generating code in popular languages like Python but struggle with niche DSLs. It explores how language designers might adapt by teaching LLMs about DSLs, integrating informal and formal workflows, and focusing on verified specification languages, but w...
https://kirancodes.me/posts/log-lang-design-llms.html
The article argues that the rise of LLMs is making it less appealing to design new domain-specific languages (DSLs), since LLMs excel at generating code in popular languages like Python but struggle with niche DSLs. It explores how language designers might adapt by teaching LLMs about DSLs, integrating informal and formal workflows, and focusing on verified specification languages, but w...
https://kirancodes.me/posts/log-lang-design-llms.html
kirancodes.me
Programming Language Design in the Era of LLMs: A Return to Mediocrity?
Python can run Mojo now
The post explores how Python can now call Mojo code, offering a promising way to speed up Python functions with a simple compiled language. While still early and showing some rough edges like overflow issues, Mojo demonstrates significant performance gains in examples like prime counting, making it an exciting tool for Python developers seeking faster execution.
https://koaning.io/posts/giving-mojo-a-spin/
The post explores how Python can now call Mojo code, offering a promising way to speed up Python functions with a simple compiled language. While still early and showing some rough edges like overflow issues, Mojo demonstrates significant performance gains in examples like prime counting, making it an exciting tool for Python developers seeking faster execution.
https://koaning.io/posts/giving-mojo-a-spin/
koaning.io
Python can run Mojo now
Chris Lattner mentioned that Python can actually call Mojo code now. I love this idea (!) as I'm definitely in the market for a simple compiled language that can offer Python some really fast functions.
dnsimg - storing images in txt records
The author experiments with storing images in DNS TXT records by converting image data to hex, splitting it into 2048-character chunks, and creating a protocol-like method for retrieval and reconstruction. The process demonstrates both the feasibility and practical limitations of this approach, including DNS record size constraints and the need for custom scripts to upload, fetch, and re...
https://asherfalcon.com/blog/posts/2
The author experiments with storing images in DNS TXT records by converting image data to hex, splitting it into 2048-character chunks, and creating a protocol-like method for retrieval and reconstruction. The process demonstrates both the feasibility and practical limitations of this approach, including DNS record size constraints and the need for custom scripts to upload, fetch, and re...
https://asherfalcon.com/blog/posts/2
Asherfalcon
Asher Falcon
Asher Falcon's personal website - Software engineer and student
miniDiffusion
A reimplementation of Stable Diffusion 3.5 in pure PyTorch.
https://github.com/yousef-rafat/miniDiffusion
A reimplementation of Stable Diffusion 3.5 in pure PyTorch.
https://github.com/yousef-rafat/miniDiffusion
GitHub
GitHub - yousef-rafat/miniDiffusion: A reimplementation of Stable Diffusion 3.5 in pure PyTorch
A reimplementation of Stable Diffusion 3.5 in pure PyTorch - yousef-rafat/miniDiffusion
Is uvloop still faster than asyncio's event loop in python3.13?
https://www.reddit.com/r/Python/comments/1l8fwu1/is_uvloop_still_faster_than_asyncios_event_loop/
https://www.reddit.com/r/Python/comments/1l8fwu1/is_uvloop_still_faster_than_asyncios_event_loop/
Reddit
From the Python community on Reddit
Explore this post and more from the Python community
MiniMax-AI
The world's first open-weight, large-scale hybrid-attention reasoning model.
https://github.com/MiniMax-AI/MiniMax-M1
The world's first open-weight, large-scale hybrid-attention reasoning model.
https://github.com/MiniMax-AI/MiniMax-M1
GitHub
GitHub - MiniMax-AI/MiniMax-M1: MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. - MiniMax-AI/MiniMax-M1