Self Supervised Boy – Telegram

Self Supervised Boy

@selfsupervised

160 subscribers

9 photos

56 links

Posting links to papers I read. Right now I'm mostly interested in things around LLMs, AI agents, and ML4Code. That is subject to change.

@martolod

Download Telegram

About

Blog

Apps

Platform

Self Supervised Boy

160 subscribers

Self Supervised Boy

https://arxiv.org/abs/2601.10343v1

OctoBench: Benchmarking Scaffold-Aware Instruction Following in...

Modern coding scaffolds turn LLMs into capable software agents, but their ability to follow scaffold-specified instructions remains under-examined, especially when constraints are heterogeneous...

87 views19:54

Self Supervised Boy

https://arxiv.org/abs/2601.10245v1

TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step...

Multi-step reasoning tasks like mathematical problem solving are vulnerable to cascading failures, where a single incorrect step leads to complete solution breakdown. Current LLM routing methods...

97 views19:54

Self Supervised Boy

https://arxiv.org/abs/2601.10639v1

STEM: Scaling Transformers with Embedding Modules

Fine-grained sparsity promises higher parametric capacity without proportional per-token compute, but often suffers from training instability, load balancing, and communication overhead. We...

👍1

115 views19:54

Self Supervised Boy

Forwarded from Just links

Time Horizon 1.1 https://metr.org/blog/2026-1-29-time-horizon-1-1/

Time Horizon 1.1

We’re releasing a new version of our time horizon estimates (TH1.1), using more tasks and a new eval infrastructure.

👍3

54 views14:16