Papers.Data.Code | Digests
7 subscribers
4 links
The ML top, not the feed. Highlights, trends & TL;DRs. One post a day. ๐Ÿ“ˆ
papers.data.code@gmail.com
Download Telegram
Channel photo updated
Channel name was changed to ยซPapers.Data.Code | Digestsยป
Top ML Papers ยท May 11 โ€“ May 17
#Paper #WeeklyDigest #W20_2026
From weekly rankings ยท through May 17

This week in top
โ€ข Any-step video diffusion
โ€ข On-policy flow distillation
โ€ข NEO-unify multimodal model

๐Ÿฅ‡ AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
Yuchao Gu, Guian Fang, Yuxin Jiang et al.
Unlike fixed-step distillation, it learns arbitrary-time video flow maps and tops 14B 4-NFE VBench.
โ†’ Full breakdown

๐Ÿฅˆ Flow-OPD: On-Policy Distillation for Flow Matching Models
Zhen Fang, Wenxuan Huang, Yu Zeng et al.
It brings on-policy multi-teacher distillation to flow matching, boosting SD3.5 GenEval 63โ†’92 and OCR 59โ†’94.
โ†’ Full breakdown

๐Ÿฅ‰ SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Haiwen Diao, Penghao Wu, Hanming Deng et al.
Unlike encoder-plus-VAE hybrids, it jointly does understanding and pixel-space generation in one backbone at 32ร— compression.
โ†’ Full breakdown

โžก๏ธ Daily ML signals โ†’ Papers.Data.Code
via @Papers.Data.Code | Digests
Top ML Repos ยท May 11 โ€“ May 17
#Repo #WeeklyDigest #W20_2026
From weekly rankings ยท through May 17

This week in top
โ€ข Metal local inference
โ€ข Program reconstruction benchmark
โ€ข Self-supervised IMU odometry

๐Ÿฅ‡ antirez/ds4
Unlike recent local DeepSeek ports, it adds disk-persistent KV cache for 1M-token contexts.
โ†’ Full breakdown

๐Ÿฅˆ facebookresearch/ProgramBench
Targets black-box reverse engineering: rebuilding full codebases from binaries, docs, and tests.
โ†’ Full breakdown

๐Ÿฅ‰ sparolab/KISS-IMU
Uses LiDAR-odometry pseudo-labels with motion-balanced sampling to learn IMU odometry self-supervised.
โ†’ Full breakdown

โžก๏ธ Daily ML signals โ†’ Papers.Data.Code
via @Papers.Data.Code | Digests
Top ML Datasets ยท May 11 โ€“ May 17
#Dataset #WeeklyDigest #W20_2026
From weekly rankings ยท through May 17

This week in top
โ€ข Global AI panel dataset
โ€ข Permissive image corpus
โ€ข Global hantavirus dataset

๐Ÿฅ‡ AI Index Data: Growth, Talent (Cambridge/Harvard)
It uniquely harmonizes 259,546 verified AI indicators across 227 countries from 1998โ€“2025.
โ†’ Full breakdown

๐Ÿฅˆ stanford-vision-lab/giant-permissive-image-corpus
Unlike most recent datasets, it offers 100M high-quality images under fully permissive licensing.
โ†’ Full breakdown

๐Ÿฅ‰ ๐Ÿฆ  Hantavirus (Andes Virus) โ€” Global Epidemiology
Links epidemiology, clinical outcomes, environmental risks, and strain data across 25 countries from 1993โ€“2025.
โ†’ Full breakdown

โžก๏ธ Daily ML signals โ†’ Papers.Data.Code
via @Papers.Data.Code | Digests
๐Ÿ“‹ ML Weekly Recap ยท May 11 โ€“ May 17
#Recap #WeeklyDigest #W20_2026

This week in top
โ€ข Any-step video diffusion ยท ๐Ÿ”— Paper
โ€ข Metal local inference ยท ๐Ÿ”— Repo
โ€ข Global AI panel dataset ยท ๐Ÿ”— Dataset

โšก Trends

โ–ธ On-policy distillation accelerates flow and diffusion generation with stronger few-step quality
โ–ธ Test-time reasoning shifts toward agentic control, multi-agent search, and reusable memory
โ–ธ Unified or efficient multimodal generation targets native pixels, high resolution, and bounded compute

๐Ÿงญ TL;DR

๐Ÿ“„ SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Haiwen Diao, Penghao Wu, Hanming Deng et al.
Unified multimodal understanding and pixel generation in one end-to-end architecture

โญ SwiftI2V
Practical 2K image-to-video generation with 202ร— less GPU-time

๐Ÿ’ก Efficiency and unification are driving multimodal generation and reasoning forward.

โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€
โžก๏ธ Daily ML signals โ†’ Papers.Data.Code
via @Papers.Data.Code | Digests