Top ML Papers · May 11 – May 17
#Paper #WeeklyDigest #W20_2026
From weekly rankings · through May 17
This week in top
• Any-step video diffusion
• On-policy flow distillation
• NEO-unify multimodal model
🥇 AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
Yuchao Gu, Guian Fang, Yuxin Jiang et al.
Unlike fixed-step distillation, it learns arbitrary-time video flow maps and tops VBench among 14B models at 4 NFE.
→ Full breakdown
🥈 Flow-OPD: On-Policy Distillation for Flow Matching Models
Zhen Fang, Wenxuan Huang, Yu Zeng et al.
It brings on-policy multi-teacher distillation to flow matching, boosting SD3.5 GenEval from 63 to 92 and OCR from 59 to 94.
→ Full breakdown
🥉 SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Haiwen Diao, Penghao Wu, Hanming Deng et al.
Unlike encoder-plus-VAE hybrids, it jointly does understanding and pixel-space generation in one backbone at 32× compression.
→ Full breakdown
➡️ Daily ML signals – Papers.Data.Code
via @Papers.Data.Code | Digests
Top ML Repos · May 11 – May 17
#Repo #WeeklyDigest #W20_2026
From weekly rankings · through May 17
This week in top
• Metal local inference
• Program reconstruction benchmark
• Self-supervised IMU odometry
🥇 antirez/ds4
Unlike recent local DeepSeek ports, it adds disk-persistent KV cache for 1M-token contexts.
→ Full breakdown
🥈 facebookresearch/ProgramBench
Targets black-box reverse engineering: rebuilding full codebases from binaries, docs, and tests.
→ Full breakdown
🥉 sparolab/KISS-IMU
Uses LiDAR-odometry pseudo-labels with motion-balanced sampling for self-supervised IMU odometry learning.
→ Full breakdown
Top ML Datasets · May 11 – May 17
#Dataset #WeeklyDigest #W20_2026
From weekly rankings · through May 17
This week in top
• Global AI panel dataset
• Permissive image corpus
• Global hantavirus dataset
🥇 AI Index Data: Growth, Talent (Cambridge/Harvard)
It uniquely harmonizes 259,546 verified AI indicators across 227 countries from 1998 to 2025.
→ Full breakdown
🥈 stanford-vision-lab/giant-permissive-image-corpus
Unlike most recent datasets, it offers 100M high-quality images under fully permissive licensing.
→ Full breakdown
🥉 🦠 Hantavirus (Andes Virus) – Global Epidemiology
Links epidemiology, clinical outcomes, environmental risks, and strain data across 25 countries from 1993 to 2025.
→ Full breakdown
ML Weekly Recap · May 11 – May 17
#Recap #WeeklyDigest #W20_2026
This week in top
• Any-step video diffusion · Paper
• Metal local inference · Repo
• Global AI panel dataset · Dataset
⚡ Trends
▸ On-policy distillation accelerates flow and diffusion generation with stronger few-step quality
▸ Test-time reasoning shifts toward agentic control, multi-agent search, and reusable memory
▸ Unified or efficient multimodal generation targets native pixels, high resolution, and bounded compute
🧭 TL;DR
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
Haiwen Diao, Penghao Wu, Hanming Deng et al.
Unified multimodal understanding and pixel generation in one end-to-end architecture
⭐ SwiftI2V
Practical 2K image-to-video generation with 202× less GPU-time
💡 Efficiency and unification are driving multimodal generation and reasoning forward.
