π Paper #Paper #CV #DiffusionDistillation #TextToImageGeneration
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation
π€ Tao Liu, Hao Yan, Mengting Chen et al.
π― Task
Few-step text-to-image diffusion distillation
π‘ Idea
Continuous-time distribution matching with dynamic random-length schedules and velocity-based off-trajectory matching. It supervises arbitrary times, not fixed anchors, to reduce drift and preserve fine details without GAN or reward losses.
β¨ Why it's interesting
At 4 NFE on SD3-Medium, it reaches HPSv3 9.561 vs 9.176 for D-DMD.
π» Repo
β byliutao/cdm β 77 stars
π paper
via @Papers.Data.Code
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation
π€ Tao Liu, Hao Yan, Mengting Chen et al.
π― Task
Few-step text-to-image diffusion distillation
π‘ Idea
Continuous-time distribution matching with dynamic random-length schedules and velocity-based off-trajectory matching. It supervises arbitrary times, not fixed anchors, to reduce drift and preserve fine details without GAN or reward losses.
β¨ Why it's interesting
At 4 NFE on SD3-Medium, it reaches HPSv3 9.561 vs 9.176 for D-DMD.
π» Repo
β byliutao/cdm β 77 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - byliutao/CDM: Continuous-Time Distribution Matching for Few-Step Diffusion Distillationπ
Continuous-Time Distribution Matching for Few-Step Diffusion Distillationπ - byliutao/CDM
π Dataset #Dataset #CV #ImageGeneration #PermissiveLicense
giant-permissive-image-corpus
π€ stanford-vision-lab
π― Task
Visual generation
π‘ Idea
100M high-quality, diverse images in a fully permissive image corpus for visual generation.
β¨ Why it's interesting
Fully permissive 100M-image scale supports training and studying visual generation without restrictive licensing.
Size: 100M images
Downloads: 86 | Likes: 3
π dataset
via @Papers.Data.Code
giant-permissive-image-corpus
π€ stanford-vision-lab
π― Task
Visual generation
π‘ Idea
100M high-quality, diverse images in a fully permissive image corpus for visual generation.
β¨ Why it's interesting
Fully permissive 100M-image scale supports training and studying visual generation without restrictive licensing.
Size: 100M images
Downloads: 86 | Likes: 3
π dataset
via @Papers.Data.Code
π» Repo #Repo #LLM #Benchmark #SoftwareEngineering
Program Bench
π€ facebookresearch
π― Task
Program reconstruction benchmark
π‘ Idea
Evaluates LM-based software agents on recreating complete codebases that match an original program's behavior using only binaries, docs, and test suites.
β¨ Why it's interesting
Provides a black-box benchmark for full-program reverse engineering by LM agents.
π» Repo
β facebookresearch/ProgramBench β 390 stars (+278 3d)
Python
via @Papers.Data.Code
Program Bench
π€ facebookresearch
π― Task
Program reconstruction benchmark
π‘ Idea
Evaluates LM-based software agents on recreating complete codebases that match an original program's behavior using only binaries, docs, and test suites.
β¨ Why it's interesting
Provides a black-box benchmark for full-program reverse engineering by LM agents.
π» Repo
β facebookresearch/ProgramBench β 390 stars (+278 3d)
Python
via @Papers.Data.Code
GitHub
GitHub - facebookresearch/ProgramBench: Can Language Models Rebuild Programs From Scratch?
Can Language Models Rebuild Programs From Scratch? - facebookresearch/ProgramBench
π Paper #Paper #Multimodal #TextToImage #KnowledgeDistillation
Flow-OPD: On-Policy Distillation for Flow Matching Models
π€ Zhen Fang, Wenxuan Huang, Yu Zeng et al.
π― Task
Text-to-image model alignment
π‘ Idea
On-policy multi-teacher distillation for flow matching: route each sampled trajectory to a task-specific teacher for dense velocity-field supervision, then use manifold anchor regularization to keep outputs on a high-quality visual manifold.
β¨ Why it's interesting
On SD 3.5 Medium, GenEval rises 63β92 and OCR 59β94, about 10 points over GRPO.
π» Repo
β CostaliyA/Flow-OPD β 80 stars
π paper
via @Papers.Data.Code
Flow-OPD: On-Policy Distillation for Flow Matching Models
π€ Zhen Fang, Wenxuan Huang, Yu Zeng et al.
π― Task
Text-to-image model alignment
π‘ Idea
On-policy multi-teacher distillation for flow matching: route each sampled trajectory to a task-specific teacher for dense velocity-field supervision, then use manifold anchor regularization to keep outputs on a high-quality visual manifold.
β¨ Why it's interesting
On SD 3.5 Medium, GenEval rises 63β92 and OCR 59β94, about 10 points over GRPO.
π» Repo
β CostaliyA/Flow-OPD β 80 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - CostaliyA/Flow-OPD: Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"
Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models" - CostaliyA/Flow-OPD
π Paper #Paper #LLM #TestTimeScaling #Reasoning
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
π€ Tong Zheng, Haolin Liu, Chengsong Huang et al.
π― Task
Test-time scaling for LLM reasoning
π‘ Idea
Offline replay controller synthesis for width-depth TTS: an agent learns branch, continue, probe, prune, and stop rules from pre-collected trajectories, using single-Ξ² parameterization and execution-trace feedback.
β¨ Why it's interesting
Discovery costs $39.9/160 min and improves accuracy-cost tradeoffs over hand-crafted baselines.
π» Repo
β zhengkid/AutoTTS β 43 stars
π paper
via @Papers.Data.Code
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
π€ Tong Zheng, Haolin Liu, Chengsong Huang et al.
π― Task
Test-time scaling for LLM reasoning
π‘ Idea
Offline replay controller synthesis for width-depth TTS: an agent learns branch, continue, probe, prune, and stop rules from pre-collected trajectories, using single-Ξ² parameterization and execution-trace feedback.
β¨ Why it's interesting
Discovery costs $39.9/160 min and improves accuracy-cost tradeoffs over hand-crafted baselines.
π» Repo
β zhengkid/AutoTTS β 43 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - zhengkid/AutoTTS: The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling"
The offical repo for "LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling" - zhengkid/AutoTTS
π Dataset #Dataset #TimeSeries #GlobalAI #CountryIndicators
AI Index Data: Growth, Talent (Cambridge/Harvard)
π€ patelris
π― Task
Global AI readiness and growth analysis
π‘ Idea
259,546 verified rows in tidy long format across 24,453 indicators, 227 countries and territories, and 8 source systems spanning benchmarks, talent, infrastructure, governance, patents, skills, and GovTech.
β¨ Why it's interesting
Harmonized multi-source panel data enables cross-country trend, clustering, and forecasting analyses over 27 years.
Size: 259,546 observations, 24,453 indicators
Downloads: 242 | Likes: 28
π dataset
via @Papers.Data.Code
AI Index Data: Growth, Talent (Cambridge/Harvard)
π€ patelris
π― Task
Global AI readiness and growth analysis
π‘ Idea
259,546 verified rows in tidy long format across 24,453 indicators, 227 countries and territories, and 8 source systems spanning benchmarks, talent, infrastructure, governance, patents, skills, and GovTech.
β¨ Why it's interesting
Harmonized multi-source panel data enables cross-country trend, clustering, and forecasting analyses over 27 years.
Size: 259,546 observations, 24,453 indicators
Downloads: 242 | Likes: 28
π dataset
via @Papers.Data.Code
Kaggle
AI Index Data: Growth, Talent (Cambridge/Harvard)
259K observations across 24K+ AI metrics from Cambridge/Harvard
π Paper #Paper #LLM #ReinforcementLearning #PostTraining
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex
π€ Yun Qu, Qi Wang, Yixiu Mao et al.
π― Task
LLM post-training with verifiable rewards
π‘ Idea
Explicit target-projection on the response simplex: build a closed-form Gibbs target over sampled responses, then project the policy to it with forward or reverse KL instead of implicit group-based policy gradients.
β¨ Why it's interesting
Across reasoning tasks and LLM backbones, LPO consistently beats matched PG baselines in Pass@1/Pass@k.
π paper
via @Papers.Data.Code
Listwise Policy Optimization: Group-based RLVR as Target-Projection on the LLM Response Simplex
π€ Yun Qu, Qi Wang, Yixiu Mao et al.
π― Task
LLM post-training with verifiable rewards
π‘ Idea
Explicit target-projection on the response simplex: build a closed-form Gibbs target over sampled responses, then project the policy to it with forward or reverse KL instead of implicit group-based policy gradients.
β¨ Why it's interesting
Across reasoning tasks and LLM backbones, LPO consistently beats matched PG baselines in Pass@1/Pass@k.
π paper
via @Papers.Data.Code
arXiv.org
Listwise Policy Optimization: Group-based RLVR as...
Reinforcement learning with verifiable rewards (RLVR) has become a standard approach for large language models (LLMs) post-training to incentivize reasoning capacity. Among existing recipes,...
π» Repo #Repo #Robotics #InertialOdometry #SelfSupervised
Kiss Imu
π€ sparolab
π― Task
Self-supervised inertial odometry
π‘ Idea
Train an IMU odometry model from raw IMU plus LiDAR-odometry pseudo-labels, using motion-balanced sampling and a frequency gate to better cover under-represented motion regimes.
β¨ Why it's interesting
Handles under-represented motion regimes during training via motion-balanced sampling and frequency gating.
π» Repo
β sparolab/KISS-IMU β 63 stars (+43 3d)
Python
π paper
via @Papers.Data.Code
Kiss Imu
π€ sparolab
π― Task
Self-supervised inertial odometry
π‘ Idea
Train an IMU odometry model from raw IMU plus LiDAR-odometry pseudo-labels, using motion-balanced sampling and a frequency gate to better cover under-represented motion regimes.
β¨ Why it's interesting
Handles under-represented motion regimes during training via motion-balanced sampling and frequency gating.
π» Repo
β sparolab/KISS-IMU β 63 stars (+43 3d)
Python
π paper
via @Papers.Data.Code
GitHub
GitHub - sparolab/KISS-IMU: KISS-IMU: Self-supervised Inertial Odometry with Motion-balanced Learning and Uncertainty-aware Inference.β¦
KISS-IMU: Self-supervised Inertial Odometry with Motion-balanced Learning and Uncertainty-aware Inference. @ ICRA'26 Award Finalist - sparolab/KISS-IMU
π Paper #Paper #LLM #TestTimeScaling #MultiAgentReasoning
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy
π€ George Wu, Nan Jing, Qing Yi et al.
π― Task
Test-time scaling for LLM reasoning
π‘ Idea
Multi-agent inference with hierarchical memories: an experience bank stores reliable intermediate conclusions and feedback, while a guideline bank tracks explored strategies to avoid redundancy; hybrid-reward RL trains correctness, memory use, and novel exploration.
β¨ Why it's interesting
On challenging reasoning benchmarks, it shows stronger iterative scaling than prior TTS baselines.
π» Repo
β george-QF/TMAS-code β 4 stars
π paper
via @Papers.Data.Code
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy
π€ George Wu, Nan Jing, Qing Yi et al.
π― Task
Test-time scaling for LLM reasoning
π‘ Idea
Multi-agent inference with hierarchical memories: an experience bank stores reliable intermediate conclusions and feedback, while a guideline bank tracks explored strategies to avoid redundancy; hybrid-reward RL trains correctness, memory use, and novel exploration.
β¨ Why it's interesting
On challenging reasoning benchmarks, it shows stronger iterative scaling than prior TTS baselines.
π» Repo
β george-QF/TMAS-code β 4 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - george-QF/TMAS-code
Contribute to george-QF/TMAS-code development by creating an account on GitHub.
π₯1
π Dataset #Dataset #Multimodal #HyperspectralImaging #RemoteSensing
Hyperspectral Invasive Detection Dataset
π€ ziya07
π― Task
Hyperspectral invasive plant classification
π‘ Idea
Hyperspectral vegetation observations with .mat image cubes plus tabular metadata: 10 spectral bands, PCA and Gabor features, geolocation, environmental variables, species/status labels, confidence, and ground truth across ecological regions.
β¨ Why it's interesting
Combines spectra, texture, environment, and verified labels to support invasive-species detection and ecological mapping studies.
Downloads: 33 | Likes: 13
π dataset
via @Papers.Data.Code
Hyperspectral Invasive Detection Dataset
π€ ziya07
π― Task
Hyperspectral invasive plant classification
π‘ Idea
Hyperspectral vegetation observations with .mat image cubes plus tabular metadata: 10 spectral bands, PCA and Gabor features, geolocation, environmental variables, species/status labels, confidence, and ground truth across ecological regions.
β¨ Why it's interesting
Combines spectra, texture, environment, and verified labels to support invasive-species detection and ecological mapping studies.
Downloads: 33 | Likes: 13
π dataset
via @Papers.Data.Code
Kaggle
Hyperspectral Invasive Detection Dataset
Spectral-Spatial Vegetation Features for Intelligent Ecological Mapping
π₯ Repo #Repo #LLM #Metal #KvCache
Ds4
π€ antirez
π― Task
Local LLM inference and serving
π‘ Idea
Run DeepSeek V4 Flash locally on Apple Metal with a model-specific engine, chat CLI, OpenAI/Anthropic-compatible server, long-context support, and disk-persistent KV cache to reuse prompt prefixes across sessions.
β¨ Why it's interesting
Supports up to 1M-token context and disk KV persistence; reports 468 t/s prefill on M3 Ultra q2.
π» Repo
β antirez/ds4 β 8.0k stars (+5.3k 3d)
C
via @Papers.Data.Code
Ds4
π€ antirez
π― Task
Local LLM inference and serving
π‘ Idea
Run DeepSeek V4 Flash locally on Apple Metal with a model-specific engine, chat CLI, OpenAI/Anthropic-compatible server, long-context support, and disk-persistent KV cache to reuse prompt prefixes across sessions.
β¨ Why it's interesting
Supports up to 1M-token context and disk KV persistence; reports 468 t/s prefill on M3 Ultra q2.
π» Repo
β antirez/ds4 β 8.0k stars (+5.3k 3d)
C
via @Papers.Data.Code
GitHub
GitHub - antirez/ds4: DeepSeek 4 Flash local inference engine for Metal and CUDA
DeepSeek 4 Flash local inference engine for Metal and CUDA - antirez/ds4
π Paper #Paper #LLM #MemoryMechanisms #Attention
Ξ΄-mem: Efficient Online Memory for Large Language Models
π€ Jingdi Lei, Di Zhang, Junxian Li et al.
π― Task
Long-term memory augmentation for LLMs
π‘ Idea
Instead of storing history as extra tokens, retrieval text, or static adapters, it keeps a fixed-size online associative state and turns its readout into low-rank attention corrections for a frozen backbone.
β¨ Why it's interesting
With only an 8Γ8 state, average score reaches 1.10Γ the frozen backbone and 1.15Γ the best non-Ξ΄-mem baseline; 1.31Γ on MemoryAgentBench and 1.20Γ on LoCoMo.
π» Repo
β declare-lab/delta-Mem β 53 stars
π paper
via @Papers.Data.Code
Ξ΄-mem: Efficient Online Memory for Large Language Models
π€ Jingdi Lei, Di Zhang, Junxian Li et al.
π― Task
Long-term memory augmentation for LLMs
π‘ Idea
Instead of storing history as extra tokens, retrieval text, or static adapters, it keeps a fixed-size online associative state and turns its readout into low-rank attention corrections for a frozen backbone.
β¨ Why it's interesting
With only an 8Γ8 state, average score reaches 1.10Γ the frozen backbone and 1.15Γ the best non-Ξ΄-mem baseline; 1.31Γ on MemoryAgentBench and 1.20Γ on LoCoMo.
π» Repo
β declare-lab/delta-Mem β 53 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - declare-lab/delta-Mem: The official repo of the paper: delta-Mem: Efficient Online Memory for Large Language Models
The official repo of the paper: delta-Mem: Efficient Online Memory for Large Language Models - declare-lab/delta-Mem
π Dataset #Dataset #Tabular #Epidemiology #InfectiousDisease
π¦ Hantavirus (Andes Virus) β Global Epidemiology
π€ zkskhurram
π― Task
Infectious disease epidemiology analysis
π‘ Idea
7 linked tables covering 25 countries across 5 WHO regions: yearly data from 1993β2025, outbreaks, monthly trends, clinical outcomes, environmental risk factors, virus strains, and a consolidated master table.
β¨ Why it's interesting
Combines epidemiology, clinical, environmental, and strain data in one dataset, enabling cross-country HPS/HFRS trend and risk analysis from a single source.
Size: 7 tables, 25 countries, 1993β2025
π Dataset
π₯ 662 downloads
β€οΈ 26 likes
π dataset
via @Papers.Data.Code
π¦ Hantavirus (Andes Virus) β Global Epidemiology
π€ zkskhurram
π― Task
Infectious disease epidemiology analysis
π‘ Idea
7 linked tables covering 25 countries across 5 WHO regions: yearly data from 1993β2025, outbreaks, monthly trends, clinical outcomes, environmental risk factors, virus strains, and a consolidated master table.
β¨ Why it's interesting
Combines epidemiology, clinical, environmental, and strain data in one dataset, enabling cross-country HPS/HFRS trend and risk analysis from a single source.
Size: 7 tables, 25 countries, 1993β2025
π Dataset
π₯ 662 downloads
β€οΈ 26 likes
π dataset
via @Papers.Data.Code
Kaggle
π¦ Hantavirus (Andes Virus) β Global Epidemiology
π Comprehensive worldwide dataset covering HPS/HFRS cases, clinical outcomes
π» Repo #Repo #CV #4dReconstruction #DynamicScenes
D4rt
π€ lucidrains
π― Task
dynamic scene reconstruction from video
π‘ Idea
Predict 3D points in dynamic scenes from video plus coordinate and time queries, with a trainable PyTorch model that can return losses for supervision or direct point predictions.
β¨ Why it's interesting
Provides a ready-to-use D4RT implementation with batched variable-length video/query handling for 4D reconstruction experiments.
π» Repo
β lucidrains/d4rt β 50 stars (+50 3d)
Python
via @Papers.Data.Code
D4rt
π€ lucidrains
π― Task
dynamic scene reconstruction from video
π‘ Idea
Predict 3D points in dynamic scenes from video plus coordinate and time queries, with a trainable PyTorch model that can return losses for supervision or direct point predictions.
β¨ Why it's interesting
Provides a ready-to-use D4RT implementation with batched variable-length video/query handling for 4D reconstruction experiments.
π» Repo
β lucidrains/d4rt β 50 stars (+50 3d)
Python
via @Papers.Data.Code
GitHub
GitHub - lucidrains/d4rt: Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, Deepmind
Implementation of D4RT, Efficiently Reconstructing Dynamic Scenes, Deepmind - lucidrains/d4rt
π Paper #Paper #Multimodal #VisionLanguageModels #ImageGeneration
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
π€ Haiwen Diao, Penghao Wu, Hanming Deng et al.
π― Task
Unified multimodal understanding and generation
π‘ Idea
Instead of bolting together encoder-based understanding and VAE/diffusion generation, it uses one native pixel-text backbone with shared attention and stream-specific MoT blocks, trained jointly for text prediction and pixel-space flow matching.
β¨ Why it's interesting
Authors claim it rivals top understanding-only VLMs and outperforms prior open-source unified models across understanding, reasoning, and generation; generation runs at 32Γ compression.
π» Repo
β OpenSenseNova/SenseNova-U1 β 1.7k stars
π paper
via @Papers.Data.Code
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
π€ Haiwen Diao, Penghao Wu, Hanming Deng et al.
π― Task
Unified multimodal understanding and generation
π‘ Idea
Instead of bolting together encoder-based understanding and VAE/diffusion generation, it uses one native pixel-text backbone with shared attention and stream-specific MoT blocks, trained jointly for text prediction and pixel-space flow matching.
β¨ Why it's interesting
Authors claim it rivals top understanding-only VLMs and outperforms prior open-source unified models across understanding, reasoning, and generation; generation runs at 32Γ compression.
π» Repo
β OpenSenseNova/SenseNova-U1 β 1.7k stars
π paper
via @Papers.Data.Code
GitHub
GitHub - OpenSenseNova/SenseNova-U1: SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles
SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles - OpenSenseNova/SenseNova-U1
π Paper #Paper #CV #VideoGeneration #DiffusionModels
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
π€ Yuchao Gu, Guian Fang, Yuxin Jiang et al.
π― Task
Any-step video generation
π‘ Idea
Instead of endpoint consistency maps for fixed few-step sampling, it learns arbitrary-time flow-map transitions along the full ODE path, then uses shortcut backward simulation for on-policy distillation to cut discretization error and causal exposure bias.
β¨ Why it's interesting
On 14B T2V, it gets 84.05 VBench at 4 NFEs and 84.41 at 32; beats Krea-Realtime-14B's 83.25 at 4 and rCM-14B's 83.73 at 4.
π» Repo
β NVlabs/AnyFlow β 202 stars
β NVLabs/AnyFlow β 202 stars
π paper
via @Papers.Data.Code
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
π€ Yuchao Gu, Guian Fang, Yuxin Jiang et al.
π― Task
Any-step video generation
π‘ Idea
Instead of endpoint consistency maps for fixed few-step sampling, it learns arbitrary-time flow-map transitions along the full ODE path, then uses shortcut backward simulation for on-policy distillation to cut discretization error and causal exposure bias.
β¨ Why it's interesting
On 14B T2V, it gets 84.05 VBench at 4 NFEs and 84.41 at 32; beats Krea-Realtime-14B's 83.25 at 4 and rCM-14B's 83.73 at 4.
π» Repo
β NVlabs/AnyFlow β 202 stars
β NVLabs/AnyFlow β 202 stars
π paper
via @Papers.Data.Code
GitHub
GitHub - NVlabs/AnyFlow
Contribute to NVlabs/AnyFlow development by creating an account on GitHub.
π Paper #Paper #LLM #ReinforcementLearning #AgentTraining
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
π€ Gaotang Li, Bhavana Dalvi Mishra, Zifeng Wang et al.
π― Task
Long-form deep research agent training
π‘ Idea
Instead of using rubrics only to score final answers, RubricEM uses them to structure execution, reward each stage, and store experience. It decomposes research into Plan/Research/Review/Answer, applies stagewise GRPO for denser credit, and jointly trains a reflection policy as reusable memory.
β¨ Why it's interesting
RubricEM-8B outperforms comparable open models on 4 long-form research benchmarks and approaches proprietary deep-research systems after 1400 RL steps.
π paper
via @Papers.Data.Code
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
π€ Gaotang Li, Bhavana Dalvi Mishra, Zifeng Wang et al.
π― Task
Long-form deep research agent training
π‘ Idea
Instead of using rubrics only to score final answers, RubricEM uses them to structure execution, reward each stage, and store experience. It decomposes research into Plan/Research/Review/Answer, applies stagewise GRPO for denser credit, and jointly trains a reflection policy as reusable memory.
β¨ Why it's interesting
RubricEM-8B outperforms comparable open models on 4 long-form research benchmarks and approaches proprietary deep-research systems after 1400 RL steps.
π paper
via @Papers.Data.Code
arXiv.org
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond...
Training deep research agents, namely systems that plan, search, evaluate evidence, and synthesize long-form reports, pushes reinforcement learning beyond the regime of verifiable rewards. Their...
π Dataset #Dataset #NLP #StemReasoning #VisualQuestionAnswering
open-mm-rl
π€ TuringEnterprises
π― Task
Multimodal STEM question answering
π‘ Idea
40 MIT-licensed STEM QA examples across physics, math, biology, and chemistry, spanning single-image, multi-panel, and multi-image formats with deterministic final answers.
β¨ Why it's interesting
Deterministic, programmatically checkable answers make advanced multimodal STEM reasoning benchmarkable for RL and outcome-supervised training.
Size: 40 examples, 15.5 MB
π Dataset
π₯ 2.6k downloads
β€οΈ 94 likes
π dataset
via @Papers.Data.Code
open-mm-rl
π€ TuringEnterprises
π― Task
Multimodal STEM question answering
π‘ Idea
40 MIT-licensed STEM QA examples across physics, math, biology, and chemistry, spanning single-image, multi-panel, and multi-image formats with deterministic final answers.
β¨ Why it's interesting
Deterministic, programmatically checkable answers make advanced multimodal STEM reasoning benchmarkable for RL and outcome-supervised training.
Size: 40 examples, 15.5 MB
π Dataset
π₯ 2.6k downloads
β€οΈ 94 likes
π dataset
via @Papers.Data.Code
huggingface.co
TuringEnterprises/Open-MM-RL Β· Datasets at Hugging Face
Weβre on a journey to advance and democratize artificial intelligence through open source and open science.
π» Repo #Repo #CV #ImageToVideo #2kGeneration
Swifti2v
π€ HKUST-LongGroup
π― Task
high-resolution image-to-video generation
π‘ Idea
Generate native 2K videos from a single image by first producing a low-res motion reference, then refining to high resolution while conditioning on both the input image and the Stage I video.
β¨ Why it's interesting
Matches strong 2K end-to-end I2V baselines on key VBench-I2V metrics with 202Γ less GPU-time; 81-frame 2K output runs in ~111s on one H800 and fits on a 24 GB RTX 4090.
π» Repo
β HKUST-LongGroup/SwiftI2V β 71 stars (+47 3d)
HTML
π paper
via @Papers.Data.Code
Swifti2v
π€ HKUST-LongGroup
π― Task
high-resolution image-to-video generation
π‘ Idea
Generate native 2K videos from a single image by first producing a low-res motion reference, then refining to high resolution while conditioning on both the input image and the Stage I video.
β¨ Why it's interesting
Matches strong 2K end-to-end I2V baselines on key VBench-I2V metrics with 202Γ less GPU-time; 81-frame 2K output runs in ~111s on one H800 and fits on a 24 GB RTX 4090.
π» Repo
β HKUST-LongGroup/SwiftI2V β 71 stars (+47 3d)
HTML
π paper
via @Papers.Data.Code
GitHub
GitHub - HKUST-LongGroup/SwiftI2V: Project page for paper "SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditionalβ¦
Project page for paper "SwiftI2V: Efficient High-Resolution Image-to-Video Generation via Conditional Segment-wise Generation" - HKUST-LongGroup/SwiftI2V
π Weekly Digest Β· May 09 β May 16
#WeeklyDigest
π Papers
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
#VideoGeneration #DiffusionModels #Distillation
Any-step video diffusion βΆ 84.05 VBench at 4 NFEs
β Learn more...
Flow-OPD: On-Policy Distillation for Flow Matching Models
#TextToImage #KnowledgeDistillation #ReinforcementLearning
On-policy flow distillation βΆ boosts GenEval and OCR
β Learn more...
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
#VisionLanguageModels #ImageGeneration #MixtureOfExperts
NEO-unify multimodal model βΆ unifies understanding and generation
β Learn more...
Ξ΄-mem: Efficient Online Memory for Large Language Models
#MemoryMechanisms #Attention #ParameterEfficientTuning
Online associative memory βΆ steers attention for long-horizon tasks
β Learn more...
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
#TestTimeScaling #Reasoning #AgenticSearch
Offline replay controller βΆ improves accuracy-cost tradeoffs
β Learn more...
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
#ReinforcementLearning #AgentTraining #LongContextReasoning
Rubric-guided meta-RL βΆ stagewise credit for research agents
β Learn more...
π» Repos
antirez/ds4 β
#Metal #KvCache #OpenaiCompatible
Metal local inference βΆ 1M context with disk KV cache
β Learn more...
facebookresearch/ProgramBench β
#Benchmark #SoftwareEngineering #ReverseEngineering
Program reconstruction benchmark βΆ tests LM reverse engineering
β Learn more...
sparolab/KISS-IMU β
#InertialOdometry #SelfSupervised #LidarPseudoLabels
Self-supervised IMU odometry βΆ denoises raw IMU with LiDAR labels
β Learn more...
π Datasets
AI Index Data: Growth, Talent (Cambridge/Harvard)
#GlobalAI #CountryIndicators #LongitudinalData
Global AI panel dataset βΆ cross-country trend forecasting
β Learn more...
giant-permissive-image-corpus
#ImageGeneration #PermissiveLicense #ImageDataset
Permissive image corpus βΆ trains visual generation
β Learn more...
β‘οΈ Tomorrow β Efficient ML Monthly
via @Papers.Data.Code
#WeeklyDigest
π Papers
AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation
#VideoGeneration #DiffusionModels #Distillation
Any-step video diffusion βΆ 84.05 VBench at 4 NFEs
β Learn more...
Flow-OPD: On-Policy Distillation for Flow Matching Models
#TextToImage #KnowledgeDistillation #ReinforcementLearning
On-policy flow distillation βΆ boosts GenEval and OCR
β Learn more...
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
#VisionLanguageModels #ImageGeneration #MixtureOfExperts
NEO-unify multimodal model βΆ unifies understanding and generation
β Learn more...
Ξ΄-mem: Efficient Online Memory for Large Language Models
#MemoryMechanisms #Attention #ParameterEfficientTuning
Online associative memory βΆ steers attention for long-horizon tasks
β Learn more...
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
#TestTimeScaling #Reasoning #AgenticSearch
Offline replay controller βΆ improves accuracy-cost tradeoffs
β Learn more...
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
#ReinforcementLearning #AgentTraining #LongContextReasoning
Rubric-guided meta-RL βΆ stagewise credit for research agents
β Learn more...
π» Repos
antirez/ds4 β
#Metal #KvCache #OpenaiCompatible
Metal local inference βΆ 1M context with disk KV cache
β Learn more...
facebookresearch/ProgramBench β
#Benchmark #SoftwareEngineering #ReverseEngineering
Program reconstruction benchmark βΆ tests LM reverse engineering
β Learn more...
sparolab/KISS-IMU β
#InertialOdometry #SelfSupervised #LidarPseudoLabels
Self-supervised IMU odometry βΆ denoises raw IMU with LiDAR labels
β Learn more...
π Datasets
AI Index Data: Growth, Talent (Cambridge/Harvard)
#GlobalAI #CountryIndicators #LongitudinalData
Global AI panel dataset βΆ cross-country trend forecasting
β Learn more...
giant-permissive-image-corpus
#ImageGeneration #PermissiveLicense #ImageDataset
Permissive image corpus βΆ trains visual generation
β Learn more...
β‘οΈ Tomorrow β Efficient ML Monthly
via @Papers.Data.Code
π1
π Monthly Β· Efficient ML Β· Apr 17 β May 17
#MonthlyDigest #EfficientML
π Papers
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
#DiffusionModels #DecisionTrees #KnowledgeDistillation
Trees and flows βΆ faster tabular generation
β Learn more...
π Datasets
MSR-ACC/TAE25
#QuantumChemistry #AtomizationEnergy #CoupledCluster
Quantum chemistry dataset βΆ trains atomization energy models
β Learn more...
AI Index Data: Growth, Talent (Cambridge/Harvard)
#GlobalAI #CountryIndicators #LongitudinalData
Global AI panel dataset βΆ cross-country trend forecasting
β Learn more...
WHO Global Health Indicators for Prediction
#GlobalHealth #CountryLevel #WorldBank
Global health panel data βΆ cross-country trend analysis
β Learn more...
β‘ Trends
βΈ Longitudinal country-level datasets increasingly target forecasting and cross-country trend analysis.
βΈ Wide, linked, multi-table dataset formats are becoming standard for benchmarking.
βΈ Efficiency gains come from unifying model families and distilling complex systems.
π§ TL;DR
π Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
Unifies trees and diffusion, delivering faster tabular generation and effective distillation.
π‘ Efficiency advances increasingly come from unifying classical structures with generative modeling.
via @Papers.Data.Code
#MonthlyDigest #EfficientML
π Papers
Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
#DiffusionModels #DecisionTrees #KnowledgeDistillation
Trees and flows βΆ faster tabular generation
β Learn more...
π Datasets
MSR-ACC/TAE25
#QuantumChemistry #AtomizationEnergy #CoupledCluster
Quantum chemistry dataset βΆ trains atomization energy models
β Learn more...
AI Index Data: Growth, Talent (Cambridge/Harvard)
#GlobalAI #CountryIndicators #LongitudinalData
Global AI panel dataset βΆ cross-country trend forecasting
β Learn more...
WHO Global Health Indicators for Prediction
#GlobalHealth #CountryLevel #WorldBank
Global health panel data βΆ cross-country trend analysis
β Learn more...
β‘ Trends
βΈ Longitudinal country-level datasets increasingly target forecasting and cross-country trend analysis.
βΈ Wide, linked, multi-table dataset formats are becoming standard for benchmarking.
βΈ Efficiency gains come from unifying model families and distilling complex systems.
π§ TL;DR
π Trees to Flows and Back: Unifying Decision Trees and Diffusion Models
Unifies trees and diffusion, delivering faster tabular generation and effective distillation.
π‘ Efficiency advances increasingly come from unifying classical structures with generative modeling.
via @Papers.Data.Code