π Dataset #Dataset #LLM #CodingAgent #AgentTraces
SWE-chat
π€ SALT-NLP
π― Task
AI coding session modeling
π‘ Idea
205+ repositories of real developerβAI coding sessions with full chat transcripts, tool calls, thinking traces, code changes, and authorship attribution between humans and agents.
β¨ Why it's interesting
Combines interaction traces with code edits and authorship labels, enabling study of real human-agent coding workflows.
Size: 205+ repositories
Downloads: 1.5k | Likes: 34
π dataset
via @Papers.Data.Code
SWE-chat
π€ SALT-NLP
π― Task
AI coding session modeling
π‘ Idea
205+ repositories of real developerβAI coding sessions with full chat transcripts, tool calls, thinking traces, code changes, and authorship attribution between humans and agents.
β¨ Why it's interesting
Combines interaction traces with code edits and authorship labels, enabling study of real human-agent coding workflows.
Size: 205+ repositories
Downloads: 1.5k | Likes: 34
π dataset
via @Papers.Data.Code
π Weekly Digest | May 02 β May 09
#WeeklyDigest
π Papers
MolmoAct2: Action Reasoning Models for Real-world Deployment
#VisionLanguageAction #EmbodiedReasoning #ImitationLearning
Vision-language-action model βΆ beats VLA baselines on 7 benchmarks
β Learn more...
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
#AgenticRL #ToolUse #MultimodalReasoning
Failure-aware multimodal search RL βΆ +13.8 points on 7 benchmarks
β Learn more...
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
#SearchAgents #SupervisedFineTuning #ToolUse
10.6k hard trajectories βΆ SOTA ~30B search agents
β Learn more...
Heterogeneous Scientific Foundation Model Collaboration
#AgentSystems #FoundationModels #ScientificAI
LLM-FM agent interface βΆ scientific tasks on structured data
β Learn more...
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
#3DGeneration #ArticulatedObjects #DiffusionModels
Two-stage 3D generation βΆ simulation-ready articulated assets
β Learn more...
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
#MotionCapture #PoseEstimation #3DHumanPoseEstimation
Monocular motion capture βΆ predicts animation-ready rotations
β Learn more...
π» Repos
PKU-YuanGroup/TIDE β
#DiscreteDiffusion #Distillation #CodeGeneration
Cross-architecture distillation βΆ 0.6B student, lower cost
β Learn more...
Vinayak-VG/GenWildSplat β
#3DReconstruction #GaussianSplatting #NovelViewSynthesis
Sparse-view 3D reconstruction βΆ 3D Gaussian splat in 3s
β Learn more...
YanFangCS/GenLIP β
#VisionEncoder #AutoregressivePretraining #OCR
Autoregressive ViT pretraining βΆ strong Doc and OCR gains
β Learn more...
π Datasets
SWE-chat
#CodingAgent #AgentTraces #HumanAICollaboration
SWE-chat dataset βΆ studies human-agent coding workflows
β Learn more...
gpic
#ImageGeneration #PermissiveLicense #ImageText
Permissive 100M image corpus βΆ visual generation research
β Learn more...
β‘οΈ Tomorrow β Multimodal & Agents Monthly
via @Papers.Data.Code
#WeeklyDigest
π Papers
MolmoAct2: Action Reasoning Models for Real-world Deployment
#VisionLanguageAction #EmbodiedReasoning #ImitationLearning
Vision-language-action model βΆ beats VLA baselines on 7 benchmarks
β Learn more...
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
#AgenticRL #ToolUse #MultimodalReasoning
Failure-aware multimodal search RL βΆ +13.8 points on 7 benchmarks
β Learn more...
OpenSeeker-v2: Pushing the Limits of Search Agents with Informative and High-Difficulty Trajectories
#SearchAgents #SupervisedFineTuning #ToolUse
10.6k hard trajectories βΆ SOTA ~30B search agents
β Learn more...
Heterogeneous Scientific Foundation Model Collaboration
#AgentSystems #FoundationModels #ScientificAI
LLM-FM agent interface βΆ scientific tasks on structured data
β Learn more...
PhysForge: Generating Physics-Grounded 3D Assets for Interactive Virtual World
#3DGeneration #ArticulatedObjects #DiffusionModels
Two-stage 3D generation βΆ simulation-ready articulated assets
β Learn more...
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
#MotionCapture #PoseEstimation #3DHumanPoseEstimation
Monocular motion capture βΆ predicts animation-ready rotations
β Learn more...
π» Repos
PKU-YuanGroup/TIDE β
#DiscreteDiffusion #Distillation #CodeGeneration
Cross-architecture distillation βΆ 0.6B student, lower cost
β Learn more...
Vinayak-VG/GenWildSplat β
#3DReconstruction #GaussianSplatting #NovelViewSynthesis
Sparse-view 3D reconstruction βΆ 3D Gaussian splat in 3s
β Learn more...
YanFangCS/GenLIP β
#VisionEncoder #AutoregressivePretraining #OCR
Autoregressive ViT pretraining βΆ strong Doc and OCR gains
β Learn more...
π Datasets
SWE-chat
#CodingAgent #AgentTraces #HumanAICollaboration
SWE-chat dataset βΆ studies human-agent coding workflows
β Learn more...
gpic
#ImageGeneration #PermissiveLicense #ImageText
Permissive 100M image corpus βΆ visual generation research
β Learn more...
β‘οΈ Tomorrow β Multimodal & Agents Monthly
via @Papers.Data.Code