ML Research Hub
32.8K subscribers
4.35K photos
267 videos
23 files
4.7K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
🔹 Title: Reinforcement Mid-Training

🔹 Publication Date: Published on Sep 29

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2509.24375
• PDF: https://arxiv.org/pdf/2509.24375
• Github: https://github.com/Mid-Training/RMT

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: M3Retrieve: Benchmarking Multimodal Retrieval for Medicine

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.06888
• PDF: https://arxiv.org/pdf/2510.06888

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08565
• PDF: https://arxiv.org/pdf/2510.08565
• Project Page: https://internvl.github.io/blog/2025-10-10-NaViL/
• Github: https://github.com/OpenGVLab/NaViL

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Agent Learning via Early Experience

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08558
• PDF: https://arxiv.org/pdf/2510.08558

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
1
🔹 Title: MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08540
• PDF: https://arxiv.org/pdf/2510.08540

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: UNIDOC-BENCH: A Unified Benchmark for Document-Centric Multimodal RAG

🔹 Publication Date: Published on Oct 4

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.03663
• PDF: https://arxiv.org/pdf/2510.03663
• Github: https://github.com/SalesforceAIResearch/UniDoc-Bench

🔹 Datasets citing this paper:
https://huggingface.co/datasets/Salesforce/UniDoc-Bench

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: DeepPrune: Parallel Scaling without Inter-trace Redundancy

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08483
• PDF: https://arxiv.org/pdf/2510.08483
• Project Page: https://deepprune.github.io/
• Github: https://github.com/THU-KEG/DeepPrune

🔹 Datasets citing this paper:
https://huggingface.co/datasets/THU-KEG/DeepPrune

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

🔹 Publication Date: Published on Oct 3

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.03222
• PDF: https://arxiv.org/pdf/2510.03222
• Github: https://github.com/CarlanLark/Lp-Reg-dev

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Large Scale Diffusion Distillation via Score-Regularized Continuous-Time Consistency

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08431
• PDF: https://arxiv.org/pdf/2510.08431
• Github: https://github.com/NVlabs/rcm

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: UniVideo: Unified Understanding, Generation, and Editing for Videos

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08377
• PDF: https://arxiv.org/pdf/2510.08377
• Project Page: https://congwei1230.github.io/UniVideo/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: First Try Matters: Revisiting the Role of Reflection in Reasoning Models

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08308
• PDF: https://arxiv.org/pdf/2510.08308

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
1
🔹 Title: NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.07172
• PDF: https://arxiv.org/pdf/2510.07172
• Github: https://github.com/HKUST-KnowComp/NewtonBench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08555
• PDF: https://arxiv.org/pdf/2510.08555
• Project Page: https://onevfall.github.io/project_page/videocanvas/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Recycling Pretrained Checkpoints: Orthogonal Growth of Mixture-of-Experts for Efficient Large Language Model Pre-Training

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08008
• PDF: https://arxiv.org/pdf/2510.08008

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.07429
• PDF: https://arxiv.org/pdf/2510.07429

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.06915
• PDF: https://arxiv.org/pdf/2510.06915
• Github: https://github.com/LCM-Lab/LongRM

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08559
• PDF: https://arxiv.org/pdf/2510.08559
• Project Page: https://scivideobench.github.io/
• Github: https://github.com/dengandong/SciVideoBench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08549
• PDF: https://arxiv.org/pdf/2510.08549
• Project Page: https://nothingbutbut.github.io/era/
• Github: https://github.com/nothingbutbut/era

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Reinforcing Diffusion Models by Direct Group Preference Optimization

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08425
• PDF: https://arxiv.org/pdf/2510.08425

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Training-Free Group Relative Policy Optimization

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08191
• PDF: https://arxiv.org/pdf/2510.08191
• Github: https://github.com/TencentCloudADP/youtu-agent/tree/training_free_GRPO

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08143
• PDF: https://arxiv.org/pdf/2510.08143
• Project Page: https://shiandu.github.io/UniMMVSR-website/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT