ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
🔹 Title: PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14528
• PDF: https://arxiv.org/pdf/2510.14528

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
https://t.me/DataScienceT
🔹 Title: Attention Is All You Need for KV Cache in Diffusion LLMs

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14973
• PDF: https://arxiv.org/pdf/2510.14973
• Project Page: https://vila-lab.github.io/elastic-cache-webpage/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: AI for Service: Proactive Assistance with AI Glasses

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14359
• PDF: https://arxiv.org/pdf/2510.14359

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10518
• PDF: https://arxiv.org/pdf/2510.10518
• Github: https://github.com/qunzhongwang/vr-thinker

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14958
• PDF: https://arxiv.org/pdf/2510.14958
• Project Page: https://mathcanvas.github.io/
• Github: https://github.com/shiwk24/MathCanvas

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Qwen3Guard Technical Report

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14276
• PDF: https://arxiv.org/pdf/2510.14276
• Github: https://github.com/QwenLM/Qwen3Guard

🔹 Datasets citing this paper:
https://huggingface.co/datasets/Qwen/Qwen3GuardTest

🔹 Spaces citing this paper:
https://huggingface.co/spaces/DSDUDEd/DUDEAIBeta1.1
==================================

🔹 Title: MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14252
• PDF: https://arxiv.org/pdf/2510.14252

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Large Language Models Do NOT Really Know What They Don't Know

🔹 Publication Date: Published on Oct 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09033
• PDF: https://arxiv.org/pdf/2510.09033

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: BitNet Distillation

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13998
• PDF: https://arxiv.org/pdf/2510.13998
• Github: https://github.com/microsoft/BitNet

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14972
• PDF: https://arxiv.org/pdf/2510.14972
• Github: https://github.com/uw-swag/tokdrift

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14880
• PDF: https://arxiv.org/pdf/2510.14880

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10390
• PDF: https://arxiv.org/pdf/2510.10390

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14300
• PDF: https://arxiv.org/pdf/2510.14300

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14211
• PDF: https://arxiv.org/pdf/2510.14211

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14976
• PDF: https://arxiv.org/pdf/2510.14976
• Project Page: https://stevenlsw.github.io/ponimator/
• Github: https://github.com/stevenlsw/ponimator

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14763
• PDF: https://arxiv.org/pdf/2510.14763
• Project Page: https://huggingface.co/m-a-p
• Github: https://COIG-Writer.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14616
• PDF: https://arxiv.org/pdf/2510.14616
• Project Page: https://WritingPreferenceBench.github.io/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

🔹 Publication Date: Published on Oct 15

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13996
• PDF: https://arxiv.org/pdf/2510.13996

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: RealDPO: Real or Not Real, that is the Preference

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14955
• PDF: https://arxiv.org/pdf/2510.14955
• Project Page: https://vchitect.github.io/RealDPO-Project/
• Github: https://github.com/Vchitect/RealDPO

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14949
• PDF: https://arxiv.org/pdf/2510.14949
• Project Page: https://dialectgen.github.io/
• Github: https://github.com/DialectGen/DialectGen

🔹 Datasets citing this paper:
https://huggingface.co/datasets/uclanlp/DialectGen

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation

🔹 Publication Date: Published on Oct 16

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14902
• PDF: https://arxiv.org/pdf/2510.14902
• Project Page: https://vla-2.github.io

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================
