ML Research Hub
32.9K subscribers
4.45K photos
273 videos
23 files
4.81K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
🔹 Title: VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing

🔹 Publication Date: Published on Oct 6

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.05213
• PDF: https://arxiv.org/pdf/2510.05213
• Project Page: https://yixiaowang7.github.io/ver_page/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

For more data science resources:
✓ https://t.me/DataScienceT
🔹 Title: HUME: Measuring the Human-Model Performance Gap in Text Embedding Task

🔹 Publication Date: Published on Oct 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10062
• PDF: https://arxiv.org/pdf/2510.10062

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Spotlight on Token Perception for Multimodal Reinforcement Learning

🔹 Publication Date: Published on Oct 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09285
• PDF: https://arxiv.org/pdf/2510.09285
• Project Page: https://huggingface.co/collections/chamber111/vppo-data-68e7aaafe1bffbab844d341b
• Github: https://github.com/huaixuheqing/VPPO-RL

🔹 Datasets citing this paper:
• https://huggingface.co/datasets/chamber111/VPPO-Eval
• https://huggingface.co/datasets/chamber111/VPPO_ViRL39K_train
• https://huggingface.co/datasets/chamber111/VPPO_MMK12_validation

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11652
• PDF: https://arxiv.org/pdf/2510.11652

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11341
• PDF: https://arxiv.org/pdf/2510.11341
• Project Page: https://hmwang2002.github.io/release/internsvg/
• Github: https://github.com/hmwang2002/InternSVG

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.07624
• PDF: https://arxiv.org/pdf/2510.07624
• Github: https://github.com/abenechehab/nll_to_po

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11052
• PDF: https://arxiv.org/pdf/2510.11052

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs

🔹 Publication Date: Published on Oct 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09871
• PDF: https://arxiv.org/pdf/2510.09871
• Project Page: https://github.com/nafisenik/CoBia
• Github: https://github.com/nafisenik/CoBia

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation

🔹 Publication Date: Published on Oct 8

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.06582
• PDF: https://arxiv.org/pdf/2510.06582
• Project Page: https://fz-rit.github.io/through-the-lidars-eye/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11218
• PDF: https://arxiv.org/pdf/2510.11218
• Github: https://github.com/WorldHellow/SLAQ

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10606
• PDF: https://arxiv.org/pdf/2510.10606
• Github: https://github.com/dvlab-research/ViSurf

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning

🔹 Publication Date: Published on Oct 11

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10047
• PDF: https://arxiv.org/pdf/2510.10047

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge

🔹 Publication Date: Published on Oct 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.04201
• PDF: https://arxiv.org/pdf/2510.04201
• Github: https://github.com/mhson-kyle/World-To-Image

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model

🔹 Publication Date: Published on Oct 13

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.11496
• PDF: https://arxiv.org/pdf/2510.11496

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining

🔹 Publication Date: Published on Oct 1

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.01427
• PDF: https://arxiv.org/pdf/2510.01427
• Github: https://github.com/LongfeiYun17/falconer

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: Multimodal Policy Internalization for Conversational Agents

🔹 Publication Date: Published on Oct 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09474
• PDF: https://arxiv.org/pdf/2510.09474
• Project Page: https://mikewangwzhl.github.io/TriMPI/
• Github: https://mikewangwzhl.github.io/TriMPI/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against LLM Jailbreaks and Prompt Injections

🔹 Publication Date: Published on Oct 10

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.09023
• PDF: https://arxiv.org/pdf/2510.09023

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: MultiCOIN: Multi-Modal COntrollable Video INbetweening

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.08561
• PDF: https://arxiv.org/pdf/2510.08561

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning

🔹 Publication Date: Published on Oct 9

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.07731
• PDF: https://arxiv.org/pdf/2510.07731
• Github: https://github.com/skylarkie/oMeBench

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🔹 Title: VLM-Guided Adaptive Negative Prompting for Creative Generation

🔹 Publication Date: Published on Oct 12

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.10715
• PDF: https://arxiv.org/pdf/2510.10715
• Github: https://shelley-golan.github.io/VLM-Guided-Creative-Generation/

🔹 Datasets citing this paper:
No datasets found

🔹 Spaces citing this paper:
No spaces found
==================================

🤖🧠 Thinking with Camera 2.0: A Powerful Multimodal Model for Camera-Centric Understanding and Generation

🗓️ 14 Oct 2025
📚 AI News & Trends

In the rapidly evolving field of multimodal AI, bridging the gaps between vision, language, and geometry is one of the frontier challenges. Traditional vision-language models excel at describing what is in an image ("a cat on a sofa", "a red car on the road") but struggle to reason about how the image was captured: the camera's ...

#MultimodalAI #CameraCentricUnderstanding #VisionLanguageModels #AIResearch #ComputerVision #GenerativeModels