πΉ Title: VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing
πΉ Publication Date: Published on Oct 6
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.05213
β’ PDF: https://arxiv.org/pdf/2510.05213
β’ Project Page: https://yixiaowang7.github.io/ver_page/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 6
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.05213
β’ PDF: https://arxiv.org/pdf/2510.05213
β’ Project Page: https://yixiaowang7.github.io/ver_page/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
πΉ Publication Date: Published on Oct 11
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10062
β’ PDF: https://arxiv.org/pdf/2510.10062
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 11
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10062
β’ PDF: https://arxiv.org/pdf/2510.10062
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
β€2
πΉ Title: Spotlight on Token Perception for Multimodal Reinforcement Learning
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09285
β’ PDF: https://arxiv.org/pdf/2510.09285
β’ Project Page: https://huggingface.co/collections/chamber111/vppo-data-68e7aaafe1bffbab844d341b
β’ Github: https://github.com/huaixuheqing/VPPO-RL
πΉ Datasets citing this paper:
β’ https://huggingface.co/datasets/chamber111/VPPO-Eval
β’ https://huggingface.co/datasets/chamber111/VPPO_ViRL39K_train
β’ https://huggingface.co/datasets/chamber111/VPPO_MMK12_validation
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09285
β’ PDF: https://arxiv.org/pdf/2510.09285
β’ Project Page: https://huggingface.co/collections/chamber111/vppo-data-68e7aaafe1bffbab844d341b
β’ Github: https://github.com/huaixuheqing/VPPO-RL
πΉ Datasets citing this paper:
β’ https://huggingface.co/datasets/chamber111/VPPO-Eval
β’ https://huggingface.co/datasets/chamber111/VPPO_ViRL39K_train
β’ https://huggingface.co/datasets/chamber111/VPPO_MMK12_validation
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11652
β’ PDF: https://arxiv.org/pdf/2510.11652
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11652
β’ PDF: https://arxiv.org/pdf/2510.11652
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
β€1
πΉ Title: InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11341
β’ PDF: https://arxiv.org/pdf/2510.11341
β’ Project Page: https://hmwang2002.github.io/release/internsvg/
β’ Github: https://github.com/hmwang2002/InternSVG
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11341
β’ PDF: https://arxiv.org/pdf/2510.11341
β’ Project Page: https://hmwang2002.github.io/release/internsvg/
β’ Github: https://github.com/hmwang2002/InternSVG
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation
πΉ Publication Date: Published on Oct 8
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.07624
β’ PDF: https://arxiv.org/pdf/2510.07624
β’ Github: https://github.com/abenechehab/nll_to_po
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 8
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.07624
β’ PDF: https://arxiv.org/pdf/2510.07624
β’ Github: https://github.com/abenechehab/nll_to_po
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: Latent Refinement Decoding: Enhancing Diffusion-Based Language Models by Refining Belief States
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11052
β’ PDF: https://arxiv.org/pdf/2510.11052
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11052
β’ PDF: https://arxiv.org/pdf/2510.11052
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: CoBia: Constructed Conversations Can Trigger Otherwise Concealed Societal Biases in LLMs
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09871
β’ PDF: https://arxiv.org/pdf/2510.09871
β’ Project Page: https://github.com/nafisenik/CoBia
β’ Github: https://github.com/nafisenik/CoBia
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09871
β’ PDF: https://arxiv.org/pdf/2510.09871
β’ Project Page: https://github.com/nafisenik/CoBia
β’ Github: https://github.com/nafisenik/CoBia
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: Through the Perspective of LiDAR: A Feature-Enriched and Uncertainty-Aware Annotation Pipeline for Terrestrial Point Cloud Segmentation
πΉ Publication Date: Published on Oct 8
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.06582
β’ PDF: https://arxiv.org/pdf/2510.06582
β’ Project Page: https://fz-rit.github.io/through-the-lidars-eye/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 8
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.06582
β’ PDF: https://arxiv.org/pdf/2510.06582
β’ Project Page: https://fz-rit.github.io/through-the-lidars-eye/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11218
β’ PDF: https://arxiv.org/pdf/2510.11218
β’ Github: https://github.com/WorldHellow/SLAQ
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11218
β’ PDF: https://arxiv.org/pdf/2510.11218
β’ Github: https://github.com/WorldHellow/SLAQ
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
πΉ Publication Date: Published on Oct 12
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/pdf/2510.10606
β’ PDF: https://arxiv.org/pdf/2510.10606
β’ Github: https://github.com/dvlab-research/ViSurf?tab=readme-ov-file
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 12
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/pdf/2510.10606
β’ PDF: https://arxiv.org/pdf/2510.10606
β’ Github: https://github.com/dvlab-research/ViSurf?tab=readme-ov-file
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
β€1
πΉ Title: SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning
πΉ Publication Date: Published on Oct 11
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10047
β’ PDF: https://arxiv.org/pdf/2510.10047
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 11
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10047
β’ PDF: https://arxiv.org/pdf/2510.10047
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge
πΉ Publication Date: Published on Oct 5
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.04201
β’ PDF: https://arxiv.org/pdf/2510.04201
β’ Github: https://github.com/mhson-kyle/World-To-Image
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 5
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.04201
β’ PDF: https://arxiv.org/pdf/2510.04201
β’ Github: https://github.com/mhson-kyle/World-To-Image
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11496
β’ PDF: https://arxiv.org/pdf/2510.11496
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 13
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.11496
β’ PDF: https://arxiv.org/pdf/2510.11496
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: A Tale of LLMs and Induced Small Proxies: Scalable Agents for Knowledge Mining
πΉ Publication Date: Published on Oct 1
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.01427
β’ PDF: https://arxiv.org/pdf/2510.01427
β’ Github: https://github.com/LongfeiYun17/falconer
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 1
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.01427
β’ PDF: https://arxiv.org/pdf/2510.01427
β’ Github: https://github.com/LongfeiYun17/falconer
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: Multimodal Policy Internalization for Conversational Agents
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09474
β’ PDF: https://arxiv.org/pdf/2510.09474
β’ Project Page: https://mikewangwzhl.github.io/TriMPI/
β’ Github: https://mikewangwzhl.github.io/TriMPI/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09474
β’ PDF: https://arxiv.org/pdf/2510.09474
β’ Project Page: https://mikewangwzhl.github.io/TriMPI/
β’ Github: https://mikewangwzhl.github.io/TriMPI/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09023
β’ PDF: https://arxiv.org/pdf/2510.09023
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 10
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.09023
β’ PDF: https://arxiv.org/pdf/2510.09023
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: MultiCOIN: Multi-Modal COntrollable Video INbetweening
πΉ Publication Date: Published on Oct 9
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.08561
β’ PDF: https://arxiv.org/pdf/2510.08561
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 9
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.08561
β’ PDF: https://arxiv.org/pdf/2510.08561
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning
πΉ Publication Date: Published on Oct 9
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.07731
β’ PDF: https://arxiv.org/pdf/2510.07731
β’ Github: https://github.com/skylarkie/oMeBench
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 9
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.07731
β’ PDF: https://arxiv.org/pdf/2510.07731
β’ Github: https://github.com/skylarkie/oMeBench
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Title: VLM-Guided Adaptive Negative Prompting for Creative Generation
πΉ Publication Date: Published on Oct 12
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10715
β’ PDF: https://arxiv.org/pdf/2510.10715
β’ Github: https://shelley-golan.github.io/VLM-Guided-Creative-Generation/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
πΉ Publication Date: Published on Oct 12
πΉ Paper Links:
β’ arXiv Page: https://arxiv.org/abs/2510.10715
β’ PDF: https://arxiv.org/pdf/2510.10715
β’ Github: https://shelley-golan.github.io/VLM-Guided-Creative-Generation/
πΉ Datasets citing this paper:
No datasets found
πΉ Spaces citing this paper:
No spaces found
==================================
For more data science resources:
β https://t.me/DataScienceT
π€π§ Thinking with Camera 2.0: A Powerful Multimodal Model for Camera-Centric Understanding and Generation
ποΈ 14 Oct 2025
π AI News & Trends
In the rapidly evolving field of multimodal AI, bridging gaps between vision, language and geometry is one of the frontier challenges. Traditional vision-language models excel at describing what is in an image βa cat on a sofaβ βa red car on the roadβ but struggle to reason about how the image was captured: the cameraβs ...
#MultimodalAI #CameraCentricUnderstanding #VisionLanguageModels #AIResearch #ComputerVision #GenerativeModels
ποΈ 14 Oct 2025
π AI News & Trends
In the rapidly evolving field of multimodal AI, bridging gaps between vision, language and geometry is one of the frontier challenges. Traditional vision-language models excel at describing what is in an image βa cat on a sofaβ βa red car on the roadβ but struggle to reason about how the image was captured: the cameraβs ...
#MultimodalAI #CameraCentricUnderstanding #VisionLanguageModels #AIResearch #ComputerVision #GenerativeModels