ML Research Hub
32.7K subscribers
4.09K photos
237 videos
23 files
4.41K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
ML Engineers: NVIDIA has released a guide for beginners on fine-tuning LLMs using Unsloth.

The guide covers:

- training methods: LoRA, FFT, RL
- when and why to do fine-tuning, real use cases
- how much data and VRAM are required
- how to train locally on DGX Spark, RTX graphics cards, and more

Guide: https://blogs.nvidia.com/blog/rtx-ai-garage-fine-tuning-unsloth-dgx-spark/

πŸ‘‰ https://t.me/DataScienceT
Please open Telegram to view this post
VIEW IN TELEGRAM
❀4
✨Latent Implicit Visual Reasoning

πŸ“ Summary:
Large Multimodal Models struggle with visual reasoning due to their text-centric nature and limitations of prior methods. This paper introduces a task-agnostic mechanism for LMMs to discover and use visual reasoning tokens without explicit supervision. The approach achieves state-of-the-art resul...

πŸ”Ή Publication Date: Published on Dec 24

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.21218
β€’ PDF: https://arxiv.org/pdf/2512.21218

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#LMMs #VisualReasoning #AI #ComputerVision #DeepLearning
❀1
✨Schoenfeld's Anatomy of Mathematical Reasoning by Language Models

πŸ“ Summary:
This paper introduces ThinkARM, a framework based on Schoenfelds Episode Theory, to abstract LLM reasoning traces into functional steps. It reveals distinct thinking dynamics and structural differences in models solving math problems, with exploration being key for correctness. This makes LLM rea...

πŸ”Ή Publication Date: Published on Dec 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.19995
β€’ PDF: https://arxiv.org/pdf/2512.19995
β€’ Github: https://github.com/MingLiiii/ThinkARM

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#LLM #AIReasoning #MathematicalReasoning #AI #MachineLearning
❀1
Media is too big
VIEW IN TELEGRAM
✨Spatia: Video Generation with Updatable Spatial Memory

πŸ“ Summary:
Spatia is a video generation framework that improves long-term consistency by using an updatable 3D scene point cloud as persistent spatial memory. It iteratively generates video clips and updates this memory via visual SLAM, enabling realistic videos and 3D-aware interactive editing.

πŸ”Ή Publication Date: Published on Dec 17

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.15716
β€’ PDF: https://arxiv.org/pdf/2512.15716
β€’ Project Page: https://zhaojingjing713.github.io/Spatia/
β€’ Github: https://github.com/ZhaoJingjing713/Spatia

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoGeneration #GenerativeAI #ComputerVision #3DReconstruction #SLAM
❀1
✨VA-Ο€: Variational Policy Alignment for Pixel-Aware Autoregressive Generation

πŸ“ Summary:
VA-$\pi$ optimizes autoregressive visual generators using a pixel-space objective to improve image quality and performance without retraining tokenizers or using external rewards. AI-generated summary...

πŸ”Ή Publication Date: Published on Dec 22

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.19680
β€’ PDF: https://arxiv.org/pdf/2512.19680
β€’ Project Page: https://lil-shake.github.io/va-pi.github.io/
β€’ Github: https://github.com/Lil-Shake/VA-Pi

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1
✨GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training

πŸ“ Summary:
Multi-turn reinforcement learning (RL) for multi-modal agents built upon vision-language models (VLMs) is hampered by sparse rewards and long-horizon credit assignment. Recent methods densify the rewa...

πŸ”Ή Publication Date: Published on Dec 15

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.13043
β€’ PDF: https://arxiv.org/pdf/2512.13043

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1
✨How Much 3D Do Video Foundation Models Encode?

πŸ“ Summary:
A new framework quantifies 3D understanding in Video Foundation Models VidFMs. VidFMs, trained only on video, show strong 3D awareness, often surpassing expert 3D models, providing insights for 3D AI.

πŸ”Ή Publication Date: Published on Dec 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.19949
β€’ PDF: https://arxiv.org/pdf/2512.19949
β€’ Project Page: https://vidfm-3d-probe.github.io/
β€’ Github: https://vidfm-3d-probe.github.io

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoFoundationModels #3DUnderstanding #ComputerVision #AIResearch #DeepLearning
❀2
✨Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

πŸ“ Summary:
AR models face inefficient exploration and sparse rewards in RL. Internal RL uses a higher-order model to learn temporal abstraction controllers. This enables efficient learning from sparse rewards where standard RL fails.

πŸ”Ή Publication Date: Published on Dec 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.20605
β€’ PDF: https://arxiv.org/pdf/2512.20605

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#ReinforcementLearning #HierarchicalRL #AutoregressiveModels #MachineLearning #ArtificialIntelligence
❀2
πŸš€ Master Data Science & Programming!

Unlock your potential with this curated list of Telegram channels. Whether you need books, datasets, interview prep, or project ideas, we have the perfect resource for you. Join the community today!


πŸ”° Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
https://t.me/CodeProgrammer

πŸ”– Machine Learning
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.
https://t.me/DataScienceM

🧠 Code With Python
This channel delivers clear, practical content for developers, covering Python, Django, Data Structures, Algorithms, and DSA – perfect for learning, coding, and mastering key programming skills.
https://t.me/DataScience4

🎯 PyData Careers | Quiz
Python Data Science jobs, interview tips, and career insights for aspiring professionals.
https://t.me/DataScienceQ

πŸ’Ύ Kaggle Data Hub
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.
https://t.me/datasets1

πŸ§‘β€πŸŽ“ Udemy Coupons | Courses
The first channel in Telegram that offers free Udemy coupons
https://t.me/DataScienceC

πŸ˜€ ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.
https://t.me/DataScienceT

πŸ’¬ Data Science Chat
An active community group for discussing data challenges and networking with peers.
https://t.me/DataScience9

🐍 Python Arab| Ψ¨Ψ§ΩŠΨ«ΩˆΩ† عربي
The largest Arabic-speaking group for Python developers to share knowledge and help.
https://t.me/PythonArab

πŸ–Š Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooksβ€”insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
https://t.me/DataScienceN

πŸ“Ί Free Online Courses | Videos
Free online courses covering data science, machine learning, analytics, programming, and essential skills for learners.
https://t.me/DataScienceV

πŸ“ˆ Data Analytics
Dive into the world of Data Analytics – uncover insights, explore trends, and master data-driven decision making.
https://t.me/DataAnalyticsX

🎧 Learn Python Hub
Master Python with step-by-step courses – from basics to advanced projects and practical applications.
https://t.me/Python53

⭐️ Research Papers
Professional Academic Writing & Simulation Services
https://t.me/DataScienceY

━━━━━━━━━━━━━━━━━━
Admin: @HusseinSheikho
Please open Telegram to view this post
VIEW IN TELEGRAM
❀1
✨Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

πŸ“ Summary:
Fast3R is a Transformer-based method for efficient and scalable multi-view 3D reconstruction. It processes many images in parallel in a single forward pass, improving speed and accuracy over pairwise approaches like DUSt3R.

πŸ”Ή Publication Date: Published on Jan 23

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2501.13928
β€’ PDF: https://arxiv.org/pdf/2501.13928
β€’ Github: https://github.com/naver/dust3r/pull/16

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/jedyang97/Fast3R_ViT_Large_512

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#3DReconstruction #ComputerVision #Transformers #Fast3R #DeepLearning
✨SkyReels-V2: Infinite-length Film Generative Model

πŸ“ Summary:
SkyReels-V2 is an infinite-length film generative model that addresses video generation challenges by synergizing MLLMs, reinforcement learning, and a diffusion forcing framework. It enables high-quality, long-form video synthesis with realistic motion and cinematic grammar awareness through mult...

πŸ”Ή Publication Date: Published on Apr 17

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2504.13074
β€’ PDF: https://arxiv.org/pdf/2504.13074
β€’ Github: https://github.com/skyworkai/skyreels-v2

πŸ”Ή Models citing this paper:
β€’ https://huggingface.co/Skywork/SkyReels-V2-I2V-14B-540P
β€’ https://huggingface.co/Skywork/SkyCaptioner-V1
β€’ https://huggingface.co/Skywork/SkyReels-V2-I2V-1.3B-540P

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/fffiloni/SkyReels-V2
β€’ https://huggingface.co/spaces/Dudu0043/SkyReels-V2
β€’ https://huggingface.co/spaces/14eee109giet/SkyReels-V2

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoGeneration #GenerativeAI #MLLM #DiffusionModels #AIResearch
❀2
✨VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos

πŸ“ Summary:
VideoRAG introduces the first RAG framework for long videos, using a dual-channel architecture to integrate textual knowledge grounding and multi-modal context encoding. This enables unlimited-length video processing and significantly outperforms existing methods.

πŸ”Ή Publication Date: Published on Feb 3

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2502.01549
β€’ PDF: https://arxiv.org/pdf/2502.01549
β€’ Github: https://github.com/hkuds/videorag

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoRAG #RAG #LongVideo #AI #MultimodalAI
❀2
πŸš€ Master Data Science & Programming!

Unlock your potential with this curated list of Telegram channels. Whether you need books, datasets, interview prep, or project ideas, we have the perfect resource for you. Join the community today!


πŸ”° Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
https://t.me/CodeProgrammer

πŸ”– Machine Learning
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.
https://t.me/DataScienceM

🧠 Code With Python
This channel delivers clear, practical content for developers, covering Python, Django, Data Structures, Algorithms, and DSA – perfect for learning, coding, and mastering key programming skills.
https://t.me/DataScience4

🎯 PyData Careers | Quiz
Python Data Science jobs, interview tips, and career insights for aspiring professionals.
https://t.me/DataScienceQ

πŸ’Ύ Kaggle Data Hub
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.
https://t.me/datasets1

πŸ§‘β€πŸŽ“ Udemy Coupons | Courses
The first channel in Telegram that offers free Udemy coupons
https://t.me/DataScienceC

πŸ˜€ ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.
https://t.me/DataScienceT

πŸ’¬ Data Science Chat
An active community group for discussing data challenges and networking with peers.
https://t.me/DataScience9

🐍 Python Arab| Ψ¨Ψ§ΩŠΨ«ΩˆΩ† عربي
The largest Arabic-speaking group for Python developers to share knowledge and help.
https://t.me/PythonArab

πŸ–Š Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooksβ€”insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
https://t.me/DataScienceN

πŸ“Ί Free Online Courses | Videos
Free online courses covering data science, machine learning, analytics, programming, and essential skills for learners.
https://t.me/DataScienceV

πŸ“ˆ Data Analytics
Dive into the world of Data Analytics – uncover insights, explore trends, and master data-driven decision making.
https://t.me/DataAnalyticsX

🎧 Learn Python Hub
Master Python with step-by-step courses – from basics to advanced projects and practical applications.
https://t.me/Python53

⭐️ Research Papers
Professional Academic Writing & Simulation Services
https://t.me/DataScienceY

━━━━━━━━━━━━━━━━━━
Admin: @HusseinSheikho
Please open Telegram to view this post
VIEW IN TELEGRAM
❀1
✨A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

πŸ“ Summary:
This survey reviews self-evolving AI agents that adapt to dynamic environments via automatic enhancement from interaction data. It proposes a unified framework and systematically reviews current techniques, addressing evaluation, safety, and ethics.

πŸ”Ή Publication Date: Published on Aug 10

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2508.07407
β€’ PDF: https://arxiv.org/pdf/2508.07407
β€’ Project Page: https://huggingface.co/spaces/X-iZhang/Awesome-Self-Evolving-Agents
β€’ Github: https://github.com/EvoAgentX/Awesome-Self-Evolving-Agents

✨ Spaces citing this paper:
β€’ https://huggingface.co/spaces/X-iZhang/Awesome-Self-Evolving-Agents

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#SelfEvolvingAI #AIAgents #FoundationModels #LifelongLearning #ArtificialIntelligence
❀1
πŸš€ Master Data Science & Programming!

Unlock your potential with this curated list of Telegram channels. Whether you need books, datasets, interview prep, or project ideas, we have the perfect resource for you. Join the community today!


πŸ”° Machine Learning with Python
Learn Machine Learning with hands-on Python tutorials, real-world code examples, and clear explanations for researchers and developers.
https://t.me/CodeProgrammer

πŸ”– Machine Learning
Machine learning insights, practical tutorials, and clear explanations for beginners and aspiring data scientists. Follow the channel for models, algorithms, coding guides, and real-world ML applications.
https://t.me/DataScienceM

🧠 Code With Python
This channel delivers clear, practical content for developers, covering Python, Django, Data Structures, Algorithms, and DSA – perfect for learning, coding, and mastering key programming skills.
https://t.me/DataScience4

🎯 PyData Careers | Quiz
Python Data Science jobs, interview tips, and career insights for aspiring professionals.
https://t.me/DataScienceQ

πŸ’Ύ Kaggle Data Hub
Your go-to hub for Kaggle datasets – explore, analyze, and leverage data for Machine Learning and Data Science projects.
https://t.me/datasets1

πŸ§‘β€πŸŽ“ Udemy Coupons | Courses
The first channel in Telegram that offers free Udemy coupons
https://t.me/DataScienceC

πŸ˜€ ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.
https://t.me/DataScienceT

πŸ’¬ Data Science Chat
An active community group for discussing data challenges and networking with peers.
https://t.me/DataScience9

🐍 Python Arab| Ψ¨Ψ§ΩŠΨ«ΩˆΩ† عربي
The largest Arabic-speaking group for Python developers to share knowledge and help.
https://t.me/PythonArab

πŸ–Š Data Science Jupyter Notebooks
Explore the world of Data Science through Jupyter Notebooksβ€”insights, tutorials, and tools to boost your data journey. Code, analyze, and visualize smarter with every post.
https://t.me/DataScienceN

πŸ“Ί Free Online Courses | Videos
Free online courses covering data science, machine learning, analytics, programming, and essential skills for learners.
https://t.me/DataScienceV

πŸ“ˆ Data Analytics
Dive into the world of Data Analytics – uncover insights, explore trends, and master data-driven decision making.
https://t.me/DataAnalyticsX

🎧 Learn Python Hub
Master Python with step-by-step courses – from basics to advanced projects and practical applications.
https://t.me/Python53

⭐️ Research Papers
Professional Academic Writing & Simulation Services
https://t.me/DataScienceY

━━━━━━━━━━━━━━━━━━
Admin: @HusseinSheikho
Please open Telegram to view this post
VIEW IN TELEGRAM
❀1
This media is not supported in your browser
VIEW IN TELEGRAM
✨InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

πŸ“ Summary:
InsertAnywhere is a framework for realistic video object insertion. It uses 4D aware mask generation for geometric consistency and an extended diffusion model for appearance-faithful synthesis, outperforming existing methods.

πŸ”Ή Publication Date: Published on Dec 19

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.17504
β€’ PDF: https://arxiv.org/pdf/2512.17504
β€’ Project Page: https://myyzzzoooo.github.io/InsertAnywhere/
β€’ Github: https://github.com/myyzzzoooo/InsertAnywhere

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#VideoEditing #DiffusionModels #ComputerVision #DeepLearning #GenerativeAI
❀1
✨MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

πŸ“ Summary:
MAI-UI introduces a family of foundation GUI agents tackling real-world deployment challenges. It uses a self-evolving data pipeline, device-cloud collaboration, and online RL to set new state-of-the-art in GUI grounding and mobile navigation, significantly boosting performance and privacy.

πŸ”Ή Publication Date: Published on Dec 26

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.22047
β€’ PDF: https://arxiv.org/pdf/2512.22047

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#GUIAgents #AI #ReinforcementLearning #MobileTech #HCI
❀2
✨See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

πŸ“ Summary:
Bi-directional Perceptual Shaping BiPS improves vision-language models by using question-conditioned masked views to shape perception during training. It employs two constraints to ensure complete coverage of relevant pixels and enforce fine-grained visual reliance, preventing text-only shortcuts...

πŸ”Ή Publication Date: Published on Dec 26

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.22120
β€’ PDF: https://arxiv.org/pdf/2512.22120
β€’ Github: https://github.com/zss02/BiPS

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#MultimodalAI #VisionLanguageModels #MachineLearning #AIResearch #DeepLearning
❀1
✨ProEdit: Inversion-based Editing From Prompts Done Right

πŸ“ Summary:
Inversion-based visual editing provides an effective and training-free way to edit an image or a video based on user instructions. Existing methods typically inject source image information during the...

πŸ”Ή Publication Date: Published on Dec 26

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.22118
β€’ PDF: https://arxiv.org/pdf/2512.22118
β€’ Project Page: https://isee-laboratory.github.io/ProEdit/
β€’ Github: https://isee-laboratory.github.io/ProEdit

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1
✨SVBench: Evaluation of Video Generation Models on Social Reasoning

πŸ“ Summary:
Recent text-to-video generation models exhibit remarkable progress in visual realism, motion fidelity, and text-video alignment, yet they remain fundamentally limited in their ability to generate soci...

πŸ”Ή Publication Date: Published on Dec 25

πŸ”Ή Paper Links:
β€’ arXiv Page: https://arxiv.org/abs/2512.21507
β€’ PDF: https://arxiv.org/pdf/2512.21507
β€’ Github: https://github.com/Gloria2tt/SVBench-Evaluation

==================================

For more data science resources:
βœ“ https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
❀1