ML Research Hub
32.8K subscribers
4.29K photos
258 videos
23 files
4.63K links
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
Download Telegram
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

📝 Summary:
OpenDevin is a platform for developing AI agents that interact with the world by writing code, using command lines, and browsing the web, with support for multiple agents and evaluation benchmarks. AI...

🔹 Publication Date: Published on Jul 23, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2407.16741
• PDF: https://arxiv.org/pdf/2407.16741
• Github: https://github.com/OpenDevin/OpenDevin/?tab=readme-ov-file#-join-our-community

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

📝 Summary:
InfiniteVGGT enables continuous 3D visual geometry understanding through a causal transformer with adaptive memory management, outperforming existing streaming methods in long-term stability while int...

🔹 Publication Date: Published on Jan 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02281
• PDF: https://arxiv.org/pdf/2601.02281
• Github: https://github.com/AutoLab-SAI-SJTU/InfiniteVGGT

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

📝 Summary:
NextFlow is a unified decoder-only autoregressive transformer that processes interleaved text-image tokens, enabling fast multimodal generation through novel next-token and next-scale prediction strat...

🔹 Publication Date: Published on Jan 5

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02204
• PDF: https://arxiv.org/pdf/2601.02204
• Github: https://github.com/ByteVisionLab/NextFlow

Datasets citing this paper:
https://huggingface.co/datasets/madebyollin/megalith-10m

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Zep: A Temporal Knowledge Graph Architecture for Agent Memory

📝 Summary:
Zep, a memory layer service, outperforms MemGPT in the DMR benchmark and LongMemEval by excelling in dynamic knowledge integration and temporal reasoning, critical for enterprise use cases. AI-generat...

🔹 Publication Date: Published on Jan 20, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2501.13956
• PDF: https://arxiv.org/pdf/2501.13956
• Github: https://github.com/getzep/graphiti

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

📝 Summary:
DataFlow is an LLM-driven data preparation framework that enhances data quality and reproducibility for various tasks, improving LLM performance with automatically generated pipelines. AI-generated su...

🔹 Publication Date: Published on Dec 18, 2025

🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/dataflow-an-llm-driven-framework-for-unified-data-preparation-and-workflow-automation-in-the-era-of-data-centric-ai-3906-5f097fd0
• PDF: https://arxiv.org/pdf/2512.16676
• Project Page: https://github.com/OpenDCAI/DataFlow
• Github: https://github.com/OpenDCAI/DataFlow

Datasets citing this paper:
https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Text2SQL
https://huggingface.co/datasets/OpenDCAI/dataflow-instruct-10k
https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Reasoning

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions

📝 Summary:
Existing feedforward subject-driven video customization methods mainly study single-subject scenarios due to the difficulty of constructing multi-subject training data pairs. Another challenging probl...

🔹 Publication Date: Published on Jun 29, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.23361
• PDF: https://arxiv.org/pdf/2506.23361
• Project Page: https://caiyuanhao1998.github.io/project/OmniVCus/
• Github: https://github.com/caiyuanhao1998/Open-OmniVCus

🔹 Models citing this paper:
https://huggingface.co/CaiYuanhao/OmniVCus

Datasets citing this paper:
https://huggingface.co/datasets/CaiYuanhao/OmniVCus
https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Test
https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Train

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

📝 Summary:
mmGRPO, a multi-module extension of GRPO, enhances accuracy in modular AI systems by optimizing LM calls and prompts across various tasks. AI-generated summary Group Relative Policy Optimization ( GRP...

🔹 Publication Date: Published on Aug 6, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04660
• PDF: https://arxiv.org/pdf/2508.04660
• Project Page: https://dspy.ai
• Github: https://github.com/stanfordnlp/dspy

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

📝 Summary:
InternVL3 is a multimodal pre-trained language model that jointly learns from both multimodal data and text, improving performance and scalability through advanced techniques and setting a new state-o...

🔹 Publication Date: Published on Apr 14, 2025

🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/internvl3-exploring-advanced-training-and-test-time-recipes-for-open-source-multimodal-models-4439-1c8e76a9
• PDF: https://arxiv.org/pdf/2504.10479
• Project Page: https://internvl.github.io/blog/2025-04-11-InternVL-3.0/

🔹 Models citing this paper:
https://huggingface.co/OpenGVLab/InternVL3-78B
https://huggingface.co/OpenGVLab/InternVL3_5-241B-A28B
https://huggingface.co/OpenGVLab/InternVL3-8B

Datasets citing this paper:
https://huggingface.co/datasets/OpenGVLab/MMPR-v1.2-prompts

Spaces citing this paper:
https://huggingface.co/spaces/AntResearchNLP/ViLaBench
https://huggingface.co/spaces/TIGER-Lab/MEGA-Bench
https://huggingface.co/spaces/developer0hye/InternVL3-8B

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

📝 Summary:
Dolphin, a multimodal document image parsing model, uses heterogeneous anchor prompting to achieve state-of-the-art performance on diverse page-level and element-level tasks through an efficient analy...

🔹 Publication Date: Published on May 20, 2025

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2505.14059
• PDF: https://arxiv.org/pdf/2505.14059
• Github: https://github.com/bytedance/dolphin

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research
LightRAG: Simple and Fast Retrieval-Augmented Generation

📝 Summary:
LightRAG improves Retrieval-Augmented Generation by integrating graph structures for enhanced contextual awareness and efficient information retrieval, achieving better accuracy and response times. AI...

🔹 Publication Date: Published on Oct 8, 2024

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2410.05779
• PDF: https://arxiv.org/pdf/2410.05779
• Github: https://github.com/hkuds/lightrag

Spaces citing this paper:
https://huggingface.co/spaces/rm-lht/lightrag

==================================

For more data science resources:
https://t.me/DataScienceT

#AI #DataScience #MachineLearning #HuggingFace #Research