✨OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
📝 Summary:
OpenDevin is a platform for developing AI agents that interact with the world by writing code, using command lines, and browsing the web, with support for multiple agents and evaluation benchmarks. AI...
🔹 Publication Date: Published on Jul 23, 2024
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2407.16741
• PDF: https://arxiv.org/pdf/2407.16741
• Github: https://github.com/OpenDevin/OpenDevin/?tab=readme-ov-file#-join-our-community
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
OpenDevin is a platform for developing AI agents that interact with the world by writing code, using command lines, and browsing the web, with support for multiple agents and evaluation benchmarks. AI...
🔹 Publication Date: Published on Jul 23, 2024
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2407.16741
• PDF: https://arxiv.org/pdf/2407.16741
• Github: https://github.com/OpenDevin/OpenDevin/?tab=readme-ov-file#-join-our-community
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams
📝 Summary:
InfiniteVGGT enables continuous 3D visual geometry understanding through a causal transformer with adaptive memory management, outperforming existing streaming methods in long-term stability while int...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02281
• PDF: https://arxiv.org/pdf/2601.02281
• Github: https://github.com/AutoLab-SAI-SJTU/InfiniteVGGT
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
InfiniteVGGT enables continuous 3D visual geometry understanding through a causal transformer with adaptive memory management, outperforming existing streaming methods in long-term stability while int...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02281
• PDF: https://arxiv.org/pdf/2601.02281
• Github: https://github.com/AutoLab-SAI-SJTU/InfiniteVGGT
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation
📝 Summary:
NextFlow is a unified decoder-only autoregressive transformer that processes interleaved text-image tokens, enabling fast multimodal generation through novel next-token and next-scale prediction strat...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02204
• PDF: https://arxiv.org/pdf/2601.02204
• Github: https://github.com/ByteVisionLab/NextFlow
✨ Datasets citing this paper:
• https://huggingface.co/datasets/madebyollin/megalith-10m
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
NextFlow is a unified decoder-only autoregressive transformer that processes interleaved text-image tokens, enabling fast multimodal generation through novel next-token and next-scale prediction strat...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02204
• PDF: https://arxiv.org/pdf/2601.02204
• Github: https://github.com/ByteVisionLab/NextFlow
✨ Datasets citing this paper:
• https://huggingface.co/datasets/madebyollin/megalith-10m
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Zep: A Temporal Knowledge Graph Architecture for Agent Memory
📝 Summary:
Zep, a memory layer service, outperforms MemGPT in the DMR benchmark and LongMemEval by excelling in dynamic knowledge integration and temporal reasoning, critical for enterprise use cases. AI-generat...
🔹 Publication Date: Published on Jan 20, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2501.13956
• PDF: https://arxiv.org/pdf/2501.13956
• Github: https://github.com/getzep/graphiti
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Zep, a memory layer service, outperforms MemGPT in the DMR benchmark and LongMemEval by excelling in dynamic knowledge integration and temporal reasoning, critical for enterprise use cases. AI-generat...
🔹 Publication Date: Published on Jan 20, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2501.13956
• PDF: https://arxiv.org/pdf/2501.13956
• Github: https://github.com/getzep/graphiti
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
📝 Summary:
DataFlow is an LLM-driven data preparation framework that enhances data quality and reproducibility for various tasks, improving LLM performance with automatically generated pipelines. AI-generated su...
🔹 Publication Date: Published on Dec 18, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/dataflow-an-llm-driven-framework-for-unified-data-preparation-and-workflow-automation-in-the-era-of-data-centric-ai-3906-5f097fd0
• PDF: https://arxiv.org/pdf/2512.16676
• Project Page: https://github.com/OpenDCAI/DataFlow
• Github: https://github.com/OpenDCAI/DataFlow
✨ Datasets citing this paper:
• https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Text2SQL
• https://huggingface.co/datasets/OpenDCAI/dataflow-instruct-10k
• https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Reasoning
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
DataFlow is an LLM-driven data preparation framework that enhances data quality and reproducibility for various tasks, improving LLM performance with automatically generated pipelines. AI-generated su...
🔹 Publication Date: Published on Dec 18, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/dataflow-an-llm-driven-framework-for-unified-data-preparation-and-workflow-automation-in-the-era-of-data-centric-ai-3906-5f097fd0
• PDF: https://arxiv.org/pdf/2512.16676
• Project Page: https://github.com/OpenDCAI/DataFlow
• Github: https://github.com/OpenDCAI/DataFlow
✨ Datasets citing this paper:
• https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Text2SQL
• https://huggingface.co/datasets/OpenDCAI/dataflow-instruct-10k
• https://huggingface.co/datasets/OpenDCAI/dataflow-demo-Reasoning
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Arxivlens
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI - AI Research…
AI-powered analysis of 'DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI'. The rapidly growing demand for high-quality data in Large Language Models (LLMs) has intensified the need for scalable…
✨OmniVCus: Feedforward Subject-driven Video Customization with Multimodal Control Conditions
📝 Summary:
Existing feedforward subject-driven video customization methods mainly study single-subject scenarios due to the difficulty of constructing multi-subject training data pairs. Another challenging probl...
🔹 Publication Date: Published on Jun 29, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.23361
• PDF: https://arxiv.org/pdf/2506.23361
• Project Page: https://caiyuanhao1998.github.io/project/OmniVCus/
• Github: https://github.com/caiyuanhao1998/Open-OmniVCus
🔹 Models citing this paper:
• https://huggingface.co/CaiYuanhao/OmniVCus
✨ Datasets citing this paper:
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Test
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Train
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Existing feedforward subject-driven video customization methods mainly study single-subject scenarios due to the difficulty of constructing multi-subject training data pairs. Another challenging probl...
🔹 Publication Date: Published on Jun 29, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2506.23361
• PDF: https://arxiv.org/pdf/2506.23361
• Project Page: https://caiyuanhao1998.github.io/project/OmniVCus/
• Github: https://github.com/caiyuanhao1998/Open-OmniVCus
🔹 Models citing this paper:
• https://huggingface.co/CaiYuanhao/OmniVCus
✨ Datasets citing this paper:
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Test
• https://huggingface.co/datasets/CaiYuanhao/OmniVCus-Train
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
OmniVCus: Feedforward Subject-driven Video Customization with...
Existing feedforward subject-driven video customization methods mainly study single-subject scenarios due to the difficulty of constructing multi-subject training data pairs. Another challenging...
✨Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs
📝 Summary:
mmGRPO, a multi-module extension of GRPO, enhances accuracy in modular AI systems by optimizing LM calls and prompts across various tasks. AI-generated summary Group Relative Policy Optimization ( GRP...
🔹 Publication Date: Published on Aug 6, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04660
• PDF: https://arxiv.org/pdf/2508.04660
• Project Page: https://dspy.ai
• Github: https://github.com/stanfordnlp/dspy
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
mmGRPO, a multi-module extension of GRPO, enhances accuracy in modular AI systems by optimizing LM calls and prompts across various tasks. AI-generated summary Group Relative Policy Optimization ( GRP...
🔹 Publication Date: Published on Aug 6, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.04660
• PDF: https://arxiv.org/pdf/2508.04660
• Project Page: https://dspy.ai
• Github: https://github.com/stanfordnlp/dspy
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
📝 Summary:
InternVL3 is a multimodal pre-trained language model that jointly learns from both multimodal data and text, improving performance and scalability through advanced techniques and setting a new state-o...
🔹 Publication Date: Published on Apr 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/internvl3-exploring-advanced-training-and-test-time-recipes-for-open-source-multimodal-models-4439-1c8e76a9
• PDF: https://arxiv.org/pdf/2504.10479
• Project Page: https://internvl.github.io/blog/2025-04-11-InternVL-3.0/
🔹 Models citing this paper:
• https://huggingface.co/OpenGVLab/InternVL3-78B
• https://huggingface.co/OpenGVLab/InternVL3_5-241B-A28B
• https://huggingface.co/OpenGVLab/InternVL3-8B
✨ Datasets citing this paper:
• https://huggingface.co/datasets/OpenGVLab/MMPR-v1.2-prompts
✨ Spaces citing this paper:
• https://huggingface.co/spaces/AntResearchNLP/ViLaBench
• https://huggingface.co/spaces/TIGER-Lab/MEGA-Bench
• https://huggingface.co/spaces/developer0hye/InternVL3-8B
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
InternVL3 is a multimodal pre-trained language model that jointly learns from both multimodal data and text, improving performance and scalability through advanced techniques and setting a new state-o...
🔹 Publication Date: Published on Apr 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/internvl3-exploring-advanced-training-and-test-time-recipes-for-open-source-multimodal-models-4439-1c8e76a9
• PDF: https://arxiv.org/pdf/2504.10479
• Project Page: https://internvl.github.io/blog/2025-04-11-InternVL-3.0/
🔹 Models citing this paper:
• https://huggingface.co/OpenGVLab/InternVL3-78B
• https://huggingface.co/OpenGVLab/InternVL3_5-241B-A28B
• https://huggingface.co/OpenGVLab/InternVL3-8B
✨ Datasets citing this paper:
• https://huggingface.co/datasets/OpenGVLab/MMPR-v1.2-prompts
✨ Spaces citing this paper:
• https://huggingface.co/spaces/AntResearchNLP/ViLaBench
• https://huggingface.co/spaces/TIGER-Lab/MEGA-Bench
• https://huggingface.co/spaces/developer0hye/InternVL3-8B
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Arxivlens
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models - AI Research Paper Analysis |…
Open-Source Multimodal Models - AI Research Paper Analysis |…
AI-powered analysis of 'InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models'. We introduce InternVL3, a significant advancement in the InternVL series
featuring a native multimodal pre-training paradigm. Rather…
Open-Source Multimodal Models'. We introduce InternVL3, a significant advancement in the InternVL series
featuring a native multimodal pre-training paradigm. Rather…
✨Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting
📝 Summary:
Dolphin, a multimodal document image parsing model, uses heterogeneous anchor prompting to achieve state-of-the-art performance on diverse page-level and element-level tasks through an efficient analy...
🔹 Publication Date: Published on May 20, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2505.14059
• PDF: https://arxiv.org/pdf/2505.14059
• Github: https://github.com/bytedance/dolphin
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Dolphin, a multimodal document image parsing model, uses heterogeneous anchor prompting to achieve state-of-the-art performance on diverse page-level and element-level tasks through an efficient analy...
🔹 Publication Date: Published on May 20, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2505.14059
• PDF: https://arxiv.org/pdf/2505.14059
• Github: https://github.com/bytedance/dolphin
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨LightRAG: Simple and Fast Retrieval-Augmented Generation
📝 Summary:
LightRAG improves Retrieval-Augmented Generation by integrating graph structures for enhanced contextual awareness and efficient information retrieval, achieving better accuracy and response times. AI...
🔹 Publication Date: Published on Oct 8, 2024
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2410.05779
• PDF: https://arxiv.org/pdf/2410.05779
• Github: https://github.com/hkuds/lightrag
✨ Spaces citing this paper:
• https://huggingface.co/spaces/rm-lht/lightrag
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
LightRAG improves Retrieval-Augmented Generation by integrating graph structures for enhanced contextual awareness and efficient information retrieval, achieving better accuracy and response times. AI...
🔹 Publication Date: Published on Oct 8, 2024
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2410.05779
• PDF: https://arxiv.org/pdf/2410.05779
• Github: https://github.com/hkuds/lightrag
✨ Spaces citing this paper:
• https://huggingface.co/spaces/rm-lht/lightrag
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SAM Audio: Segment Anything in Audio
📝 Summary:
SAM Audio, a diffusion transformer-based foundation model, achieves superior performance in general audio separation using unified text, visual, and temporal span prompts across various audio types. A...
🔹 Publication Date: Published on Dec 19, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/sam-audio-segment-anything-in-audio-1718-de85c75a
• PDF: https://arxiv.org/pdf/2512.18099
• Project Page: https://ai.meta.com/samaudio/
• Github: https://github.com/facebookresearch/sam-audio
🔹 Models citing this paper:
• https://huggingface.co/facebook/sam-audio-large
• https://huggingface.co/facebook/sam-audio-small
• https://huggingface.co/facebook/sam-audio-base
✨ Datasets citing this paper:
• https://huggingface.co/datasets/facebook/sam-audio-bench
✨ Spaces citing this paper:
• https://huggingface.co/spaces/lpeterl/sam-audio-webui
• https://huggingface.co/spaces/Arrcttacsrks/SAM-Audio-Demo
• https://huggingface.co/spaces/chippie1/SAM-Audio-Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
SAM Audio, a diffusion transformer-based foundation model, achieves superior performance in general audio separation using unified text, visual, and temporal span prompts across various audio types. A...
🔹 Publication Date: Published on Dec 19, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/sam-audio-segment-anything-in-audio-1718-de85c75a
• PDF: https://arxiv.org/pdf/2512.18099
• Project Page: https://ai.meta.com/samaudio/
• Github: https://github.com/facebookresearch/sam-audio
🔹 Models citing this paper:
• https://huggingface.co/facebook/sam-audio-large
• https://huggingface.co/facebook/sam-audio-small
• https://huggingface.co/facebook/sam-audio-base
✨ Datasets citing this paper:
• https://huggingface.co/datasets/facebook/sam-audio-bench
✨ Spaces citing this paper:
• https://huggingface.co/spaces/lpeterl/sam-audio-webui
• https://huggingface.co/spaces/Arrcttacsrks/SAM-Audio-Demo
• https://huggingface.co/spaces/chippie1/SAM-Audio-Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Arxivlens
SAM Audio: Segment Anything in Audio - AI Research Paper Analysis | ArxivLens
AI-powered analysis of 'SAM Audio: Segment Anything in Audio'. General audio source separation is a key capability for multimodal AI systems that can perceive and reason about sound. Despite substantial progress i... Explore with advanced AI tools for machine…
✨GigaBrain-0: A World Model-Powered Vision-Language-Action Model
📝 Summary:
GigaBrain-0, a VLA foundation model, uses world model-generated data to enhance cross-task generalization and policy robustness, improving real-world performance on complex manipulation tasks. AI-gene...
🔹 Publication Date: Published on Oct 22, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.19430
• PDF: https://arxiv.org/pdf/2510.19430
• Project Page: https://gigabrain0.github.io/
• Github: https://github.com/open-gigaai/giga-brain-0
🔹 Models citing this paper:
• https://huggingface.co/open-gigaai/GigaBrain-0-3.5B-Base
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
GigaBrain-0, a VLA foundation model, uses world model-generated data to enhance cross-task generalization and policy robustness, improving real-world performance on complex manipulation tasks. AI-gene...
🔹 Publication Date: Published on Oct 22, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.19430
• PDF: https://arxiv.org/pdf/2510.19430
• Project Page: https://gigabrain0.github.io/
• Github: https://github.com/open-gigaai/giga-brain-0
🔹 Models citing this paper:
• https://huggingface.co/open-gigaai/GigaBrain-0-3.5B-Base
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨PDFMathTranslate: Scientific Document Translation Preserving Layouts
📝 Summary:
PDFMathTranslate enables layout-preserving scientific document translation using large language models and precise layout detection, offering improved precision, flexibility, and efficiency. AI-genera...
🔹 Publication Date: Published on Jul 2, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.03009
• PDF: https://arxiv.org/pdf/2507.03009
• Github: https://github.com/byaidu/pdfmathtranslate
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
PDFMathTranslate enables layout-preserving scientific document translation using large language models and precise layout detection, offering improved precision, flexibility, and efficiency. AI-genera...
🔹 Publication Date: Published on Jul 2, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2507.03009
• PDF: https://arxiv.org/pdf/2507.03009
• Github: https://github.com/byaidu/pdfmathtranslate
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel
📝 Summary:
PyTorch Fully Sharded Data Parallel (FSDP) enables efficient and scalable training of large models across hardware configurations. AI-generated summary It is widely acknowledged that large models have...
🔹 Publication Date: Published on Apr 21, 2023
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2304.11277
• PDF: https://arxiv.org/pdf/2304.11277
• Github: https://github.com/pytorch/pytorch/blob/main/torch/distributed/fsdp/fully_sharded_data_parallel.py
🔹 Models citing this paper:
• https://huggingface.co/Undi95/dbrx-base
• https://huggingface.co/alpindale/dbrx-instruct
• https://huggingface.co/SinclairSchneider/dbrx-instruct-quantization-fixed
✨ Spaces citing this paper:
• https://huggingface.co/spaces/nanotron/ultrascale-playbook
• https://huggingface.co/spaces/Ki-Seki/ultrascale-playbook-zh-cn
• https://huggingface.co/spaces/weege007/ultrascale-playbook
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
PyTorch Fully Sharded Data Parallel (FSDP) enables efficient and scalable training of large models across hardware configurations. AI-generated summary It is widely acknowledged that large models have...
🔹 Publication Date: Published on Apr 21, 2023
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2304.11277
• PDF: https://arxiv.org/pdf/2304.11277
• Github: https://github.com/pytorch/pytorch/blob/main/torch/distributed/fsdp/fully_sharded_data_parallel.py
🔹 Models citing this paper:
• https://huggingface.co/Undi95/dbrx-base
• https://huggingface.co/alpindale/dbrx-instruct
• https://huggingface.co/SinclairSchneider/dbrx-instruct-quantization-fixed
✨ Spaces citing this paper:
• https://huggingface.co/spaces/nanotron/ultrascale-playbook
• https://huggingface.co/spaces/Ki-Seki/ultrascale-playbook-zh-cn
• https://huggingface.co/spaces/weege007/ultrascale-playbook
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel
It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains. Despite the remarkable progress made in the field of machine...
✨PyTorch Distributed: Experiences on Accelerating Data Parallel Training
📝 Summary:
The PyTorch distributed data parallel module optimizes large-scale model training using techniques like gradient bucketing, computation-communication overlap, and selective synchronization to achieve ...
🔹 Publication Date: Published on Jun 28, 2020
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2006.15704
• PDF: https://arxiv.org/pdf/2006.15704
• Github: https://github.com/pytorch/pytorch/blob/master/torch/nn/parallel/distributed.py
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
The PyTorch distributed data parallel module optimizes large-scale model training using techniques like gradient bucketing, computation-communication overlap, and selective synchronization to achieve ...
🔹 Publication Date: Published on Jun 28, 2020
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2006.15704
• PDF: https://arxiv.org/pdf/2006.15704
• Github: https://github.com/pytorch/pytorch/blob/master/torch/nn/parallel/distributed.py
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Video Generation Models Are Good Latent Reward Models
📝 Summary:
PRFL optimizes video generation preferences in latent space, improving alignment with human preferences while reducing memory consumption and training time. AI-generated summary Reward feedback learni...
🔹 Publication Date: Published on Nov 26, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.21541
• PDF: https://arxiv.org/pdf/2511.21541
• Project Page: https://hy-video-prfl.github.io/HY-VIDEO-PRFL/
• Github: https://github.com/Tencent-Hunyuan/HY-Video-PRFL
🔹 Models citing this paper:
• https://huggingface.co/tencent/HY-Video-PRFL
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
PRFL optimizes video generation preferences in latent space, improving alignment with human preferences while reducing memory consumption and training time. AI-generated summary Reward feedback learni...
🔹 Publication Date: Published on Nov 26, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.21541
• PDF: https://arxiv.org/pdf/2511.21541
• Project Page: https://hy-video-prfl.github.io/HY-VIDEO-PRFL/
• Github: https://github.com/Tencent-Hunyuan/HY-Video-PRFL
🔹 Models citing this paper:
• https://huggingface.co/tencent/HY-Video-PRFL
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research