✨Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
📝 Summary:
Safety alignment of large language models can be fully recovered with a single safety example, maintaining utility and achieving convergence in few epochs through identified low-rank gradient structur...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01887
• PDF: https://arxiv.org/pdf/2601.01887
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Safety alignment of large language models can be fully recovered with a single safety example, maintaining utility and achieving convergence in few epochs through identified low-rank gradient structur...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.01887
• PDF: https://arxiv.org/pdf/2601.01887
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
📝 Summary:
We present MiroThinker v1.0, an open-source research agent designed to advance tool-augmented reasoning and information-seeking capabilities. Unlike previous agents that only scale up model size or co...
🔹 Publication Date: Published on Nov 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/mirothinker-pushing-the-performance-boundaries-of-open-source-research-agents-via-model-context-and-interactive-scaling-9611-0f2289e7
• PDF: https://arxiv.org/pdf/2511.11793
• Project Page: https://dr.miromind.ai/
• Github: https://github.com/MiroMindAI/MiroThinker
🔹 Models citing this paper:
• https://huggingface.co/miromind-ai/MiroThinker-v1.5-235B
• https://huggingface.co/miromind-ai/MiroThinker-v1.5-30B
• https://huggingface.co/miromind-ai/MiroThinker-v1.0-72B
✨ Datasets citing this paper:
• https://huggingface.co/datasets/miromind-ai/MiroVerse-v0.1
✨ Spaces citing this paper:
• https://huggingface.co/spaces/zoom-ai/hle-leaderboard
• https://huggingface.co/spaces/miromind-ai/MiroMind-Open-Source-Deep-Research
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
We present MiroThinker v1.0, an open-source research agent designed to advance tool-augmented reasoning and information-seeking capabilities. Unlike previous agents that only scale up model size or co...
🔹 Publication Date: Published on Nov 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxivlens.com/PaperView/Details/mirothinker-pushing-the-performance-boundaries-of-open-source-research-agents-via-model-context-and-interactive-scaling-9611-0f2289e7
• PDF: https://arxiv.org/pdf/2511.11793
• Project Page: https://dr.miromind.ai/
• Github: https://github.com/MiroMindAI/MiroThinker
🔹 Models citing this paper:
• https://huggingface.co/miromind-ai/MiroThinker-v1.5-235B
• https://huggingface.co/miromind-ai/MiroThinker-v1.5-30B
• https://huggingface.co/miromind-ai/MiroThinker-v1.0-72B
✨ Datasets citing this paper:
• https://huggingface.co/datasets/miromind-ai/MiroVerse-v0.1
✨ Spaces citing this paper:
• https://huggingface.co/spaces/zoom-ai/hle-leaderboard
• https://huggingface.co/spaces/miromind-ai/MiroMind-Open-Source-Deep-Research
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Arxivlens
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling - AI…
AI-powered analysis of 'MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling'. We present MiroThinker v1.0, an open-source research agent designed to advance tool-augmented reasoning and…
✨LTX-2: Efficient Joint Audio-Visual Foundation Model
📝 Summary:
LTX-2 is an open-source audiovisual diffusion model that generates synchronized video and audio content using a dual-stream transformer architecture with cross-modal attention and classifier-free guid...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03233
• PDF: https://arxiv.org/pdf/2601.03233
• Project Page: https://huggingface.co/papers/2511.12072
• Github: https://github.com/Lightricks/LTX-2
🔹 Models citing this paper:
• https://huggingface.co/Lightricks/LTX-2
• https://huggingface.co/unsloth/LTX-2-GGUF
• https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Canny-Control
✨ Spaces citing this paper:
• https://huggingface.co/spaces/Lightricks/ltx-2-distilled
• https://huggingface.co/spaces/Lightricks/ltx-2
• https://huggingface.co/spaces/alexnasa/ltx-2-TURBO
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
LTX-2 is an open-source audiovisual diffusion model that generates synchronized video and audio content using a dual-stream transformer architecture with cross-modal attention and classifier-free guid...
🔹 Publication Date: Published on Jan 6
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.03233
• PDF: https://arxiv.org/pdf/2601.03233
• Project Page: https://huggingface.co/papers/2511.12072
• Github: https://github.com/Lightricks/LTX-2
🔹 Models citing this paper:
• https://huggingface.co/Lightricks/LTX-2
• https://huggingface.co/unsloth/LTX-2-GGUF
• https://huggingface.co/Lightricks/LTX-2-19b-IC-LoRA-Canny-Control
✨ Spaces citing this paper:
• https://huggingface.co/spaces/Lightricks/ltx-2-distilled
• https://huggingface.co/spaces/Lightricks/ltx-2
• https://huggingface.co/spaces/alexnasa/ltx-2-TURBO
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
LTX-2: Efficient Joint Audio-Visual Foundation Model
Recent text-to-video diffusion models can generate compelling video sequences, yet they remain silent -- missing the semantic, emotional, and atmospheric cues that audio provides. We introduce...
✨SimpleMem: Efficient Lifelong Memory for LLM Agents
📝 Summary:
To support reliable long-term interaction in complex environments, LLM agents require memory systems that efficiently manage historical experiences. Existing approaches either retain full interaction ...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02553
• PDF: https://arxiv.org/pdf/2601.02553
• Project Page: https://aiming-lab.github.io/SimpleMem-Page/
• Github: https://aiming-lab.github.io/SimpleMem-Page/
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
To support reliable long-term interaction in complex environments, LLM agents require memory systems that efficiently manage historical experiences. Existing approaches either retain full interaction ...
🔹 Publication Date: Published on Jan 5
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2601.02553
• PDF: https://arxiv.org/pdf/2601.02553
• Project Page: https://aiming-lab.github.io/SimpleMem-Page/
• Github: https://aiming-lab.github.io/SimpleMem-Page/
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
📝 Summary:
SmolDocling is a compact vision-language model that performs end-to-end document conversion with robust performance across various document types using 256M parameters and a new markup format. AI-gene...
🔹 Publication Date: Published on Mar 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2503.11576
• PDF: https://huggingface.co/papers/2502.18443
• Project Page: https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo
• Github: https://github.com/docling-project/docling
🔹 Models citing this paper:
• https://huggingface.co/docling-project/SmolDocling-256M-preview
• https://huggingface.co/ibm-granite/granite-docling-258M
• https://huggingface.co/prithivMLmods/granite-docling-258M-f32-GGUF
✨ Datasets citing this paper:
• https://huggingface.co/datasets/HuggingFaceM4/DoclingMatix
• https://huggingface.co/datasets/docling-project/SynthCodeNet
• https://huggingface.co/datasets/docling-project/SynthFormulaNet
✨ Spaces citing this paper:
• https://huggingface.co/spaces/ibm-granite/granite-docling-258m-demo
• https://huggingface.co/spaces/ibm-granite/granite-docling-258M-WebGPU
• https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
SmolDocling is a compact vision-language model that performs end-to-end document conversion with robust performance across various document types using 256M parameters and a new markup format. AI-gene...
🔹 Publication Date: Published on Mar 14, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2503.11576
• PDF: https://huggingface.co/papers/2502.18443
• Project Page: https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo
• Github: https://github.com/docling-project/docling
🔹 Models citing this paper:
• https://huggingface.co/docling-project/SmolDocling-256M-preview
• https://huggingface.co/ibm-granite/granite-docling-258M
• https://huggingface.co/prithivMLmods/granite-docling-258M-f32-GGUF
✨ Datasets citing this paper:
• https://huggingface.co/datasets/HuggingFaceM4/DoclingMatix
• https://huggingface.co/datasets/docling-project/SynthCodeNet
• https://huggingface.co/datasets/docling-project/SynthFormulaNet
✨ Spaces citing this paper:
• https://huggingface.co/spaces/ibm-granite/granite-docling-258m-demo
• https://huggingface.co/spaces/ibm-granite/granite-docling-258M-WebGPU
• https://huggingface.co/spaces/docling-project/SmolDocling-256M-Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
SmolDocling: An ultra-compact vision-language model for end-to-end...
We introduce SmolDocling, an ultra-compact vision-language model targeting end-to-end document conversion. Our model comprehensively processes entire pages by generating DocTags, a new universal...
✨VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos
📝 Summary:
VideoRAG enhances large language models for multi-modal video processing with a dual-channel architecture that integrates textual knowledge grounding and multi-modal context encoding. AI-generated sum...
🔹 Publication Date: Published on Feb 3, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.01549
• PDF: https://arxiv.org/pdf/2502.01549
• Github: https://github.com/hkuds/videorag
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
VideoRAG enhances large language models for multi-modal video processing with a dual-channel architecture that integrates textual knowledge grounding and multi-modal context encoding. AI-generated sum...
🔹 Publication Date: Published on Feb 3, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.01549
• PDF: https://arxiv.org/pdf/2502.01549
• Github: https://github.com/hkuds/videorag
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Agent READMEs: An Empirical Study of Context Files for Agentic Coding
📝 Summary:
Agentic coding tools receive goals written in natural language as input, break them down into specific tasks, and write or execute the actual code with minimal human intervention. Central to this proc...
🔹 Publication Date: Published on Nov 17, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.12884
• PDF: https://arxiv.org/pdf/2511.12884
• Project Page: https://huggingface.co/papers/2511.03404
• Github: https://github.com/openai/agents.md
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Agentic coding tools receive goals written in natural language as input, break them down into specific tasks, and write or execute the actual code with minimal human intervention. Central to this proc...
🔹 Publication Date: Published on Nov 17, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.12884
• PDF: https://arxiv.org/pdf/2511.12884
• Project Page: https://huggingface.co/papers/2511.03404
• Github: https://github.com/openai/agents.md
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨Bitnet.cpp: Efficient Edge Inference for Ternary LLMs
📝 Summary:
Bitnet.cpp enhances edge inference for ternary LLMs using a novel mixed-precision matrix multiplication library, achieving significant speed improvements over baselines. AI-generated summary The adven...
🔹 Publication Date: Published on Feb 17, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.11880
• PDF: https://arxiv.org/pdf/2502.11880
• Github: https://github.com/microsoft/BitNet/tree/paper
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
Bitnet.cpp enhances edge inference for ternary LLMs using a novel mixed-precision matrix multiplication library, achieving significant speed improvements over baselines. AI-generated summary The adven...
🔹 Publication Date: Published on Feb 17, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2502.11880
• PDF: https://arxiv.org/pdf/2502.11880
• Github: https://github.com/microsoft/BitNet/tree/paper
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨BitNet b1.58 2B4T Technical Report
📝 Summary:
BitNet b1.58 2B4T, a 1-bit Large Language Model with 2 billion parameters, matches the performance of full-precision models while improving computational efficiency. AI-generated summary We introduce ...
🔹 Publication Date: Published on Apr 16, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2504.12285
• PDF: https://arxiv.org/pdf/2504.12285
• Github: https://github.com/microsoft/bitnet
🔹 Models citing this paper:
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-gguf
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-bf16
✨ Spaces citing this paper:
• https://huggingface.co/spaces/suayptalha/Chat-with-Bitnet-b1.58-2B-4T
• https://huggingface.co/spaces/aizip-dev/SLM-RAG-Arena
• https://huggingface.co/spaces/Tonic/Native_1-bit_LLM
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
BitNet b1.58 2B4T, a 1-bit Large Language Model with 2 billion parameters, matches the performance of full-precision models while improving computational efficiency. AI-generated summary We introduce ...
🔹 Publication Date: Published on Apr 16, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2504.12285
• PDF: https://arxiv.org/pdf/2504.12285
• Github: https://github.com/microsoft/bitnet
🔹 Models citing this paper:
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-gguf
• https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-bf16
✨ Spaces citing this paper:
• https://huggingface.co/spaces/suayptalha/Chat-with-Bitnet-b1.58-2B-4T
• https://huggingface.co/spaces/aizip-dev/SLM-RAG-Arena
• https://huggingface.co/spaces/Tonic/Native_1-bit_LLM
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
BitNet b1.58 2B4T Technical Report
We introduce BitNet b1.58 2B4T, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale. Trained on a corpus of 4 trillion tokens, the model has been...
✨BitNet Distillation
📝 Summary:
BitNet Distillation fine-tunes large language models to 1.58-bit precision using SubLN, multi-head attention distillation, and continual pre-training, achieving comparable performance with significant...
🔹 Publication Date: Published on Oct 15, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13998
• PDF: https://arxiv.org/pdf/2510.13998
• Github: https://github.com/microsoft/BitNet
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
BitNet Distillation fine-tunes large language models to 1.58-bit precision using SubLN, multi-head attention distillation, and continual pre-training, achieving comparable performance with significant...
🔹 Publication Date: Published on Oct 15, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.13998
• PDF: https://arxiv.org/pdf/2510.13998
• Github: https://github.com/microsoft/BitNet
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
✨LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
📝 Summary:
LlamaFactory is a unified framework enabling efficient fine-tuning of large language models across various tasks using a web-based user interface. AI-generated summary Efficient fine-tuning is vital f...
🔹 Publication Date: Published on Mar 20, 2024
🔹 Paper Links:
• arXiv Page: https://arxivexplained.com/papers/llamafactory-unified-efficient-fine-tuning-of-100-language-models
• PDF: https://arxiv.org/pdf/2403.13372
• Project Page: https://huggingface.co/spaces/hiyouga/LLaMA-Board
• Github: https://github.com/hiyouga/LLaMA-Factory
🔹 Models citing this paper:
• https://huggingface.co/AELLM/Llama-3.2-Chibi-3B
• https://huggingface.co/GXMZU/Qwen3-14B-ai-expert
• https://huggingface.co/XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k
✨ Spaces citing this paper:
• https://huggingface.co/spaces/hiyouga/LLaMA-Board
• https://huggingface.co/spaces/Justinrune/LLaMA-Factory
• https://huggingface.co/spaces/featherless-ai/try-this-model
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
LlamaFactory is a unified framework enabling efficient fine-tuning of large language models across various tasks using a web-based user interface. AI-generated summary Efficient fine-tuning is vital f...
🔹 Publication Date: Published on Mar 20, 2024
🔹 Paper Links:
• arXiv Page: https://arxivexplained.com/papers/llamafactory-unified-efficient-fine-tuning-of-100-language-models
• PDF: https://arxiv.org/pdf/2403.13372
• Project Page: https://huggingface.co/spaces/hiyouga/LLaMA-Board
• Github: https://github.com/hiyouga/LLaMA-Factory
🔹 Models citing this paper:
• https://huggingface.co/AELLM/Llama-3.2-Chibi-3B
• https://huggingface.co/GXMZU/Qwen3-14B-ai-expert
• https://huggingface.co/XavierSpycy/Meta-Llama-3-8B-Instruct-zh-10k
✨ Spaces citing this paper:
• https://huggingface.co/spaces/hiyouga/LLaMA-Board
• https://huggingface.co/spaces/Justinrune/LLaMA-Factory
• https://huggingface.co/spaces/featherless-ai/try-this-model
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
Arxivexplained
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models - Explained Simply
By Yaowei Zheng, Richong Zhang, Junhao Zhang et al.. # LlamaFactory: The Game-Changer That Makes AI Customization Accessible to Everyone
**The Problem:*...
**The Problem:*...
✨PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
📝 Summary:
PaddleOCR-VL, a vision-language model combining NaViT-style dynamic resolution and ERNIE, achieves state-of-the-art performance in document parsing and element recognition with high efficiency. AI-gen...
🔹 Publication Date: Published on Oct 16, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14528
• PDF: https://arxiv.org/pdf/2510.14528
• Github: https://github.com/PaddlePaddle/PaddleOCR
🔹 Models citing this paper:
• https://huggingface.co/PaddlePaddle/PaddleOCR-VL
• https://huggingface.co/PaddlePaddle/PP-DocLayoutV2
• https://huggingface.co/unsloth/PaddleOCR-VL
✨ Spaces citing this paper:
• https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL_Online_Demo
• https://huggingface.co/spaces/seanpedrickcase/document_redaction
• https://huggingface.co/spaces/markobinario/PaddleOCR-VL_Online_Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
PaddleOCR-VL, a vision-language model combining NaViT-style dynamic resolution and ERNIE, achieves state-of-the-art performance in document parsing and element recognition with high efficiency. AI-gen...
🔹 Publication Date: Published on Oct 16, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2510.14528
• PDF: https://arxiv.org/pdf/2510.14528
• Github: https://github.com/PaddlePaddle/PaddleOCR
🔹 Models citing this paper:
• https://huggingface.co/PaddlePaddle/PaddleOCR-VL
• https://huggingface.co/PaddlePaddle/PP-DocLayoutV2
• https://huggingface.co/unsloth/PaddleOCR-VL
✨ Spaces citing this paper:
• https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL_Online_Demo
• https://huggingface.co/spaces/seanpedrickcase/document_redaction
• https://huggingface.co/spaces/markobinario/PaddleOCR-VL_Online_Demo
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B...
In this report, we propose PaddleOCR-VL, a SOTA and resource-efficient model tailored for document parsing. Its core component is PaddleOCR-VL-0.9B, a compact yet powerful vision-language model...
✨VibeVoice Technical Report
📝 Summary:
VibeVoice synthesizes long-form multi-speaker speech using next-token diffusion and a highly efficient continuous speech tokenizer, achieving superior performance and fidelity. AI-generated summary Th...
🔹 Publication Date: Published on Aug 26, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.19205
• PDF: https://arxiv.org/pdf/2508.19205
• Project Page: https://microsoft.github.io/VibeVoice/
• Github: https://huggingface.co/collections/microsoft/vibevoice
🔹 Models citing this paper:
• https://huggingface.co/microsoft/VibeVoice-1.5B
• https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B
• https://huggingface.co/aoi-ot/VibeVoice-Large
✨ Spaces citing this paper:
• https://huggingface.co/spaces/ChaitanyaChandra/VibeVoice
• https://huggingface.co/spaces/lths/VibeVoice-Demo
• https://huggingface.co/spaces/yasserrmd/VibeVoice
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
VibeVoice synthesizes long-form multi-speaker speech using next-token diffusion and a highly efficient continuous speech tokenizer, achieving superior performance and fidelity. AI-generated summary Th...
🔹 Publication Date: Published on Aug 26, 2025
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2508.19205
• PDF: https://arxiv.org/pdf/2508.19205
• Project Page: https://microsoft.github.io/VibeVoice/
• Github: https://huggingface.co/collections/microsoft/vibevoice
🔹 Models citing this paper:
• https://huggingface.co/microsoft/VibeVoice-1.5B
• https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B
• https://huggingface.co/aoi-ot/VibeVoice-Large
✨ Spaces citing this paper:
• https://huggingface.co/spaces/ChaitanyaChandra/VibeVoice
• https://huggingface.co/spaces/lths/VibeVoice-Demo
• https://huggingface.co/spaces/yasserrmd/VibeVoice
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
arXiv.org
VibeVoice Technical Report
This report presents VibeVoice, a novel model designed to synthesize long-form speech with multiple speakers by employing next-token diffusion, which is a unified method for modeling continuous...
✨Efficient Memory Management for Large Language Model Serving with PagedAttention
📝 Summary:
PagedAttention algorithm and vLLM system enhance the throughput of large language models by efficiently managing memory and reducing waste in the key-value cache. AI-generated summary High throughput ...
🔹 Publication Date: Published on Sep 12, 2023
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2309.06180
• PDF: https://arxiv.org/pdf/2309.06180
• Github: https://github.com/vllm-project/vllm
🔹 Models citing this paper:
• https://huggingface.co/theonlyengine/Flash-attention1
✨ Datasets citing this paper:
• https://huggingface.co/datasets/TheBlueScrubs/TheBlueScrubs-v1
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research
📝 Summary:
PagedAttention algorithm and vLLM system enhance the throughput of large language models by efficiently managing memory and reducing waste in the key-value cache. AI-generated summary High throughput ...
🔹 Publication Date: Published on Sep 12, 2023
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2309.06180
• PDF: https://arxiv.org/pdf/2309.06180
• Github: https://github.com/vllm-project/vllm
🔹 Models citing this paper:
• https://huggingface.co/theonlyengine/Flash-attention1
✨ Datasets citing this paper:
• https://huggingface.co/datasets/TheBlueScrubs/TheBlueScrubs-v1
==================================
For more data science resources:
✓ https://t.me/DataScienceT
#AI #DataScience #MachineLearning #HuggingFace #Research