ML Research Hub
Advancing research in Machine Learning – practical insights, tools, and techniques for researchers.

Admin: @HusseinSheikho || @Hussein_Sheikho
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

📝 Summary:
ToolOrchestra uses reinforcement learning to train small orchestrator models that coordinate intelligent tools. With this method, an 8B orchestrator outperforms GPT-5 on complex tasks such as Humanity's Last Exam, reaching higher accuracy at significantly lower cost.
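
💡 Toy sketch (not the paper's code): to make the idea in the summary concrete, the snippet below shows an orchestrator loop that picks a tool per turn and is scored by a reward that trades correctness against cost. Everything here is an assumption for illustration: the names `Tool`, `toy_policy`, `orchestrate`, `reward`, and `cost_weight`, the cost values, and the reward formula are hypothetical. In ToolOrchestra the routing policy is a model trained with reinforcement learning; a hand-written rule stands in for it here.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Tool:
    name: str
    cost: float                 # hypothetical cost per call (arbitrary units)
    run: Callable[[str], str]   # maps a query string to a text result


def toy_policy(query: str, history: List[Tuple[str, str]]) -> str:
    """Stand-in for a learned orchestrator: returns a tool name or 'finish'."""
    if history:  # in this toy setup a single tool call is enough
        return "finish"
    # Send queries containing digits to the cheap tool, the rest to the expensive one.
    return "calculator" if any(ch.isdigit() for ch in query) else "big_model"


def orchestrate(
    query: str,
    tools: Dict[str, Tool],
    policy: Callable[[str, List[Tuple[str, str]]], str],
    max_turns: int = 3,
) -> Tuple[str, float]:
    """Call tools chosen by the policy until it finishes; return (answer, total cost)."""
    history: List[Tuple[str, str]] = []
    total_cost = 0.0
    for _ in range(max_turns):
        action = policy(query, history)
        if action == "finish":
            break
        tool = tools[action]
        history.append((tool.name, tool.run(query)))
        total_cost += tool.cost
    answer = history[-1][1] if history else ""
    return answer, total_cost


def reward(correct: bool, total_cost: float, cost_weight: float = 0.1) -> float:
    """Assumed reward shape: 1 for a correct answer, minus a weighted cost penalty."""
    return (1.0 if correct else 0.0) - cost_weight * total_cost


TOOLS = {
    # Cheap tool: sums the integer tokens found in the query.
    "calculator": Tool(
        "calculator",
        cost=0.01,
        run=lambda q: str(sum(int(t) for t in q.split() if t.isdigit())),
    ),
    # Expensive tool: placeholder for a call to a large model.
    "big_model": Tool("big_model", cost=1.0, run=lambda q: "stub answer"),
}

answer, cost = orchestrate("add 2 and 40", TOOLS, toy_policy)
print(answer, cost, reward(answer == "42", cost))
```

An RL setup would replace `toy_policy` with a trainable model and use a signal like `reward` to update it, so that cheap tools are preferred whenever they are sufficient.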

🔹 Publication Date: Nov 26

🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.21689
• PDF: https://arxiv.org/pdf/2511.21689
• Project Page: https://research.nvidia.com/labs/lpr/ToolOrchestra/
• GitHub: https://github.com/NVlabs/ToolOrchestra/

🔹 Models citing this paper:
https://huggingface.co/nvidia/Orchestrator-8B
https://huggingface.co/Mungert/Orchestrator-8B-GGUF
https://huggingface.co/cyankiwi/Orchestrator-8B-AWQ-4bit

🔹 Datasets citing this paper:
https://huggingface.co/datasets/nvidia/ToolScale
https://huggingface.co/datasets/victor/ToolScale
https://huggingface.co/datasets/FranckAbgrall/ToolScale

==================================

For more data science resources:
https://t.me/DataScienceT

#ToolOrchestra #ModelOrchestration #ReinforcementLearning #LLMs #AI