https://github.com/darinkishore/codex_dspy
darinkishore/codex_dspy: DSPy module for OpenAI Codex SDK - signature-driven agentic workflows
darinkishore/codex_dspy: DSPy module for OpenAI Codex SDK - signature-driven agentic workflows
GitHub
GitHub - darinkishore/codex_dspy: DSPy module for OpenAI Codex SDK - signature-driven agentic workflows
DSPy module for OpenAI Codex SDK - signature-driven agentic workflows - darinkishore/codex_dspy
https://deepmind.google/blog/teaching-ai-to-see-the-world-more-like-we-do/?utm_source=x&utm_medium=social&utm_campaign=&utm_content=
Teaching AI to See the World More Like Humans Do - Google DeepMind
Teaching AI to See the World More Like Humans Do - Google DeepMind
Google DeepMind
Teaching AI to See the World More Like Humans Do
Aligning AI vision models with human knowledge, improves their robustness and ability to generalize.
https://www.youtube.com/watch?v=Xb34YmbEiOc
Ray Summit 2025 Keynote: The Shift to LLM Fine-Tuning with Thinking Machines - YouTube
Ray Summit 2025 Keynote: The Shift to LLM Fine-Tuning with Thinking Machines - YouTube
YouTube
Ray Summit 2025 Keynote: The Shift to LLM Fine-Tuning with Thinking Machines
Devendra Chaplot, Member of Technical Staff at Thinking Machines, closes out Day 1 with an in-depth look at Tinkr, the powerful system driving innovation in AI and machine learning development.
In this session, Devendra walks through the story of how Tinkr…
In this session, Devendra walks through the story of how Tinkr…
https://www.worldlabs.ai/blog/marble-world-model
Marble: A Multimodal World Model | World Labs
Marble: A Multimodal World Model | World Labs
www.worldlabs.ai
Marble: A Multimodal World Model
Marble, our frontier multimodal world model, is available to everyone starting today
https://robocasa.ai/
RoboCasa
RoboCasa is a large-scale simulation framework for training generally capable robots to perform everyday tasks. It features realistic and diverse human-centered environments with a focus on kitchen scenes. We create these environments with the aid of generative AI tools, such as large language models (LLMs) and text-to-image/3D generative models. We provide over 2,500 3D assets across 150+ object categories and dozens of interactable furniture and appliances. As part of the first release, we include a suite of 100 tasks, representing a wide spectrum of everyday activities. Together with the simulated tasks, we offer a dataset of high-quality human demonstrations and leverage automated trajectory generation techniques to significantly expand the amount of training data with little additional cost.
RoboCasa
RoboCasa is a large-scale simulation framework for training generally capable robots to perform everyday tasks. It features realistic and diverse human-centered environments with a focus on kitchen scenes. We create these environments with the aid of generative AI tools, such as large language models (LLMs) and text-to-image/3D generative models. We provide over 2,500 3D assets across 150+ object categories and dozens of interactable furniture and appliances. As part of the first release, we include a suite of 100 tasks, representing a wide spectrum of everyday activities. Together with the simulated tasks, we offer a dataset of high-quality human demonstrations and leverage automated trajectory generation techniques to significantly expand the amount of training data with little additional cost.
robocasa-web
https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook#introduction
The Smol Training Playbook - a Hugging Face Space by HuggingFaceTB
The Smol Training Playbook - a Hugging Face Space by HuggingFaceTB
huggingface.co
The Smol Training Playbook: The Secrets to Building World-Class LLMs - a Hugging Face Space by HuggingFaceTB
Discover amazing ML apps made by the community
https://github.com/NVIDIA/Isaac-GR00T
NVIDIA/Isaac-GR00T: NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.
NVIDIA/Isaac-GR00T: NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.
GitHub
GitHub - NVIDIA/Isaac-GR00T: NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.
NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots. - NVIDIA/Isaac-GR00T
https://www.youtube.com/watch?v=Zphax4f6Rls
Introducing SIMA 2, the next milestone in our research creating general and helpful AI agents. - YouTube
Introducing SIMA 2, the next milestone in our research creating general and helpful AI agents. - YouTube
YouTube
SIMA 2: An agent that plays, reasons, and learns with you in virtual 3D worlds
We’re introducing SIMA 2, the next major milestone in general and helpful embodied AI agents. 👾
With Gemini integrated at its core, it moves beyond following basic instructions to think, learn, and collaborate in complex, 3D worlds.
🔵 Advanced reasoning:…
With Gemini integrated at its core, it moves beyond following basic instructions to think, learn, and collaborate in complex, 3D worlds.
🔵 Advanced reasoning:…
https://www.youtube.com/watch?v=_AZ7ptRuzs8
Introducing Scribe v2 Realtime - YouTube
Introducing Scribe v2 Realtime - YouTube
YouTube
Introducing Scribe v2 Realtime
Today we’re introducing a state-of-the-art Speech to Text model.
Scribe v2 Realtime is the most accurate low-latency model, delivering live transcription in under 150ms for voice agents, meeting notetakers, and live apps — across 90+ languages!
Key highlights:…
Scribe v2 Realtime is the most accurate low-latency model, delivering live transcription in under 150ms for voice agents, meeting notetakers, and live apps — across 90+ languages!
Key highlights:…