Offshore
Photo
Robert Scoble
RT @brunotorious: 🤖 Check out https://t.co/mbDLcZPeqo! We're proud to announce that we are now open source, so you can use our code to create your own amazing projects. Check out our GitHub repository here: https://t.co/wfYYIdJ9Hl

#OpenSource #Code #ai https://t.co/pZZybwZ5AE
tweet
Offshore
Video
Robert Scoble
RT @openbrushapp: Open Brush 2.0 releasing TOMORROW 🥳 Here's one last teaser…

Introducing PLY and WebM support! Enhance your scenes with point cloud data and WebM videos 🌺

This is just the start of a huge upgrade to our import/export pipeline 💪 #vr #mr #opensource https://t.co/vYIUTozcc3
tweet
Hugging Face
RT @VoltronData: Inside the #OpenSource Report: Excellent technology. A vibrant community. Knowledge to share. Yep, that’s @huggingface. If it isn’t clear by now, read our report to see why they’re a top project to watch this year:

https://t.co/r3FaDrXg7D
tweet
Offshore
Video
Robert Scoble
RT @ernerfeldt: We’re making Rerun open source today! @Rerundotio is now available with `pip install rerun-sdk` and `cargo add rerun`
#computervision #robotics #opensource #rustlang https://t.co/ploumHSMYz
tweet
Offshore
Photo
Nazneen Rajani in SF for the Open-Source AI meetup
I'll be there tomorrow at the Exploratorium!

Find me to know more about our efforts on instruction fine-tuning, LLaMA, Alpaca, RLHF, PM, red-teaming, and evaluation as we build the open-source alternative to ChatGPT, the 🤗H4. https://t.co/T42hEH6GHP
Holy cow!! is this the biggest AI meetup ever? Crazy! Well done, @ClementDelangue and @huggingface team

This is 🤗🤗🤗 power.

I’ll be here to demo @Scenario_gg 🤟🏻

#SanFrancisco #AI #opensource https://t.co/NupO5o66Yk
- Emm
tweet
Offshore
Photo
DAIR.AI
RT @omarsar0: This team has been publishing some really interesting work on diffusion LLMs.

LLaDA 2.1 is a 100B discrete diffusion LLM with a draft-then-edit approach.

It hits a peak speed of 892 tokens/s on complex coding tasks.

Autoregressive models commit to every token permanently but LLaDA 2.1 can go back and fix mistakes mid-generation. The error handling capabilities are worth looking into.

What if an LLM could EDIT its own tokens in real-time, not just generate them? 🤯
Introducing LLaDA2.1, a diffusion model that breaks from autoregressive dominance. It drafts fast, then fixes its own mistakes on the fly with Token-to-Token editing.
The result? 892 tokens/sec on a 100B model. 🔥
⚡ 892 TPS on HumanEval+ (coding)
⚡ 801 TPS on BigCodeBench
🧠 Real-time self-correction via T2T editing
✅ @lmsysorg SGLang Day 0 support, production-ready now
A "non-consensus" architecture now challenging the mainstream. Open-sourced TODAY. 👇
#LLaDA #TokenEditing #OpenSource #LLM #dLLM
- Ant Open Source
tweet
Offshore
Photo
Brady Long
The era of “prompt and wait for a response” seems to be over.

As soon as I saw this I went immediately to Hugging Face to try it out. Nuts.

https://t.co/JlERquiIcQ

MiniCPM-o 4.5: Seeing, Listening, and Speaking - All at Once. 👁️👂🗣️

✨ Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
🔔 Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨ Why it’s a category-leader:
📌 Performance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
📌 Architecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
📌 Efficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
🚀 Experience it on Hugging Face: 🔗 https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision
- OpenBMB
tweet
Offshore
Photo
Brady Long
RT @bigaiguy: If this existed like 10 years ago my Grandpa never would have beaten me in Gin 🤔

Insane. See for yourself on Hugging Face https://t.co/yDVUJ2lMp8

MiniCPM-o 4.5: Seeing, Listening, and Speaking - All at Once. 👁️👂🗣️

✨ Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
🔔 Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨ Why it’s a category-leader:
📌 Performance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
📌 Architecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
📌 Efficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
🚀 Experience it on Hugging Face: 🔗 https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision
- OpenBMB
tweet