Offshore
Video
Robert Scoble
RT @ernerfeldt: We’re making Rerun open source today! @Rerundotio is now available with `pip install rerun-sdk` and `cargo add rerun`
#computervision #robotics #opensource #rustlang https://t.co/ploumHSMYz
tweet
Offshore
GIF
Hugues Bruyère
Through the mirror, again...
but this time at an immersive 1:1 scale

NeRF trained and being rendered in real-time on a tethered Quest 2 using @NVIDIAAIDev Instant NeRF⁣.

#ai #InstantNeRF #VR #InstantNeRFVR #neuralrendering #computervision https://t.co/g0m7yHWika
tweet
Offshore
Photo
Brady Long
The era of β€œprompt and wait for a response” seems to be over.

As soon as I saw this I went immediately to Hugging Face to try it out. Nuts.

https://t.co/JlERquiIcQ

MiniCPM-o 4.5: Seeing, Listening, and Speaking β€” All at Once. πŸ‘οΈπŸ‘‚πŸ—£οΈ

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
πŸ”” Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
πŸ“ŒPerformance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
πŸ“ŒArchitecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
πŸ“ŒEfficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
πŸš€Experience it on Hugging Face: πŸ”—https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision
- OpenBMB
tweet
Offshore
Photo
Brady Long
RT @bigaiguy: If this existed like 10 years ago my Grandpa never would have beaten me in GinπŸ€”

Insane. See for yourself on Hugging Face https://t.co/yDVUJ2lMp8

MiniCPM-o 4.5: Seeing, Listening, and Speaking β€” All at Once. πŸ‘οΈπŸ‘‚πŸ—£οΈ

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
πŸ”” Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
πŸ“ŒPerformance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
πŸ“ŒArchitecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
πŸ“ŒEfficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
πŸš€Experience it on Hugging Face: πŸ”—https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision
- OpenBMB
tweet
Offshore
Photo
Brady Long
RT @thisguyknowsai: The era of β€œprompt and wait for a response” seems to be over.

As soon as I saw this I went immediately to Hugging Face to try it out. Nuts.

https://t.co/JlERquiIcQ

MiniCPM-o 4.5: Seeing, Listening, and Speaking β€” All at Once. πŸ‘οΈπŸ‘‚πŸ—£οΈ

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
πŸ”” Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
πŸ“ŒPerformance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
πŸ“ŒArchitecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
πŸ“ŒEfficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
πŸš€Experience it on Hugging Face: πŸ”—https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision
- OpenBMB
tweet