Offshore

Luma AI
RT @vibrantnebula: I 3D scanned a Lion diorama using @LumaLabsAI 🦁😍 #3D #3dscanning #AI #NeRF #lion #animals #aiart #aiartcommunity #3dscan #3drender #3dart #museum #diorama #computervision #endangeredspecies https://t.co/lEOItGN1VQ
tweet

2 views21:58

Offshore

Video

Robert Scoble
RT @ernerfeldt: We’re making Rerun open source today! @Rerundotio is now available with `pip install rerun-sdk` and `cargo add rerun`
#computervision #robotics #opensource #rustlang https://t.co/ploumHSMYz
tweet

1 view16:00

Offshore

GIF

Hugues Bruyère
Through the mirror, again...
but this time at an immersive 1:1 scale

NeRF trained and being rendered in real-time on a tethered Quest 2 using @NVIDIAAIDev Instant NeRF⁣.

#ai #InstantNeRF #VR #InstantNeRFVR #neuralrendering #computervision https://t.co/g0m7yHWika
tweet

1 view04:25

Offshore

Photo

Brady Long
The era of “prompt and wait for a response” seems to be over.

As soon as I saw this I went immediately to Hugging Face to try it out. Nuts.

https://t.co/JlERquiIcQ



MiniCPM-o 4.5: Seeing, Listening, and Speaking — All at Once. 👁️👂🗣️

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
🔔 Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
📌Performance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
📌Architecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
📌Efficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
🚀Experience it on Hugging Face: 🔗https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision

- OpenBMB
tweet

1 view16:20

Offshore

Photo

Brady Long
RT @bigaiguy: If this existed like 10 years ago my Grandpa never would have beaten me in Gin🤔

Insane. See for yourself on Hugging Face https://t.co/yDVUJ2lMp8



MiniCPM-o 4.5: Seeing, Listening, and Speaking — All at Once. 👁️👂🗣️

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
🔔 Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
📌Performance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
📌Architecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
📌Efficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
🚀Experience it on Hugging Face: 🔗https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision

- OpenBMB
tweet

1 view16:47

Offshore

Photo

Brady Long
RT @thisguyknowsai: The era of “prompt and wait for a response” seems to be over.

As soon as I saw this I went immediately to Hugging Face to try it out. Nuts.

https://t.co/JlERquiIcQ



MiniCPM-o 4.5: Seeing, Listening, and Speaking — All at Once. 👁️👂🗣️

✨Beyond traditional turn-taking, we’ve built a Native Full-Duplex engine that allows a 9B model to see, listen, and speak in one concurrent, non-blocking stream.

Watch how it masters real-world complexity in real-time:
🔔 Proactive Auditory Interaction: Interrupts itself to alert you when it hears a "Ding!" while reading cards.
🎨 Temporal Flow Tracking: Follows your pen in real-time, narrating and "mind-reading" your drawing as you sketch.
🍎 Omni-Perception: Scans groceries & identifies prices on the fly.

✨Why it’s a category-leader:
📌Performance: Surpasses GPT-4o and Gemini 2.0 Pro on OpenCompass (Avg. 77.6).
📌Architecture: End-to-end fusion of SigLip2, Whisper, and CosyVoice2 on a Qwen3-8B base.
📌Efficiency: Full-duplex live streaming now runs locally on PCs via llama.cpp-omni.

The era of "Wait-and-Response" AI is over. Proactive, real-time intelligence is now open-source.
🚀Experience it on Hugging Face: 🔗https://t.co/KzzgiGYhVr

#MiniCPM #Omnimodal #FullDuplex #EdgeAI #OpenSource #ComputerVision

- OpenBMB
tweet

1 view09:28

About

Blog

Apps

Platform