🚨 AI News | TestingCatalog
6.15K subscribers
3.5K photos
535 videos
40 files
4.1K links
Latest AI News on AI Agents, Model Releases, Tools, Leaks, and Rumors πŸ—ž
Download Telegram
Early look at Imagine Agent Mode on Grok app for iOS!

Users will be able to use Imagine Agent via a mobile optimised native UI to generate images and videos that require more complex workflows.

SpaceXAI is getting quite ahead of everyone else on this front!

We just need Imagine v2 now πŸ‘€

Additionally, Skills are coming soon on mobile as well.
πŸ‘4❀11
Hermes vs OpenClaw πŸ₯Š

Hermes Agent overtook OpenClaw on the Global OpenRouter token ranking and claimed the first spot.

Tokens are a new currency!
❀9πŸ”₯5
Skills in action on Grok for iOS πŸ‘€

* not available yet
πŸ‘53❀1
We will likely see a deeper integration between Codex and ChatGPT already very soon.

> Use the ChatGPT app on your phone to keep working with Codex whenever your computer is awake.

Additionally, this image from OpenAI sparked loads of speculations, including the one where OpenAI would be teasing their own mobile phone.

Even though it is quite unrealistic, this would be a huge steal of attention from the Google I/O event.
7❀6πŸ‘3πŸ‘1
GOOGLE I/O πŸ”₯: New evidence of the upcoming Gemini Omni vide model has been spotted on the Gemini mobile app.

A video sample below πŸ‘€

> "Meet our new video model. Remix your videos, edit directly in chat, try a template, and more."

> Based on the description, we might be really talking about the true "Omni" model based on Gemini, rather than Veo.

> It also seems to be quickly consuming usage limits, based on early tests. "Usage" is a new tab that will be available on both the web and mobile.
7❀53🀩2
🚨 AI News | TestingCatalog
GOOGLE I/O πŸ”₯: New evidence of the upcoming Gemini Omni vide model has been spotted on the Gemini mobile app. A video sample below πŸ‘€ > "Meet our new video model. Remix your videos, edit directly in chat, try a template, and more." > Based on the description…
This media is not supported in your browser
VIEW IN TELEGRAM
Sample video and early feedback (quotes from Reddit)

> I won’t lie, this is one of the best video models I have seen, maybe not *the* best, but a really strong performance. I was particularly impressed by the prompt adherence (except for the one shot with the missing centerpiece), the model nailed all the constraints.

> Additionally, the voice quality is much better than the Veo models by quite a large margin. It even added some light background music, that would fit right in with an upscale dining experience.

> While there are some continuity issues if you look close enough, the ability to change camera angles on the fly so frequently and with good coherence is impressive to me. Overall this is definitely the new model and quite a step up from the Veo we are used to
❀126πŸ”₯4πŸ‘1
OPENAI πŸ”₯: A mention of a new Ultrafast mode appeared for some time on the Codex GitHub repository.

> "The fastest available responses for latency-sensitive work."

Seems like it was unintended push.
❀5πŸ‘3πŸ”₯3πŸ†’1
🚨 AI News | TestingCatalog
Sample video and early feedback (quotes from Reddit) > I won’t lie, this is one of the best video models I have seen, maybe not *the* best, but a really strong performance. I was particularly impressed by the prompt adherence (except for the one shot with…
This media is not supported in your browser
VIEW IN TELEGRAM
GOOGLE πŸ”₯: An upcoming Gemini Omni video model from Google is expected to be much more advanced in video editing, capable of completing tasks like removing watermarks, replacing objects in the video, and more.

It is also likely that Google will release 2 versions of this model, including a Pro variant.

And I assume what we see isn't Pro?

Anime sample πŸ‘€
h/t @QuantumFast
7❀5🀬31
Google’s Gemini Omni video model surfaces ahead of I/O debut

Leaked Gemini Omni details point to Google unveiling a unified video model at I/O, with strong in-chat editing and remix tools but generation quality trailing Seedance 2. Credit-based limits and possible Flash/Pro tiers also surfaced.

πŸ—ž #gemini @testingcatalog
πŸ‘42❀1
Google keeps preparing its upcoming Gemini Omni models for the release.

> Gemini Omni model will be available on APIs as well

> The model will be considered as Agent, similarly to Deep Research on AI Studio

Soon? πŸ‘€
❀9πŸ‘6πŸ”₯51
Anthropic adds Agent View to Claude Code CLI interface

Anthropic’s Agent View for Claude Code adds a CLI dashboard for managing parallel coding sessions in one place. It shows status, activity, and input needs, supports background jobs, and is available now in Research Preview.

πŸ—ž #claude @testingcatalog
4πŸ‘2❀1
THINKING MACHINES πŸ”₯: Research preview of a new family of realtime voice models have been announced!

> Today, we’re announcing a research preview of interaction models: models that handle interaction natively rather than through external scaffolding.

> Our research preview demonstrates qualitatively new interaction capabilities, as well as state-of-the-art combined performance in intelligence and responsiveness.

A new SOTA?! πŸ‘€
❀4πŸ‘44
OpenAI announces Daybreak initiative around Codex Security

OpenAI launched Daybreak, a cybersecurity program that extends Codex into secure code review, threat modeling, patch validation, and detection support, with verified access, partner integrations, and rollout for defenders and enterprises.

πŸ—ž #chatgpt @testingcatalog
❀4πŸ‘4πŸ”₯2