According to Bloomberg, Apple is testing Gemini and other AI models to power a new version of Siri.
Which one would win from your pov? 👀
Which one would win from your pov? 👀
👾42❤1
This media is not supported in your browser
VIEW IN TELEGRAM
Microsoft dropped VibeVoice, a new open-source text-to-speech model
It can handle long-form audio and is capable of generating up to 90 min of multi-speaker conversations using just 1.5B parameters
It can handle long-form audio and is capable of generating up to 90 min of multi-speaker conversations using just 1.5B parameters
👾42❤1
Please open Telegram to view this post
VIEW IN TELEGRAM
👾40
Please open Telegram to view this post
VIEW IN TELEGRAM
👾41
This media is not supported in your browser
VIEW IN TELEGRAM
Alibaba released Wan2.2-S2V - a 14B model designed for film-grade video making. Now available on the Wan website.
👾43👍2
This media is not supported in your browser
VIEW IN TELEGRAM
Nano-banana, the image editing AI that ranked #1, just debuted as Google's Gemini 2.5 Flash Image
With multimodal reasoning and world knowledge, it supports consistent multi-turn edits and can even blend images
Available for free and paid Gemini users
With multimodal reasoning and world knowledge, it supports consistent multi-turn edits and can even blend images
Available for free and paid Gemini users
👾40
This media is not supported in your browser
VIEW IN TELEGRAM
AI2 unveiled Asta, a suite of agentic tools for scientific research, including:
—Asta agents to assist researchers with scientific tasks
—AstaBench suite & leaderboards for evaluating agents
—Asta resources software components to create and extend agents
—Asta agents to assist researchers with scientific tasks
—AstaBench suite & leaderboards for evaluating agents
—Asta resources software components to create and extend agents
👾40
This media is not supported in your browser
VIEW IN TELEGRAM
Anthropic just launched Claude for Chrome, enabling agentic browsing via a new extension
It is being piloted via a waitlist exclusively for 1,000 Claude Max subscribers in a limited preview
Comet and Dia are soon going to have some serious competition
It is being piloted via a waitlist exclusively for 1,000 Claude Max subscribers in a limited preview
Comet and Dia are soon going to have some serious competition
👾39👍2
This media is not supported in your browser
VIEW IN TELEGRAM
Now you can forward your emails to Manus AI so it can deal with them instead of you.
Cold emails are cooked 👀
Cold emails are cooked 👀
👾41
This media is not supported in your browser
VIEW IN TELEGRAM
Krea AI is opening a waitlist for a new Real-time Video generation model.
"Behind the scenes, our approach is motivated by modern “world model” ideas: systems that learn how scenes evolve and how actions ripple forward in time."
"Behind the scenes, our approach is motivated by modern “world model” ideas: systems that learn how scenes evolve and how actions ripple forward in time."
👾41
This media is not supported in your browser
VIEW IN TELEGRAM
Stitch by Google got a Canvas feature! Now you can plot and edit a whole UI journey over there.
So much to test 🤯
* Attaching a proper video
So much to test 🤯
* Attaching a proper video
👾41
Some interesting Nano Banana/Gemini 2.5 Flash Image use cases 👇🧵
Check comments
👾42🔥3⚡1🏆1
Core Ai News
Some interesting Nano Banana/Gemini 2.5 Flash Image use cases 👇🧵 Check comments
Google Officially Cooked Photoshop and 10+ Start-ups
👾41⚡1🔥1💯1
What do you mainly use AI for▶️
Anonymous Poll
39%
1. Coding & Automation 🖥
62%
2. Asking Everyday Questions ✔️
17%
3. Image / Video Generation ©
27%
4. Writing & Content ✍️
8%
Other (comment below)
👾42
OpenAI just published their official prompting guide for GPT-5
Source - https://cdn.openai.com/API/docs/gpt-5-for-coding-cheatsheet.pdf
#prompt
Source - https://cdn.openai.com/API/docs/gpt-5-for-coding-cheatsheet.pdf
#prompt
👾40❤🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
OpenAI launched gpt-realtime speech-to-speech model with remote MCPs and image support
It has nuanced abilities like detecting nonverbal cues and switching languages while keeping a natural conversation
Scored 82.8% on audio reasoning benchmarks
It has nuanced abilities like detecting nonverbal cues and switching languages while keeping a natural conversation
Scored 82.8% on audio reasoning benchmarks
👾39🤯1