🚨 AI News | TestingCatalog
6.3K subscribers
3.57K photos
548 videos
40 files
4.12K links
Latest AI News on AI Agents, Model Releases, Tools, Leaks, and Rumors πŸ—ž
Download Telegram
Jules agent becomes more context-aware with AGENTS.md support

The updated Jules agent brings improved performance, smarter task execution, consistent environment setup, and context awareness through AGENTS.md, aligning with unique repository workflows and supporting automated test writing and execution.

πŸ—ž #jules
πŸ”₯3❀1
Upcoming Grok update will introduce centralized Files tab

xAI is adding a Files tab to Grok’s web interface, centralizing file management with filters and built-in tools for viewing, editing, and executing content, aligning with efforts to position Grok as a full productivity platform.

πŸ—ž #grok
πŸ‘6
Anthropic developing Memory and AI-powered Artifacts for Claude

Anthropic is expanding Claude's artifacts to support embedded AI capabilities, enabling users to build functional, shareable mini-apps without coding. A new memory feature is also in development to allow Claude to recall past conversations.

πŸ—ž #claude
πŸ”₯3
AI Studio revamp under development with MCP support and Jules tie-in

Google is preparing updates to AI Studio, including a unified build section linked to Jules SWE Agent, support for Imagen 4, a modular tools menu, and daily usage limits, signaling broader capabilities and platform management.

πŸ—ž #aistudio
❀4πŸ”₯3πŸ‘1
Google launched Imagen 4 and Imagen 4 Ultra on AI Studio and APIs

Google released Imagen 4 via the Gemini API with limited free access starting June 24, 2025. It generates four 1024Γ—1024 images per call, includes SynthID watermarking, and supports up to 2K resolution through Vertex AI.

πŸ—ž #aistudio
❀4πŸ‘4
Google launches open-source Gemini CLI Agent with MCP support

Gemini CLI is a free, open-source tool that gives developers direct command-line access to Google's Gemini AI models. It supports coding, content generation, research, and automation, and is powered by Gemini 2.5 Pro with a high usage quota.

πŸ—ž #aistudio
πŸ”₯5❀3πŸ‘1
xAI skips Grok 3.5 and eyes July launch for Grok 4 with new developer features

xAI will bypass Grok 3.5 and move directly to Grok 4, expected after July 4, focusing on autonomous coding support via a web-based editor modeled on VSCode. The move positions Grok to compete with GPT-5 and Gemini updates.

πŸ—ž #grok
❀6
Perplexity prepares $200 subscription to target Labs power users

Perplexity is developing Perplexity Max, a $200/month premium plan aimed at professional users, offering access to Labs, advanced models, early features, and priority support, positioning itself among top-tier AI platforms.

πŸ—ž #perplexity
🀯3😱2🀣2
Google tests Drive search and AI flashcards for NotebookLM

Google is expanding NotebookLM with source discovery across Google Drive, AI-generated flashcards, and public sharing tools, aligning the platform more closely with educational and organizational productivity workflows.

πŸ—ž #notebooklm
πŸ”₯4❀2
xAI prepares Grok 4 and Grok 4 Code for upcoming launch

xAI is preparing to release Grok 4 and Grok 4 Code, targeting general-purpose and code-specific tasks. Grok 4 supports text and vision inputs, while Grok 4 Code integrates with developer tools like Cursor for in-editor code assistance.

πŸ—ž #grok
❀6πŸ‘1😁1
Dia browser to bring back vertical tabs in upcoming update

Dia browser’s upcoming update adds a vertical tabs sidebar and tab search, addressing longstanding user requests. It also introduces skill creation from liked prompts and pulls deeper context from YouTube for its AI-driven features.

πŸ—ž #dia
❀4πŸ‘1
Red teams access Niptune v3 in lead-up to new Claude model

Anthropic is testing Niptune v3, its next safety system iteration, ahead of a likely Claude 4.1 or 4.2 release. Red team access signals the final stage before launch, aligning with the company’s model update cycle and emphasis on safety protocols.

πŸ—ž #claude
πŸ”₯4πŸ‘1
Grok 4 benchmarks leak with 45% score on Humanity Last Exam

xAI's Grok 4 model, though not yet officially released, shows signs of nearing launch amid rising pressure from competitors. Early benchmarks suggest strong performance gains that could position it ahead of current leading LLMs.

πŸ—ž #grok
πŸ‘4❀3πŸ’©1
OpenAI experiments with new "Study together" tool on ChatGPT

OpenAI is testing a "Study together" feature in ChatGPT that guides users through subjects with questions and step-by-step explanations. Aimed at students and educators, it may support broader educational use in future releases.

πŸ—ž #chatgpt
πŸ”₯7❀1πŸ‘1
Claude Neptune v3 shows major math gains in red team testing

Claude Neptune v3 is showing consistent performance in solving complex math problems previously dominated by top-tier models. Its deployment under the guise of existing Opus 4 configurations raises questions about upcoming model releases.

πŸ—ž #claude
πŸ”₯6πŸ‘1
Vidu Q1 update brings full support for Reference-to-Video feature

Vidu's Q1 model introduces Reference-to-Video, allowing up to seven reference images to generate stylistically consistent videos based on prompts. This tool supports modular content creation without scripts or manual editing.
πŸ‘3πŸ”₯2❀1
Grok 4 set for July 9 debut as xAI plans expanded model lineup

xAI is set to launch Grok 4 and Grok Code 4 on July 9, with possible variants like Grok 4 Extended. While performance metrics have improved, moderation issues on X may impact public access despite the planned announcement.

πŸ—ž #grok
πŸ‘5
Anthropic tests new Connectors Directory with desktop automation tools

Anthropic is preparing a connectors directory for its web app, featuring general and desktop-specific MCPs. This initiative supports its MCP standard, aiming to make task automation more accessible without coding knowledge.

πŸ—ž #claude
πŸ”₯5