Paid Claude plans can claim a dedicated monthly credit (2 minute read)
Paid Claude plans will be able to claim a dedicated monthly credit for programmatic usage starting on June 15.
Paid Claude plans will be able to claim a dedicated monthly credit for programmatic usage starting on June 15.
X (formerly Twitter)
ClaudeDevs (@ClaudeDevs) on X
Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage.
The credit covers usage of:
- Claude Agent SDK
- claude -p
- Claude Code GitHub Actions
- Third-party apps built on the Agent SDK
The credit covers usage of:
- Claude Agent SDK
- claude -p
- Claude Code GitHub Actions
- Third-party apps built on the Agent SDK
Meta's AI Chief On AI Beef, New Models And Life With Zuck (3 minute read)
This post contains a video of Alex Wang's first interview since he started working with Meta.
This post contains a video of Alex Wang's first interview since he started working with Meta.
Corememory
Meta's AI Chief On AI Beef, New Models And Life With Zuck - EP 71 Alex Wang
He's emergent
Google plans to announce a new Gemini model (1 minute read)
The model, which will be announced at Google's annual I/O conference on Tuesday, will roughly be on par with GPT-5.5.
The model, which will be announced at Google's annual I/O conference on Tuesday, will roughly be on par with GPT-5.5.
X (formerly Twitter)
Jukan (@jukan05) on X
Sources: Google plans to announce a new Gemini model, roughly on par with GPT-5.5, at its annual I/O conference on Tuesday.
Adaption aims big with AutoScientist, an AI tool that helps models train themselves (2 minute read)
AutoScientist helps models learn specific capabilities quickly by using an automated approach to conventional fine-tuning.
AutoScientist helps models learn specific capabilities quickly by using an automated approach to conventional fine-tuning.
TechCrunch
Adaption aims big with AutoScientist, an AI tool that helps models train themselves | TechCrunch
Adaption's new AutoScientist tool is designed to let models adapt to specific capabilities quickly through an automated approach to conventional fine-tuning.
Introducing Grok Build (2 minute read)
Grok Build is a coding agent that runs from the terminal. It is now in early beta for SuperGrok Heavy subscribers. AGENTS.md, plugin, hooks, skills, and MCP servers all work out of the box. Grok Build supports subagents for larger tasks, and it also supports deep worktree integrations, so users can launch subagents in their own worktrees. There is a headless mode that allows the easy running of agents inside scripts and automations.
Grok Build is a coding agent that runs from the terminal. It is now in early beta for SuperGrok Heavy subscribers. AGENTS.md, plugin, hooks, skills, and MCP servers all work out of the box. Grok Build supports subagents for larger tasks, and it also supports deep worktree integrations, so users can launch subagents in their own worktrees. There is a headless mode that allows the easy running of agents inside scripts and automations.
x.ai
Introducing Grok Build
Now in early beta for SuperGrok Heavy subscribers — Grok Build is a new coding agent that runs right from your terminal.
Cloud Agent Development Environments (6 minute read)
Cursor detailed a new system for configuring cloud-based development environments tailored to autonomous coding agents. It supports multi-repo, environment configuration as code, automated setup workflows, and governance controls for managing fleets of parallel agents.
Cursor detailed a new system for configuring cloud-based development environments tailored to autonomous coding agents. It supports multi-repo, environment configuration as code, automated setup workflows, and governance controls for managing fleets of parallel agents.
Cursor
Development environments for your cloud agents · Cursor
Cloud agents are easier to parallelize than local agents, continue working when your laptop is closed, and can run autonomously in response to programmatic triggers.
OpenAI Explores Legal Action Against Apple (1 minute read)
Bloomberg reported that OpenAI explored legal options against Apple over dissatisfaction with how deeply ChatGPT was integrated into Apple's ecosystem and the limited subscriber growth that followed.
Bloomberg reported that OpenAI explored legal options against Apple over dissatisfaction with how deeply ChatGPT was integrated into Apple's ecosystem and the limited subscriber growth that followed.
TechCrunch
OpenAI is reportedly preparing legal action against Apple; it wouldn't be the first partner to feel burned | TechCrunch
OpenAI is so frustrated with Apple over a ChatGPT integration that failed to deliver the subscribers and prominence it expected that the company is now actively exploring legal action against the iPhone maker.
2028: Two scenarios for global AI leadership (28 minute read)
Anthropic outlines two possible 2028 global AI leadership scenarios: one where the US retains its compute advantage and shapes AI norms, and another where China competes closely due to policy inaction. The US currently leads due to strong export controls and advanced chip technology preventing China from keeping pace. Closing loopholes on compute access and restricting distillation attacks are crucial for maintaining the US lead and ensuring democracies shape AI governance.
Anthropic outlines two possible 2028 global AI leadership scenarios: one where the US retains its compute advantage and shapes AI norms, and another where China competes closely due to policy inaction. The US currently leads due to strong export controls and advanced chip technology preventing China from keeping pace. Closing loopholes on compute access and restricting distillation attacks are crucial for maintaining the US lead and ensuring democracies shape AI governance.
Anthropic
2028: Two scenarios for global AI leadership
We’re releasing a new paper that explains our views on the competition on AI between the US and China.
How We Built Secure, Scalable Agent Sandbox Infrastructure (8 minute read)
There are two ways to sandbox an agent that can execute code: isolate the tool or isolate the agent. Agents should have nothing worth stealing and nothing worth reserving. Isolating the agent requires an extra network hop on every operation and more services to deploy, but there are no secrets to steal, no state to preserve, and agents can be killed, restarted, and scaled independently.
There are two ways to sandbox an agent that can execute code: isolate the tool or isolate the agent. Agents should have nothing worth stealing and nothing worth reserving. Isolating the agent requires an extra network hop on every operation and more services to deploy, but there are no secrets to steal, no state to preserve, and agents can be killed, restarted, and scaled independently.
X (formerly Twitter)
Larsen Cundric (@larsencc) on X
How We Built Secure, Scalable Agent Sandbox Infrastructure
Codex is getting easier to automate and customize around your code (1 minute read)
Codex has implemented hooks and programmatic tokens to make it easier to automate and customize code. Hooks can customize the Codex loop with scripts that run at key points in a task. Programmatic access provides scoped credentials for Business and Enterprise teams. A video showing how to create access tokens for Codex automations is available.
Codex has implemented hooks and programmatic tokens to make it easier to automate and customize code. Hooks can customize the Codex loop with scripts that run at key points in a task. Programmatic access provides scoped credentials for Business and Enterprise teams. A video showing how to create access tokens for Codex automations is available.
Threadreaderapp
Thread by @OpenAIDevs on Thread Reader App
@OpenAIDevs: Codex is getting easier to automate and customize around your code. 🪝 Hooks customize the Codex loop with scripts that run at key points in a task: • Run validators before or after work •...…
Raindrop Workshop (GitHub Repo)
Raindrop Workshop gives Claude Code the ability to read traces, write evals against codebases, and fix what's broken. It provides livestreamed traces, coding-agent integration, a self-healing eval loop, and local replay. Raindrop Workshop is compatible with TypeScript, Python, Go, and Rust, and most popular SDKs, providers, and coding agents.
Raindrop Workshop gives Claude Code the ability to read traces, write evals against codebases, and fix what's broken. It provides livestreamed traces, coding-agent integration, a self-healing eval loop, and local replay. Raindrop Workshop is compatible with TypeScript, Python, Go, and Rust, and most popular SDKs, providers, and coding agents.
GitHub
GitHub - raindrop-ai/workshop
Contribute to raindrop-ai/workshop development by creating an account on GitHub.
Genkit Middleware (10 minute read)
Genkit is a framework for building full-stack, AI-powered and agentic applications for any platform. It supports TypeScript, Go, Dart, and Python. Genkit uses composable hooks that intercept generation calls to implement retries and fall-backs for maximum reliability, human approval before destructive tool calls, and observability across every layer. Its middleware system runs a tool loop that repeats until the model is done. The Genkit Developer can be used to inspect, test, and debug applications and middleware execution.
Genkit is a framework for building full-stack, AI-powered and agentic applications for any platform. It supports TypeScript, Go, Dart, and Python. Genkit uses composable hooks that intercept generation calls to implement retries and fall-backs for maximum reliability, human approval before destructive tool calls, and observability across every layer. Its middleware system runs a tool loop that repeats until the model is done. The Genkit Developer can be used to inspect, test, and debug applications and middleware execution.
Googleblog
Google for Developers Blog - News about Web, Mobile, AI and Cloud
Explore Genkit, an open-source framework for building production-ready AI agents. Learn how to use composable middleware for retries, model fallbacks, and human-in-the-loop approvals in TypeScript, Go, and Python.
Unlocking asynchronicity in continuous batching (20 minute read)
Asynchronous batching can reduce idle time between CPU and GPU cycles, improving GPU utilization for inference by 22%. By using CUDA streams and events, CPU tasks prepare batch N+1 during batch N's GPU computation, eliminating idle gaps. This method yields more efficient GPU operations without changing kernels or models, enhancing generation speed substantially.
Asynchronous batching can reduce idle time between CPU and GPU cycles, improving GPU utilization for inference by 22%. By using CUDA streams and events, CPU tasks prepare batch N+1 during batch N's GPU computation, eliminating idle gaps. This method yields more efficient GPU operations without changing kernels or models, enhancing generation speed substantially.
huggingface.co
Unlocking asynchronicity in continuous batching
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Microsoft is quietly shopping for an OpenAI replacement (4 minute read)
Microsoft signed a deal with OpenAI late April that amended the company's exclusive license to OpenAI models, freed OpenAI to sell on any other cloud, and removed the AGI clause that would have triggered changes to Microsoft's IP rights once OpenAI's board declared the threshold reached. Microsoft's IP license, a 27% stake worth roughly $135 billion, will be kept through 2032. Microsoft is reportedly looking to purchase Inception, a company that builds diffusion-based language models. It is interesting that Microsoft would spend $13 billion on a partner and then immediately start a shadow procurement process for a replacement.
Microsoft signed a deal with OpenAI late April that amended the company's exclusive license to OpenAI models, freed OpenAI to sell on any other cloud, and removed the AGI clause that would have triggered changes to Microsoft's IP rights once OpenAI's board declared the threshold reached. Microsoft's IP license, a 27% stake worth roughly $135 billion, will be kept through 2032. Microsoft is reportedly looking to purchase Inception, a company that builds diffusion-based language models. It is interesting that Microsoft would spend $13 billion on a partner and then immediately start a shadow procurement process for a replacement.
TNW
Microsoft is quietly shopping for an OpenAI replacement
Three weeks after rewriting its OpenAI contract, Microsoft is quietly shopping for AI startups. Cursor was the first try. Inception is next.
Elon Musk's SpaceXAI has been bleeding staff since its merger (2 minute read)
SpaceXAI is reportedly losing top talent across coding, world models, and Grok voice. Rivals like Meta and Thinking Machines Lab are scooping up former staff. Elon Musk's culture of extreme work has led some staff to leave. Several of the exits could have been driven by a desire to cash out.
SpaceXAI is reportedly losing top talent across coding, world models, and Grok voice. Rivals like Meta and Thinking Machines Lab are scooping up former staff. Elon Musk's culture of extreme work has led some staff to leave. Several of the exits could have been driven by a desire to cash out.
TechCrunch
Elon Musk's SpaceXAI has been bleeding staff since its merger | TechCrunch
More than 50 employees have reportedly left Elon Musk’s newly merged SpaceXAI since February, raising questions about burnout, leadership changes, talent poaching, and whether liquidity events weakened retention incentives.
Igor Babuschkin Seeks Up To $1 Billion For River AI (3 minute read)
Babuschkin, an xAI cofounder, is putting in $100 million of his own money into the company.
Babuschkin, an xAI cofounder, is putting in $100 million of his own money into the company.
Let's Data Science
Igor Babuschkin Seeks Up To $1 Billion For River AI
Forbes reports that xAI cofounder **Igor Babuschkin** is in talks to raise up to **$1 billion** for a new AI research startup called **River AI**, according to multiple people familiar with the deal. Forbes says the company is targeting a valuation of up…
Nvidia's Jensen Huang bets on this British startup to build 'next frontier' of AI (3 minute read)
Nvidia has announced a partnership with Ineffable Intelligence, a startup pursuing superintelligence that was founded in late 2025 by UCL professor and former lead of DeepMind's reinforcement learning team, David Silver.
Nvidia has announced a partnership with Ineffable Intelligence, a startup pursuing superintelligence that was founded in late 2025 by UCL professor and former lead of DeepMind's reinforcement learning team, David Silver.
CNBC
Nvidia's Jensen Huang bets on this British startup to build 'next frontier' of AI
Months-old Ineffable Intelligence announced a record $1.1 billion seed round in April.
Work with Codex from anywhere (6 minute read)
Codex is now available in the ChatGPT mobile app, enabling seamless remote access to ongoing work on laptops, devboxes, or remote environments.
Codex is now available in the ChatGPT mobile app, enabling seamless remote access to ongoing work on laptops, devboxes, or remote environments.
OpenAI
Work with Codex from anywhere
Use Codex anywhere with the ChatGPT mobile app. Monitor, steer, and approve coding tasks in real time across devices and remote environments.
OpenSquilla launches open-source AI agent to cut token costs (4 minute read)
OpenSquilla has introduced an open-source AI agent runtime designed to reduce unnecessary token spend by reusing context efficiently.
OpenSquilla has introduced an open-source AI agent runtime designed to reduce unnecessary token spend by reusing context efficiently.
TestingCatalog AI News
OpenSquilla launches open-source AI agent to cut token costs
OpenSquilla is an open-source Python AI agent with ML model routing, four-tier memory, and syscall-level sandbox isolation.
Toto 2.0: Time series forecasting enters the scaling era (13 minute read)
Datadog's Toto 2.0, a scalable time series forecasting model family, is now available on Hugging Face.
Datadog's Toto 2.0, a scalable time series forecasting model family, is now available on Hugging Face.
Datadog
Toto 2.0: Time series forecasting enters the scaling era | Datadog
For the first time, a time series foundation model gets reliably better with scale—five open-weights sizes from 4m to 2.5B parameters, trained from a single recipe.