🚨 Stuart Russell: AI Race Is "Russian Roulette" for Humanity
Prof. Stuart Russell just issued a massive warning about the current AI arms race, calling the push toward human-level AI "Russian roulette" for our species. He is not just talking about job loss but the actual risk of extinction if we do not find a way to contain these models before they outsmart us. This is a heavy perspective from one of the godfathers of AI, especially as Big Tech continues to pour billions into a race where safety feels like an afterthought.
💡 Why this matters:
When the guy who literally wrote the textbook on AI says we are gambling with human existence, we should probably listen. This is not about being a doomer; it is about the reality that we are building systems we do not fully understand yet. For builders, it means the "move fast and break things" era might be hitting a wall where the thing being broken is much bigger than a codebase.
Source: Fortune
Fortune
Big Tech execs playing "Russian roulette" in the AI arms race could risk human extinction, warns top researcher | Fortune
An AI 'arms race' has tech giants locked in competition, and humanity could pay the price, says Berkeley's Stuart Russell.
Google Open-Sources Agent Development Kit (ADK)
Google just open-sourced the Agent Development Kit (ADK) to help developers build model-agnostic agents. It supports Python, Java, Go, and JavaScript, providing a standardized way to handle tool calling and state management. This is a direct play for the agent orchestration layer, moving beyond just providing the models.
💡 Why this matters:
Google is building the tools to own the agent stack without forcing Gemini lock-in. They are meeting developers where they are to become the default framework for production agents.
Source: Google ADK Docs
adk.dev
Agent Development Kit (ADK)
Build powerful multi-agent systems with Agent Development Kit (ADK)
Rackspace and Palantir Partner for Enterprise AI
Rackspace and Palantir are teaming up to get Palantir AIP into production for enterprise clients at scale. They want to cut deployment times from months to weeks by pairing Palantir's AI OS with Rackspace's managed cloud expertise. This targets regulated industries that need speed without sacrificing data sovereignty or security.
💡 Why this matters:
This is about getting AI out of the demo phase and into the messy reality of production. It solves the "implementation gap" for companies that have the data but lack the infrastructure to scale it safely.
Source: Rackspace Newsroom
Rackspace Technology
Rackspace Technology Announce Strategic Partnership
Rackspace Technology and Palantir Technologies Inc. today announced a strategic partnership to accelerate AI and data platform deployments.
Swimlane Unleashes AI SOC Powered by MCP
Swimlane just dropped its new AI SOC, and it is built on a serious agentic back end. We are talking specialized agents for triage and playbook generation that actually use Model Context Protocol (MCP) to talk to your existing security tools. This is not just basic SOAR automation. It uses cyclic graphs and reasoning loops to handle real-time threats at an enterprise scale. This is where the security workforce is heading: autonomous agents doing the heavy lifting so humans can actually focus on strategy.
💡 Why this matters:
Seeing MCP integrated into an enterprise cybersecurity platform is a massive validation. It shows that agentic connectivity isn't just for hobbyists or chat interfaces anymore. For builders, this is a clear signal that mastering protocol-level communication is key to creating tools that actually scale in the "messy" real world.
Source: SiliconANGLE
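For builders wondering what "protocol-level communication" looks like on the wire: MCP messages are JSON-RPC 2.0, and a tool invocation uses the `tools/call` method. Here is a minimal sketch of building such a request. The tool name `lookup_alert` and its `alert_id` argument are made up for illustration and are not from Swimlane's product.

```python
import json
from itertools import count

# Simple incrementing request ids, as JSON-RPC expects each request to carry one.
_request_ids = count(1)


def make_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP-style tools/call request as a JSON-RPC 2.0 message."""
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# Hypothetical tool and argument, for illustration only.
message = make_tool_call("lookup_alert", {"alert_id": "A-1001"})
print(message)
```

A real client would send this over an MCP transport (stdio or HTTP) after the initialization handshake; this sketch only covers the message shape.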
SiliconANGLE
Swimlane debuts AI SOC with agentic back end to tackle cybersecurity operations
Swimlane Inc., which provides agentic artificial intelligence automation for cybersecurity, today announced a new role in its security analyst playbook with an AI security operations center operated b
Defense Tech Firm Scout AI Deploys Kinetic Agents
Scout AI just crossed a massive line by successfully demonstrating AI agents that can autonomously identify and destroy physical targets. Unlike the traditional human-in-the-loop drone strikes we have seen for years, these agents are designed to navigate complex environments and execute kinetic missions with minimal external command. This is a direct application of agentic reasoning to physical warfare, moving the technology from the screen into the real world. It signals a shift where autonomous decision-making is no longer just about writing code or summarizing emails, but about taking real action in the physical domain.
💡 Why this matters:
We have been talking about agents doing our work for months, but this is the ultimate "real world" application of the tech. It proves that agentic architectures are robust enough for mission-critical, high-stakes environments. For builders, it is a reminder that the same protocol-level control we use for automation is being applied to the most serious industries on the planet.
Source: Wired
WIRED
This Defense Company Made AI Agents That Blow Things Up
Scout AI is using technology borrowed from the AI industry to power lethal weapons, and recently demonstrated its explosive potential.
OpenAI Drops EVMbench: A Stress Test for Blockchain Agents
OpenAI and Paradigm just dropped EVMbench, an open-source stress test for AI agents in the Ethereum ecosystem. It measures how well agents detect, exploit, and actually patch high-severity smart contract vulnerabilities. Early results show GPT-5.3-Codex is a beast at offensive exploitation but still struggles to fix the mess it finds. This is the ultimate test for agentic reasoning because in crypto, a single hallucination is a permanent financial disaster.
💡 Why this matters:
Blockchain security protects over $100B in assets, so agent reliability isn't optional. For builders, this benchmark provides a standardized leaderboard to track exactly where reasoning gaps still exist in high-stakes code environments.
Source: OpenAI
OpenAI
Introducing EVMbench
OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents' ability to detect, patch, and exploit high-severity smart contract vulnerabilities.
Perplexity Comet Browser Hitting iOS?
Rumors are swirling on X that Perplexity is finally bringing its AI-powered Comet Browser to iOS on March 11. A leaked screenshot suggests the long-awaited Safari rival is just weeks away from a mobile launch. While Perplexity has not officially confirmed the date yet, the hype is building for what Aravind Srinivas previously called the first real competition to Safari on the iPhone. It looks like the mobile browser wars are about to get a lot more interesting.
Source: X (Leaked Signal)
🚨 India AI Summit Fraud: University Booted for Faking Robot Dog
Galgotias University was just kicked out of India's flagship AI summit for one of the most awkward frauds we have seen yet. A staff member tried to pass off a commercially available Chinese robot dog as their own original work. People quickly realized it was just a Unitree Go2 bought off the shelf. This is a huge embarrassment for the school and a clear sign that the hype for AI clout is driving some people to take really desperate shortcuts.
🤦🏻
🚨 Microsoft stores 4.8TB on a piece of glass
Microsoft's Project Silica just hit a major milestone: they moved from expensive fused silica to everyday borosilicate glass (yes, Pyrex). The result? 4.8TB of data etched into a 120mm x 2mm glass disk that lasts 10,000+ years. No climate control. No migration cycles. Just femtosecond lasers, a camera, and some ML magic reading layers of data like a tiny vinyl record.
That's ~200 4K movies or 1.75 million songs on something the size of a drink coaster. Compare that to your SSD, which starts getting nervous after a decade.
💡 Why this matters: We're talking about true "set it and forget it" archival storage. For government records, scientific datasets, or anything that needs to outlive us, this changes the math entirely. Pilot programs with government and science archives target 2027.
Source: Microsoft Research
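Quick sanity check on the "~200 4K movies or 1.75 million songs" comparison. This sketch just divides the 4.8TB capacity by each count to see what file sizes the post implies. It assumes decimal terabytes (1 TB = 10^12 bytes) and an even split, neither of which the source spells out.

```python
# Capacity quoted in the post, assumed to be decimal terabytes.
CAPACITY_BYTES = 4_800_000_000_000

# Counts quoted in the post.
MOVIE_COUNT = 200
SONG_COUNT = 1_750_000


def implied_size_bytes(capacity_bytes: int, item_count: int) -> float:
    """Average size per item if the items exactly fill the capacity."""
    return capacity_bytes / item_count


movie_gb = implied_size_bytes(CAPACITY_BYTES, MOVIE_COUNT) / 1e9
song_mb = implied_size_bytes(CAPACITY_BYTES, SONG_COUNT) / 1e6

print(f"Implied 4K movie size: {movie_gb:.1f} GB")  # 24.0 GB
print(f"Implied song size: {song_mb:.2f} MB")       # 2.74 MB
```

Both figures are in a plausible range for a compressed 4K film and a compressed audio track, so the comparison holds up as rough arithmetic.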
Microsoft Research
Project Silica
Project Silica is developing the first-ever storage technology designed and built from the ground up for the cloud, using femtosecond lasers to store data.
🚨 Agents that evolve together, stay together
UC Santa Barbara researchers just built digital Darwinism. These AI agents evolve as a group, matching human-engineered systems on SWE-bench.
The kicker? Zero added inference cost at deployment, per the report. The evolution work is front-loaded, so the evolved agent should cost no more to run than the one you started with.
💡 Why this matters: It points to a way of making agents better without raising the "per-request tax" of running them. The improvement becomes a one-time investment instead of a recurring charge.
Source: VentureBeat
VentureBeat
New agent framework matches human-engineered AI systems, and adds zero inference cost to deploy
A new group-evolving agent framework from UC Santa Barbara matches human-engineered AI systems on SWE-bench, and adds zero inference cost to deploy. Here's how it works.
Forwarded from TestingCatalog AI News (Alexey)
Gemini 3.1 Pro Preview today confirmed.
Would be breaking
🚨 Meta spends $65M to buy an AI-friendly future
Mark Zuckerberg isn't just building Llama. He's building a political firewall. Meta has reportedly poured $65 million into US state-level elections, targeting candidates who will back pro-AI legislation and data center expansion.
This is a massive strategic shift. While the world watches DC, Meta is focusing on the states where actual AI regulations are being fought. They're backing super PACs in Texas, Florida, and Ohio to ensure the "Year of the Agent" isn't slowed down by red tape.
💡 Why this matters:
It shows that Big Tech views regulation as the single biggest threat to their AI dominance. By funding the ground game, Meta is ensuring that the rules of the AGI race are written in their favor.
Source: The Decoder
The Decoder
Meta pours $65 million into state elections to back AI-friendly politicians
Meta is investing $65 million to influence state-level elections across the US, backing politicians friendly to AI.
The alignment problem is real.
Everyone on stage is holding hands at the India AI Summit. Sam Altman and Dario Amodei? Hard pass.
Two people who talk about AI cooperation, existential risk, and the future of humanity every single day, standing right next to each other, refusing to join the circle.
Dario probably ran a safety evaluation on the handhold first. Sam was waiting for the feature to ship.
The hardest coordination problem in AI isn't multi-agent systems. It's getting the two biggest CEOs in the room to link up.
Some bugs can't be patched.
🚨 BREAKING: Google Gemini 3.1 Pro Preview model is available on Google Vertex AI and AI Studio
Google's latest SOTA reasoning model with unprecedented depth and nuance, and powerful multimodal understanding and coding capabilities:
💰 <= 200K tokens • Input: $2.00 / Output: $12.00
💰 > 200K tokens • Input: $4.00 / Output: $18.00
Knowledge cutoff: Jan 2025
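To get a feel for what the two pricing tiers mean per request, here is a rough cost estimator. Two assumptions are not stated in the post: that the rates are in USD per 1 million tokens, and that the tier is picked by the size of the input (prompt). Treat it as a sketch, not Google's billing logic.

```python
from dataclasses import dataclass

# Tier boundary from the post; assumed to apply to input (prompt) tokens.
TIER_THRESHOLD_TOKENS = 200_000


@dataclass(frozen=True)
class Rates:
    """Prices in USD, assumed to be per 1 million tokens."""
    input_per_million: float
    output_per_million: float


SHORT_CONTEXT = Rates(input_per_million=2.00, output_per_million=12.00)
LONG_CONTEXT = Rates(input_per_million=4.00, output_per_million=18.00)


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under the assumed tier rules."""
    if input_tokens <= TIER_THRESHOLD_TOKENS:
        rates = SHORT_CONTEXT
    else:
        rates = LONG_CONTEXT
    total = (input_tokens * rates.input_per_million
             + output_tokens * rates.output_per_million)
    return total / 1_000_000


print(f"${estimate_cost(100_000, 5_000):.2f}")  # $0.26
print(f"${estimate_cost(300_000, 5_000):.2f}")  # $1.29
```

Under these assumptions, crossing the 200K boundary doubles the input rate, so trimming a long prompt to fit the lower tier can matter more than trimming the output.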
🚨 OpenAI Nears Historic $100B+ Funding at $850B Valuation
This is it. The biggest private funding round in history is reportedly closing. Bloomberg and TechCrunch both report that OpenAI is finalizing commitments exceeding $100 billion, pushing its valuation past $850 billion.
The investor lineup reads like a tech summit guest list: Amazon (up to $50B), SoftBank ($30B), Nvidia ($20B), and Microsoft. This isn't just about cash. It's about locking in the compute pipeline for the next decade of AGI development.
💡 Why this matters:
When you're raising $100 billion, you're no longer a startup. You're building infrastructure at a national scale. This deal effectively guarantees OpenAI's dominance in the foundation model race, unless regulators step in.
Source: Bloomberg | TechCrunch
Bloomberg.com
OpenAI Funding on Track to Top $100 Billion in Latest Round
OpenAI is close to finalizing the first phase of a new funding round that is likely to bring in more than $100 billion, according to people familiar with the matter, a record-breaking financing deal that would give the startup additional capital to build…
US State Dept Launches "Freedom.gov" to Bypass Global Censorship
This is a wild one. The US government is officially entering the VPN game. They are launching freedom.gov, a portal designed to let users worldwide bypass content bans from the EU's Digital Services Act and the UK's Online Safety Act.
It is essentially a government-backed tunnel into the "unfiltered" internet, framed as a tool for free expression. It is already causing a massive legal and diplomatic rift between the US and its European allies.
💡 Why this matters:
We are seeing the first "Browser War" fought at the State Department level. When governments start building their own infrastructure to bypass other governments' laws, the concept of a "global internet" is officially over.
Source: Reuters | Independent
Reuters
Exclusive: US plans online portal to bypass content bans in Europe and elsewhere
The portal could potentially put Washington in the unfamiliar position of appearing to encourage citizens to flout local laws.
🚨 Pentagon Threatens Anthropic with "Supply Chain Risk" Designation
The standoff between Anthropic and the Pentagon just went nuclear. After weeks of arguing over "red lines" for military use, the Department of Defense is now considering designating Anthropic as a "Supply Chain Risk."
If this goes through, it wouldn't just kill the $200M contract: it would legally ban the entire US government and its thousands of contractors from using Claude. Pentagon CTO Heidi Shyu doubled down today, calling it "not democratic" for a private lab to unilaterally decide which military missions are "safe."
💡 Why this matters:
This is the first time the US government has used its "Supply Chain" teeth against a domestic AI lab. It proves that for the Pentagon, "safety vibes" aren't an excuse to ignore mission requirements. If Anthropic doesn't blink, they could be effectively locked out of the entire federal market.
Source: The Guardian | Yahoo News | Times of India
The Guardian
US military used Anthropic's AI model Claude in Venezuela raid, report says
Wall Street Journal says Claude used in operation via Anthropic's partnership with Palantir Technologies
🚨 Google Drops Gemini 3.1 Pro: The Reasoning Leap
Google just previewed Gemini 3.1 Pro, and the reasoning jumps are wild. It's tuned for complex logic, beating abstract tests that usually trip up the best models.
• ARC-AGI-2: Scored 77.1% (more than double 3.0 Pro's 31.1%).
• Science/Reasoning: Hits 94.3% on GPQA Diamond, pulling ahead of Claude Opus 4.6 and GPT-5.2.
• Coding: 80.6% on SWE-Bench Verified; elite tier for agentic coding.
💡 Why this matters: We're moving from chatbots to agents that can actually synthesize complex data. Gemini 3.1 Pro is Google's bid to own the high-reasoning space.
Source: Google Blog
Google
Gemini 3.1 Pro: A smarter model for your most complex tasks
3.1 Pro is designed for tasks where a simple answer isn't enough.