Stripe launched Projects to help agents instantly provision services from the CLI.
For example, simply run:
$ stripe projects add posthog/analytics
And it'll create a PostHog account, get an API key, and (as needed) set up billing.
Projects is launching today as a developer preview. You can register for access (it available to everyone soon) at projects.dev.
Also rolling out support for many new providers over the coming weeks.
For example, simply run:
$ stripe projects add posthog/analytics
And it'll create a PostHog account, get an API key, and (as needed) set up billing.
Projects is launching today as a developer preview. You can register for access (it available to everyone soon) at projects.dev.
Also rolling out support for many new providers over the coming weeks.
Stripe Projects
Stripe Projects | Provision and Manage Services from the CLI
Enable you or your agents to provision hosting, databases, auth, AI, and more from the CLI. Generate credentials and manage usage and billing in one place.
Claude Code can now auto-fix your PR in the background.
All you have to do is turn on the Auto Fix setting and go touch grass.
All you have to do is turn on the Auto Fix setting and go touch grass.
Claude Code Docs
Claude Code on the web - Claude Code Docs
Run Claude Code tasks asynchronously on secure cloud infrastructure
👍4❤3🔥2😁1
Cool work by Chroma training a search agent with SoTA efficiency.
Chroma Context-1, a 20B parameter search agent:
•pushes the pareto frontier of agentic search
•order of magnitude faster
•order of magnitude cheaper
•Apache 2.0, open-source
Lots of cool details: a prune tool for editing context mid-search, a synthetic data pipeline with verification steps, and a curriculum that shifts from recall to precision.
Trained with Tinker.
Chroma Context-1, a 20B parameter search agent:
•pushes the pareto frontier of agentic search
•order of magnitude faster
•order of magnitude cheaper
•Apache 2.0, open-source
Lots of cool details: a prune tool for editing context mid-search, a synthetic data pipeline with verification steps, and a curriculum that shifts from recall to precision.
Trained with Tinker.
GitHub
GitHub - chroma-core/context-1-data-gen
Contribute to chroma-core/context-1-data-gen development by creating an account on GitHub.
❤4👍2💯2
Stanford proposed a bold new vision for biology: Virtual Embryos
By integrating single-cell and spatial genomics with AI, researchers can build digital twins of embryogenesis—moving beyond virtual cells to predict cell growth, division, migration, state transitions, and morphogenesis, from genes → cells → organs -> organ systems and whole embryo, in a fully 4D spatiotemporal framework.
This approach could enable truly predictive biology and in silico experimentation to diagnose, prevent, and treat developmental diseases transforming medicine and improving outcomes for future generations.
By integrating single-cell and spatial genomics with AI, researchers can build digital twins of embryogenesis—moving beyond virtual cells to predict cell growth, division, migration, state transitions, and morphogenesis, from genes → cells → organs -> organ systems and whole embryo, in a fully 4D spatiotemporal framework.
This approach could enable truly predictive biology and in silico experimentation to diagnose, prevent, and treat developmental diseases transforming medicine and improving outcomes for future generations.
Nature
Towards predictive virtual embryos with genomics and AI
Nature Methods - Predictive virtual embryo systems that integrate single-cell and spatial data with artificial intelligence (AI) techniques offer a promising avenue for modeling mammalian...
❤3🔥2💯2
Unitree open-sourced UnifoLM-WBT-Dataset a high-quality real-world humanoid robot whole-body teleoperation (WBT) dataset for open environments.
Publicly available since March 5, 2026, the dataset will continue to receive high-frequency rolling updates. It aims to establish the most comprehensive real-world humanoid robot dataset in terms of scenario coverage, task complexity, and manipulation diversity.
Publicly available since March 5, 2026, the dataset will continue to receive high-frequency rolling updates. It aims to establish the most comprehensive real-world humanoid robot dataset in terms of scenario coverage, task complexity, and manipulation diversity.
❤2🔥2💯2
Nicolas Carlini at Anthopic just showed Claude finding zero day vulnerabilities in a live conference demo
Claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history...
it found the blind SQL injection in 90 minutes, stole the admin api key, then did the exact, same thing to the linux kernel.
Claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history...
it found the blind SQL injection in 90 minutes, stole the admin api key, then did the exact, same thing to the linux kernel.
YouTube
Nicholas Carlini - Black-hat LLMs | [un]prompted 2026
Nicholas Carlini, Research Scientist, Anthropic, speaks at [un]prompted 2026 on: Black-hat LLMs.
Large language models are now capable of automating attacks that were previously only possible by human adversaries. In this talk, I discuss several ways that…
Large language models are now capable of automating attacks that were previously only possible by human adversaries. In this talk, I discuss several ways that…
❤7🔥2💯2
HuggingFace built infra where an agent can launch GPU experiments, access all of hf datasets as local filesystem, and semantically search arxiv for ideas.
GitHub
GitHub - mishig25/hf-autoresearch: AI agents running research on Hugging Face infra
AI agents running research on Hugging Face infra. Contribute to mishig25/hf-autoresearch development by creating an account on GitHub.
❤5👍2🔥2
Mathematical methods and human thought in the age of AI by Tanya Klowden, Terence Tao
In this paper, researchers consider the rapidly evolving impact of AI to the traditional questions of philosophy with an emphasis on its application in mathematics and on the broader real-world outcomes of its more general use.
They are assert that AI is a natural evolution of human tools developed throughout history to facilitate the creation, organization, and dissemination of ideas, and argue that it is paramount that the development and application of AI remain fundamentally human-centered.
They are propose a pathway to integrating AI into our most challenging and intellectually rigorous fields to the benefit of all humankind.
In this paper, researchers consider the rapidly evolving impact of AI to the traditional questions of philosophy with an emphasis on its application in mathematics and on the broader real-world outcomes of its more general use.
They are assert that AI is a natural evolution of human tools developed throughout history to facilitate the creation, organization, and dissemination of ideas, and argue that it is paramount that the development and application of AI remain fundamentally human-centered.
They are propose a pathway to integrating AI into our most challenging and intellectually rigorous fields to the benefit of all humankind.
❤2🔥2💯2
Microsoft Introduced Critique, a new multi-model deep research system in M365 Copilot.
You can use multiple models together to generate optimal responses and reports.
You can use multiple models together to generate optimal responses and reports.
TECHCOMMUNITY.MICROSOFT.COM
Introducing multi-model intelligence in Researcher | Microsoft Community Hub
Researcher now goes further with two new multi-model capabilities that raise the bar for accuracy, depth, and confidence in AI-generated reports: Critique...
🔥2🥰2💯2
New supply chain attack this time for npm axios, the most popular HTTP client library with 300M weekly downloads.
It's possible to personally defend against these to some extent with local settings e.g. release-age constraints, or containers or etc, but ultimately the defaults of package management projects (pip, npm etc) have to change so that a single infection (usually luckily fairly temporary in nature due to security scanning) does not spread through users at random and at scale via unpinned dependencies.
Supply chain attacks like the currently breaking axios, litellm and xz are only going to be more commonplace in the vibecoding world.
The entire premise of vibecoding is “I don’t need to understand the code” happens to also be the entire premise of a supply chain attack.
It's possible to personally defend against these to some extent with local settings e.g. release-age constraints, or containers or etc, but ultimately the defaults of package management projects (pip, npm etc) have to change so that a single infection (usually luckily fairly temporary in nature due to security scanning) does not spread through users at random and at scale via unpinned dependencies.
Supply chain attacks like the currently breaking axios, litellm and xz are only going to be more commonplace in the vibecoding world.
The entire premise of vibecoding is “I don’t need to understand the code” happens to also be the entire premise of a supply chain attack.
www.stepsecurity.io
axios Compromised on npm - Malicious Versions Drop Remote Access Trojan - StepSecurity
Hijacked maintainer account used to publish poisoned axios releases including 1.14.1 and 0.30.4. The attacker injected a hidden dependency that drops a cross platform RAT. We are actively investigating and will update this post with a full technical analysis.
💯3🔥2👏2
Claude code source code has been leaked via a map file in their npm registry. You can find it here.
GitHub
GitHub - ultraworkers/claw-code: The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars…
The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex. - ultraworkers/claw-code
👍2🔥2👏2🫡2
OMG! Whoop raised $575M at a $10.1B valuation
Whoop is building the world’s leading personal, preventive health platform powered by continuous biometric data, advanced analytics, and AI to help people understand their bodies and improve their health in real time.
In the past 12 months, Whoop has received medical clearances, launched blood testing, and created a platform that has saved lives.
Abbott and Mayo Clinic two of the most respected and influential institutions in global healthcare are now investors in Whoop.
Whoop is building the world’s leading personal, preventive health platform powered by continuous biometric data, advanced analytics, and AI to help people understand their bodies and improve their health in real time.
In the past 12 months, Whoop has received medical clearances, launched blood testing, and created a platform that has saved lives.
Abbott and Mayo Clinic two of the most respected and influential institutions in global healthcare are now investors in Whoop.
Nytimes
Whoop, a Wearable Health Device Maker, Raises $575 Million
With elite athletes like LeBron James and Cristiano Ronaldo as investors, the company, now valued at $10 billion, is courting everyday health enthusiasts.
🔥3🥰2💯2
Google presented new paper on AI Agent Traps
An increasing volume of web content is being created by, and consumed by, advanced AI agents.
This puts environmental AI safety in focus, as it exposes a vast attack surface via the content that AI agents interact with.
This paper explores the landscape of environmental attacks and defenses, aiming to inform mitigations that are needed for ensuring safety of the agentic web.
An increasing volume of web content is being created by, and consumed by, advanced AI agents.
This puts environmental AI safety in focus, as it exposes a vast attack surface via the content that AI agents interact with.
This paper explores the landscape of environmental attacks and defenses, aiming to inform mitigations that are needed for ensuring safety of the agentic web.
Ssrn
AI Agent Traps
As autonomous AI agents increasingly navigate the web, they face a novel challenge: the information environment itself. This gives rise to a critical vulnerabil
❤3👍2💯2
PrismML released the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GBs of memory and delivers over 10x the intelligence density of its full-precision counterparts.
PrismML grew out of years of research at Caltech
The first proof point is the 1-bit Bonsai family: models that are small, fast, and efficient enough to run locally, while remaining competitive with full-precision models in their class.
It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter-class.
Open-source model under Apache 2.0 license, along with Bonsai 4B and 1.7B models.
HuggingFace
PrismML grew out of years of research at Caltech
The first proof point is the 1-bit Bonsai family: models that are small, fast, and efficient enough to run locally, while remaining competitive with full-precision models in their class.
It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter-class.
Open-source model under Apache 2.0 license, along with Bonsai 4B and 1.7B models.
HuggingFace
Prismml
PrismML — Announcing 1-bit Bonsai: The First Commercially Viable 1-bit LLMs
🔥2👏2💯2
What if AI didn’t just solve math problems but discovered entirely new mathematical structures?
Meet AutoMath from The Omega Institute.
From ONE equation (x² = x + 1) and ZERO extra axioms, team derive 9 branches of math: algebra, combinatorics, topology, dynamical systems… all formally verified in Lean 4 (~2,350 theorems, 25k lines of code).
Their method: Derive, Discover, Name
• Derive → AI exhaustively explores every logical consequence
• Discover → Patterns emerge that humans might never notice
• Name → Human intuition connects them to deep math (rings, finite fields, golden-ratio p-adics…)
GitHub.
Meet AutoMath from The Omega Institute.
From ONE equation (x² = x + 1) and ZERO extra axioms, team derive 9 branches of math: algebra, combinatorics, topology, dynamical systems… all formally verified in Lean 4 (~2,350 theorems, 25k lines of code).
Their method: Derive, Discover, Name
• Derive → AI exhaustively explores every logical consequence
• Discover → Patterns emerge that humans might never notice
• Name → Human intuition connects them to deep math (rings, finite fields, golden-ratio p-adics…)
GitHub.
Derive, Discover, Name
A case study in AI-assisted mathematical discovery. One equation, zero axioms, nine branches of mathematics.
👍5🔥2👏2
The power of the Claw, in the palm of a robot hand. Agentic robotics is here. Nvidia open-sourced CaP-X: vibe agents, alive in the physical world.
They incarnate as robot arms and humanoids with a rich set of perception APIs, actuation APIs, and auto synthesize skill libraries as they go.
CaP-X is a strict superset of Nvidia’s old stack, because policies like VLAs are “just” API calls as well. It solves many tasks zero-shot that a learned policy would struggle with.
CaP-X is a most systematic, scientific study on agentic robotics so far:
1. comprehensive agentic toolkit: perception (SAM3 segmentation, Molmo pointing, depth, point cloud), control (IK solvers, grasp planner, navigation), and visualization (EEF, mask overlays) that work across different robots.
2. CaP-Gym: LLM’s first Physical Exam! 187 manipulation tasks across RoboSuite, LIBERO-PRO, and BEHAVIOR. Tabletop, bimanual, mobile manipulation. Sim and real.
3. CaP-Bench: benchmark 12 frontier LLMs/VLMs (Gemini, GPT, Opus, Qwen, DeepSeek, Kimi, and more) across 8 evaluation tiers.
Lots of insights in paper.
4. CaP-Agent0: a training-free agentic harness that matches or exceeds human expert code on 4 out of 7 tasks without task-specific tuning.
5. CaP-RL: if you get a gym, you get RL ;). A 7B OSS model jumps from 20% to 72% success after only 50 training iterations. The synthesized programs transfer to real robots with minimal sim-to-real gap.
Code.
MIT license.
They incarnate as robot arms and humanoids with a rich set of perception APIs, actuation APIs, and auto synthesize skill libraries as they go.
CaP-X is a strict superset of Nvidia’s old stack, because policies like VLAs are “just” API calls as well. It solves many tasks zero-shot that a learned policy would struggle with.
CaP-X is a most systematic, scientific study on agentic robotics so far:
1. comprehensive agentic toolkit: perception (SAM3 segmentation, Molmo pointing, depth, point cloud), control (IK solvers, grasp planner, navigation), and visualization (EEF, mask overlays) that work across different robots.
2. CaP-Gym: LLM’s first Physical Exam! 187 manipulation tasks across RoboSuite, LIBERO-PRO, and BEHAVIOR. Tabletop, bimanual, mobile manipulation. Sim and real.
3. CaP-Bench: benchmark 12 frontier LLMs/VLMs (Gemini, GPT, Opus, Qwen, DeepSeek, Kimi, and more) across 8 evaluation tiers.
Lots of insights in paper.
4. CaP-Agent0: a training-free agentic harness that matches or exceeds human expert code on 4 out of 7 tasks without task-specific tuning.
5. CaP-RL: if you get a gym, you get RL ;). A 7B OSS model jumps from 20% to 72% success after only 50 training iterations. The synthesized programs transfer to real robots with minimal sim-to-real gap.
Code.
MIT license.
GitHub
GitHub - capgym/cap-x: A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation
A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation - capgym/cap-x
❤4🔥2💯2
Mercor AI has allegedly been breached by Lapsus
More customer data leaks: Amazon, Athena, Aphrodite, Meta, Apple…
939GB of source code
4TB of data in total
SOTA training data now just available. Every major lab. Billions and billions of value and a major national security issue.
More customer data leaks: Amazon, Athena, Aphrodite, Meta, Apple…
939GB of source code
4TB of data in total
SOTA training data now just available. Every major lab. Billions and billions of value and a major national security issue.
❤7🫡4💯2🔥1
Goodfire introduced self-correcting search: a technique to let diffusion models self-correct mid-trajectory.
MatterGen a feedback loop from its own activations, improving viable on-target candidates by ~30%.
MatterGen is an open-source diffusion model for generating novel crystal structures. When generating materials with a target property, stronger conditioning tends to improve targeting, but reduces the stability, diversity, and novelty of outputs.
MatterGen a feedback loop from its own activations, improving viable on-target candidates by ~30%.
MatterGen is an open-source diffusion model for generating novel crystal structures. When generating materials with a target property, stronger conditioning tends to improve targeting, but reduces the stability, diversity, and novelty of outputs.
www.goodfire.ai
Using Self-Correcting Search to Accelerate Materials Discovery
Sakana AI introduced new ultra deep research assistant Marlin
Pushing the limits of test-time scaling for auomating business-oriented research. It builds on top of AB-MCTS and The AI Scientist!
Agents scale to real-world applications and long-running workloads.
Pushing the limits of test-time scaling for auomating business-oriented research. It builds on top of AB-MCTS and The AI Scientist!
Agents scale to real-world applications and long-running workloads.
sakana.ai
Sakana AI
新しいBusiness Intelligenceへ:Ultra Deep Researchアシスタント「Sakana Marlin」βテスト開始
🆒4
Meet SAGA a generalist AI scientist. Instead of just optimizing fixed targets, it refines its own goals like a human researcher.
From de novo nanobodies to permanent magnets.
The core idea: across discovery tasks, scientists rarely know perfect set of objectives upfront. They iterate — tweak scoring functions, add constraints, re-weight trade-offs based on what optimizer produces. SAGA aims to automate this entire loop.
Code.
From de novo nanobodies to permanent magnets.
The core idea: across discovery tasks, scientists rarely know perfect set of objectives upfront. They iterate — tweak scoring functions, add constraints, re-weight trade-offs based on what optimizer produces. SAGA aims to automate this entire loop.
Code.
arXiv.org
Accelerating Scientific Discovery with Autonomous Goal-evolving Agents
There has been unprecedented interest in developing agents that expand the boundary of scientific discovery, primarily by optimizing quantitative objective functions specified by scientists....
🔥2👏2💯2