Offshore
Photo
DAIR.AI
RT @omarsar0: NEW research from Meta Superintelligence Labs.

It uses a clever strategy-auction framework to improve self-improving agents on complex tasks.

Small agents aren't always enough.

On the simplest tasks, a 4B parameter agent attains 87% of a 32B agent's performance. But on the most complex tasks, that relative performance drops to just 21%.

The default assumption today is that you either use the biggest model for everything or route tasks with a trained classifier.

But trained routers degrade as task difficulty increases, and non-predictive cascades become prohibitively expensive for agentic workloads.

This new research introduces SALE (Strategy Auctions for Workload Efficiency), a framework inspired by freelancer marketplaces. Instead of predicting which model to use from a task description alone, agents bid with short strategic plans that are scored by a systematic cost-value mechanism.

How does the auction work? Each candidate agent proposes a strategic solution plan. A peer jury scores plans by predicted value. A heuristic cost predictor estimates execution cost. The agent with the best cost-value trade-off wins and executes its plan.

The self-improvement mechanism is where it gets interesting. After each auction, all proposed strategies are stored in a shared memory bank. Cheaper agents that lost can learn from winning strategies and submit refined bids, analogous to freelancers upskilling over time.

On deep search tasks, SALE exceeds the best single agent's pass@1 by 3.5 points while reducing cost by 35%. On coding tasks, it improves pass@1 by 2.7 points at 25% lower cost. Across both domains, SALE reduces reliance on the largest agent by 53%.

Existing routers like WTP and FrugalGPT either underperform the largest agent or fail to reduce cost. FrugalGPT's costs actually increase on complex coding tasks, reaching 0.61 dollars per million tokens versus the best agent's 0.36 dollars.

Market-inspired coordination mechanisms that organize heterogeneous agents into adaptive ecosystems can systematically outperform both single large models and trained routing approaches.

Paper: https://t.co/UY8C5cmfxK

Learn to build effective AI Agents in our academy: https://t.co/1e8RZKs4uX
tweet
Offshore
Photo
DAIR.AI
// Agent Primitives //

This is a really interesting take on building effective multi-agent systems.

Multi-agent systems get more complex as tasks get harder. More roles, more prompts, more bespoke interaction patterns. However, the core computation patterns keep repeating across every system: review, vote, plan, execute.

But nobody treats these patterns as reusable building blocks.

This new research introduces Agent Primitives, a set of latent building blocks for constructing effective multi-agent systems.

Inspired by how neural networks are built from reusable components like residual blocks and attention heads, the researchers decompose multi-agent architectures into three recurring primitives: Review, Voting and Selection, and Planning and Execution.

What makes these primitives different? Agents inside each primitive communicate via KV-cache rather than natural language. This avoids the information degradation that happens when agents pass long text messages back and forth across multi-stage interactions.

An Organizer agent selects and composes primitives for each query, guided by a lightweight knowledge pool of previously successful configurations.

No manual system design required.

The results across eight benchmarks spanning math, code generation, and QA with five open-source LLMs:

> Primitives-based MAS improve average accuracy by 12.0-16.5% over single-agent baselines

> On GPQA-Diamond, the improvement is striking, 53.2% versus the 33.6-40.2% range of prior methods like AgentVerse, DyLAN, and MAS-GPT

In terms of efficiency, token usage and inference latency drop by approximately 3-4x compared to text-based MAS, while incurring only 1.3-1.6x overhead relative to single-agent inference.

Instead of designing task-specific multi-agent architectures from scratch, Agent Primitives show that a small set of reusable computation patterns with latent communication can match or exceed custom systems while being dramatically more efficient.

Paper: https://t.co/fxEL6g0x4O

Learn to build effective AI agents in our academy: https://t.co/LRnpZN7L4c
tweet
Offshore
Photo
The Transcript
RT @TheTranscript_: $ARM: -8%AH

CEO: "Arm delivered a record revenue quarter as demand for AI computing on our platform continues to accelerate. Record royalty results in the third quarter reflect the growing scale of our ecosystem, as customers design the Arm compute platform into next-generation systems across cloud, edge, and physical environments to deliver high-performance, power-efficient AI. The fundamentals of the Arm business have never been stronger."
tweet
Offshore
Video
Lumida Wealth Management
NVIDIA AND DASSAULT JUST ANNOUNCED THEIR BIGGEST PARTNERSHIP EVER

Jensen: "This is the largest collaboration our two companies have ever had in over a quarter century.

Dassault is integrating Nvidia Cuda X acceleration libraries, Nvidia AI, and Nvidia Omniverse into their platform.

This represents our body of work over 25 years. Now we're fusing it so you can work at a scale 100 times, 1000 times, and very soon a million times greater than before.

What used to be pre-rendered or offline simulations will now be real-time digital twins."

This is the infrastructure layer for the next generation of product design and engineering.
tweet
Javier Blas
RT @michellelprice: KYIV, Ukraine (AP) โ€” US and Russia agree to reestablish military-to-military dialogue after Ukraine talks.
tweet
Offshore
Photo
The Transcript
Peloton double miss

CEO: "Our second quarter represented the most substantial period of innovation at Peloton since our founding."

$PTON: -26% today https://t.co/fgm7873wyL
tweet
Offshore
Video
Michael Fritzell (Asian Century Stocks)
RT @PJaccetturo: RIP Hollywood.

AI is now 100% photorealistic with the launch of Kling 3.0

In just two days, I created the opening sequence from The Way of Kings by Brandon Sanderson

You have to try this new Multi-Shot technique that makes making films much faster and cheaper ๐Ÿงต๐Ÿ‘‡ https://t.co/tqZCnsP96J
tweet
Offshore
Video
Dimitry Nakhla | Babylon Capitalยฎ
๐‚๐ก๐ซ๐ข๐ฌ ๐‡๐จ๐ก๐ง ๐จ๐ง ๐ฐ๐ก๐š๐ญ ๐ญ๐ฒ๐ฉ๐ž๐ฌ ๐จ๐Ÿ ๐œ๐จ๐ฆ๐ฉ๐š๐ง๐ข๐ž๐ฌ ๐ก๐ž ๐ฐ๐จ๐ฎ๐ฅ๐ ๐ง๐ž๐ฏ๐ž๐ซ ๐ข๐ง๐ฏ๐ž๐ฌ๐ญ ๐ข๐ง:

โ€œWe have a long list of companies we donโ€™t invest inโ€ฆ banks, commodity businesses, most manufacturing industries, fossil fuels, utilities, airlines, wireless telecom, advertising agenciesโ€ฆ Why? Because theyโ€™re competitive. And the most important thing Iโ€™ve learned in investing is that investors underestimate the forces of competition and disruption.โ€
___

๐˜๐˜ฏ๐˜ฅ๐˜ถ๐˜ด๐˜ต๐˜ณ๐˜ช๐˜ฆ๐˜ด ๐˜๐˜ฐ๐˜ฉ๐˜ฏ ๐˜ฆ๐˜น๐˜ฑ๐˜ญ๐˜ช๐˜ค๐˜ช๐˜ต๐˜ญ๐˜บ ๐˜ข๐˜ท๐˜ฐ๐˜ช๐˜ฅ๐˜ด:

โ€ข Banks
โ€ข Commodity businesses / manufacturing
โ€ข Insurance
โ€ข Tobacco
โ€ข Fossil fuels
โ€ข Utilities
โ€ข Airlines
โ€ข Wireless telecom
โ€ข Advertising agencies
โ€ข Most traditional manufacturing
___

Hohnโ€™s point isnโ€™t that money canโ€™t be made in these areas โ€” plenty of investors have done well in some of them.

The deeper lesson:

๐™„๐™ฃ๐™ซ๐™š๐™จ๐™ฉ๐™ž๐™ฃ๐™œ ๐™ž๐™จ ๐™–๐™จ ๐™ข๐™ช๐™˜๐™ ๐™–๐™—๐™ค๐™ช๐™ฉ ๐™™๐™š๐™˜๐™ž๐™™๐™ž๐™ฃ๐™œ ๐™ฌ๐™๐™–๐™ฉ ๐™ฃ๐™ค๐™ฉ ๐™ฉ๐™ค ๐™ค๐™ฌ๐™ฃ ๐™–๐™จ ๐™ž๐™ฉ ๐™ž๐™จ ๐™–๐™—๐™ค๐™ช๐™ฉ ๐™™๐™š๐™˜๐™ž๐™™๐™ž๐™ฃ๐™œ ๐™ฌ๐™๐™–๐™ฉ ๐™ฉ๐™ค ๐™ค๐™ฌ๐™ฃ.

Highly competitive industries tend to:

โ€ข Erode returns on capital
โ€ข Compress margins over time
โ€ข Require constant reinvestment

Contrast that with businesses that have:

โ€ข Pricing power
โ€ข High switching costs
โ€ข Network effects
โ€ข Structural barriers to entry

Those are the environments where ๐˜ญ๐˜ฐ๐˜ฏ๐˜จ-๐˜ต๐˜ฆ๐˜ณ๐˜ฎ compounding becomes far more predictable.
___

Another subtle takeaway:

Most investors focus heavily on upside narratives.

Great investors spend just as much time thinking about downside structures.
___

Source: In Good Company | Norges Bank Investment Management (05/14/2025)
tweet
Offshore
Video
The Transcript
$AMZN reporting after hours today: https://t.co/AG90XQkPOZ
tweet
Offshore
Photo
Moon Dev
The $100M Quantitative Blueprint: Building AI Agents to Replace Your Trading Desk
tweet
Offshore
Video
Michael Fritzell (Asian Century Stocks)
RT @kimmonismus: This is 100% AI generated. And I couldnt tell anymore. Holy moly. https://t.co/v9OVfRSqwe

Kling 3.0 is truly "one giant leap for AI video generation"! Check out this amazing mockumentary from Kling AI Creative Partner Simon Meyer! https://t.co/Iyw919s6OJ
- Kling AI
tweet