Peter Steinberger 🦞 / @steipete:
RT by @steipete: Everyone says the latest AI agents will be "job-ready" soon, especially after the release of Fable 5 this week. But is that really the case?
Over the past many months, my group and collaborators have been building Agents' Last Exam (ALE), a benchmark designed to test exactly that claim on real digital labor-market work.
My group and collaborators previously have created many of the benchmarks the field runs on, including MMLU, MATH, CyberGym, and ExploitGym. Today, I'm exci...
RT by @steipete: Everyone says the latest AI agents will be "job-ready" soon, especially after the release of Fable 5 this week. But is that really the case?
Over the past many months, my group and collaborators have been building Agents' Last Exam (ALE), a benchmark designed to test exactly that claim on real digital labor-market work.
My group and collaborators previously have created many of the benchmarks the field runs on, including MMLU, MATH, CyberGym, and ExploitGym. Today, I'm exci...
Peter Steinberger 🦞 / @steipete:
RT by @steipete: We've updated the Artificial Analysis Coding Agent Index, replacing SWE-Bench Pro with Datacurve's DeepSWE benchmark - the swap lifts Codex with GPT-5.5 (xhigh) above Claude Code with Opus 4.8 (max), while the newly released Claude Fable 5 (max) in Claude Code debuts at the top
DeepSWE, built by @datacurve, writes its tasks from scratch rather than adapting them from public GitHub issues or pull requests, so no model has seen the solutions during training. That matters becau...
RT by @steipete: We've updated the Artificial Analysis Coding Agent Index, replacing SWE-Bench Pro with Datacurve's DeepSWE benchmark - the swap lifts Codex with GPT-5.5 (xhigh) above Claude Code with Opus 4.8 (max), while the newly released Claude Fable 5 (max) in Claude Code debuts at the top
DeepSWE, built by @datacurve, writes its tasks from scratch rather than adapting them from public GitHub issues or pull requests, so no model has seen the solutions during training. That matters becau...
Peter Steinberger 🦞 / @steipete:
RT by @steipete: appshots in codex is the most useful piece of software on my Mac
most of my prompts these days are:
- cmd + cmd investigate this
- cmd + cmd open a PR to fix this
- cmd + cmd run the eval on these set of prompts and discussion
- cmd + cmd set a heartbeat to keep following up with this
and many more cases
trust me, you’ve got to try it
RT by @steipete: appshots in codex is the most useful piece of software on my Mac
most of my prompts these days are:
- cmd + cmd investigate this
- cmd + cmd open a PR to fix this
- cmd + cmd run the eval on these set of prompts and discussion
- cmd + cmd set a heartbeat to keep following up with this
and many more cases
trust me, you’ve got to try it
F-Droid - Free and Open Source Android App Repository:
We didn't choose the pastel colors
We didn't choose the pastel colors
f-droid.org
We didn't choose the pastel colors | F-Droid - Free and Open Source Android App Repository
This Week in F-Droid TWIF curated on Friday, 12 Jun 2026, Week 24 F-Droid core F-Droid and F-Droid Basic were updated to 2.0-alpha10. We tooted a toot that y...
Magrão Hortaliças:
PLANTIO ESCALONADO E PRAGAS NA HORTA
PLANTIO ESCALONADO E PRAGAS NA HORTA
YouTube
PLANTIO ESCALONADO E PRAGAS NA HORTA
Gostou do nosso conteúdo? Inscreva-se no canal do Magrão, deixe o seu 𝗹𝗶𝗸𝗲 𝗲 𝗰𝗼𝗺𝗲𝗻𝘁𝗲 o que você quer ver nos próximos vídeos pra nos fortalecer!
Redes sociais:
Facebook: https://www.facebook.com/magraohorta/
Instagram: https://www.instagram.com/magrao_horta/
Redes sociais:
Facebook: https://www.facebook.com/magraohorta/
Instagram: https://www.instagram.com/magrao_horta/
Alessandro Mattos / @Apenasam:
A pergunta é: por que a China está tão interessada em expandir sua presença financeira, energética e logística na América Latina justamente agora?
Nos últimos anos vimos uma aceleração dos investimentos chineses em portos, energia, ferrovias e infraestrutura na região. Ao mesmo tempo, os EUA passaram a tratar a América Latina cada vez mais como um espaço estratégico em sua disputa com Pequim.
Por isso, talvez estejamos olhando para um movimento maior.
O debate é sobre influência. Não estam...
A pergunta é: por que a China está tão interessada em expandir sua presença financeira, energética e logística na América Latina justamente agora?
Nos últimos anos vimos uma aceleração dos investimentos chineses em portos, energia, ferrovias e infraestrutura na região. Ao mesmo tempo, os EUA passaram a tratar a América Latina cada vez mais como um espaço estratégico em sua disputa com Pequim.
Por isso, talvez estejamos olhando para um movimento maior.
O debate é sobre influência. Não estam...