Many takeaways from this talk, but this one stood out the most for me:
Karpathy is skeptical of the simplistic "agents will do everything" narrative.
His preferred model is partial autonomy.
He talks about keeping AI "on a leash": building systems where the AI generates and the human verifies.
Don't jump straight to total delegation.
Instead:
Let AI do pieces, keep human approval at key checkpoints, compress human verification cost, and increase autonomy only where reliability earns it.
https://youtu.be/96jN2OCOfLs?si=0Hu5-n40cJ5jUnMs
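A rough sketch of how "increase autonomy only where reliability earns it" could look in code. This is my own illustration, not something shown in the talk: the `Leash` and `TrackRecord` names and both thresholds are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TrackRecord:
    """Human-verified outcomes for one kind of task (hypothetical bookkeeping)."""
    approved: int = 0
    rejected: int = 0

    @property
    def total(self) -> int:
        return self.approved + self.rejected

    @property
    def approval_rate(self) -> float:
        return self.approved / self.total if self.total else 0.0


class Leash:
    """Decides whether AI output for a task type still needs human sign-off."""

    def __init__(self, min_samples: int = 20, min_rate: float = 0.95):
        # Both thresholds are illustrative assumptions, not values from the talk.
        self.min_samples = min_samples
        self.min_rate = min_rate
        self.records: dict[str, TrackRecord] = {}

    def needs_human(self, task_type: str) -> bool:
        """Autonomy is earned only after enough reviews with a high approval rate."""
        record = self.records.get(task_type, TrackRecord())
        earned = (record.total >= self.min_samples
                  and record.approval_rate >= self.min_rate)
        return not earned

    def record_review(self, task_type: str, approved: bool) -> None:
        """Store the result of one human verification."""
        record = self.records.setdefault(task_type, TrackRecord())
        if approved:
            record.approved += 1
        else:
            record.rejected += 1


leash = Leash(min_samples=3, min_rate=0.9)
print(leash.needs_human("rename-variable"))  # True: no track record yet
for _ in range(3):
    leash.record_review("rename-variable", approved=True)
print(leash.needs_human("rename-variable"))  # False: three clean reviews
```

A later rejection drags the approval rate back under the threshold, so the checkpoint comes back on its own.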
YouTube
Andrej Karpathy: From Vibe Coding to Agentic Engineering
Andrej Karpathy (co-founder of OpenAI, former head of AI at Tesla, and now founder of Eureka Labs) talks with Sequoia partner Stephanie Zhan at AI Ascent 2026 about what's changed in the year since he coined "vibe coding." He explains why he's never felt…
👍2
Chief Justice Sundaresh Menon says 92 percent of new lawyers were already using AI in their work.
"If these foundational tasks are increasingly outsourced to machines, and if it becomes uneconomic to have them performed by young lawyers, then we must confront a serious question: how are we going to redesign our workflows and processes to ensure that our young lawyers acquire the instincts, the discipline, and the professional judgment that these very tasks once helped cultivate?"
https://www.channelnewsasia.com/singapore/2026-mass-call-high-court-chief-justice-ai-work-practices-6067356
CNA
1 in 3 new lawyers may quit within 3 years due to workload, poor culture: Chief Justice
Citing a survey conducted among lawyers at this year's mass call, Chief Justice Sundaresh Menon says 92 per cent of new lawyers were already using AI in their work.
Just shipped Prompt Architect, open source on GitHub:
If you struggle with how to prompt and end up with unsatisfying outputs, this is for YOU!
A Claude plugin built straight from Anthropic's "Prompting 101" workshop framework (https://www.youtube.com/watch?v=ysPbXH0LpIE)
The skill works interview-style: it asks 4 questions about your task, then hands back a structured prompt with a test plan.
No more trial-and-error with your prompts.
Works in Claude Cowork (desktop) and Claude Code (CLI).
github.com/hosanxiv/prompt-architect
Try it out!
❤6🔥1
The AI Burrow 🐰🕳️
Just shipped Prompt Architect — open source on github: If you struggle with how to prompt, resulting in dissatisfying outputs - this is for YOU! A Claude plugin built straight from Anthropic's "Prompting 101" workshop framework (https://www.youtube.com…
Creating this skill was a good learning experience.
Had to iterate with Claude for quite a while. The skill was easy to install on Claude Code, but Claude didn't recognise its own framework when it came to installing it on the app (now fixed!)
👍2
ChatGPT on a roll - now available as an add-on in Excel and Google Sheets.
https://chatgpt.com/apps/spreadsheets/
❤2
https://x.com/claudeai/status/2052060691893227611?s=46
Claude subscribers, go forth and create something with the usage refresh!
X (formerly Twitter)
Claude (@claudeai) on X
We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity.
This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.
❤2
https://blog.google/products-and-platforms/devices/fitbit/fitbit-air/
Google just released the Fitbit Air, a 24/7 screenless health tracker: their version of a WHOOP killer.
Would you cop?
Google
Introducing the all-new Fitbit Air
The new lightweight, screenless Fitbit Air has a long battery life and offers in-depth health and wellness insights.
Something light for the weekend
Try /radio on Claude Code
https://x.com/claudedevs/status/2052818282294726699?s=46
X (formerly Twitter)
ClaudeDevs (@ClaudeDevs) on X
/radio
https://x.com/livinoffwater/status/2052710806568267897?s=46
If you have a hardware device, try this out! Codex Pets is quite kawaii
X (formerly Twitter)
Natalie (@livinoffwater) on X
We are bringing Codex Pets to the physical world
Pick your pet, your hardware and bring them to life
Connect to Codex desktop and never miss when the job finished
Try it out and let me know what you think!
I shipped something this weekend that scratched a very personal itch: KiasuMiles
my wife keeps asking me which card to use at the cashier. I keep answering confidently. i'm not always right. the problem isn't laziness — it's that Singapore credit card earn rates depend on MCC codes (Merchant Category Codes), which don't always match what the merchant looks like. a café might code as fast food. a supermarket might code as a department store. tap the wrong card and you've lost miles you'll never get back, and that could possibly cost you a trip to Bali, or even Tokyo.
so I built KiasuMiles — an MCP server that runs locally on your machine. you tell it which cards you carry (in plain English, once), and from then on you just ask your AI agent - and it’ll tell you which card to use, based on the cards you have.
"what card at Sheng Siong?" → UOB PP Visa, 4mpd, Cap: SGD 600/month, high confidence.
"best card for Grab?" → it tells you.
"which card at Sushi Tei?" → it tells you that too, with the cap and a confidence level.
no API keys. no cloud. one install command:
pip3 install kiasumiles-mcp && kiasumiles-setup
works on Claude Desktop, Claude Code, OpenClaw, Hermes.
48 cards supported across all major SG banks.
full build story — including the bugs I caught, the 150 lines I deleted, and the time the tool was confidently wrong at Sheng Siong — on my substack: https://hosanxiv.substack.com/p/every-wrong-tap-is-miles-you-earned
check it out here, do welcome your feedback: github.com/hosanxiv/kiasumiles
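for anyone curious how an MCC-based lookup can work, here is a minimal sketch. it is not the actual KiasuMiles code: the merchants, card names, earn rates and caps below are all made up for illustration, and `best_card` is a hypothetical helper. only the MCC numbers are standard category codes (5814 fast food, 5311 department stores, 5411 supermarkets).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EarnRule:
    card: str                 # placeholder card name
    mcc: str                  # Merchant Category Code the rate applies to
    mpd: float                # miles per dollar
    cap_sgd: Optional[float]  # monthly spend cap in SGD, None if uncapped


# Illustrative data only: merchants, cards, rates and caps are all made up.
MERCHANT_MCC = {
    "corner cafe": ("5814", "high"),  # looks like a cafe, codes as fast food
    "mega mart": ("5311", "low"),     # looks like a supermarket, codes as department store
}

RULES = [
    EarnRule("Card A", "5814", 4.0, 1000.0),
    EarnRule("Card A", "5311", 1.2, None),
    EarnRule("Card B", "5411", 4.0, 600.0),
    EarnRule("Card B", "5311", 0.4, None),
]


def best_card(merchant: str, wallet: list[str]) -> dict:
    """Pick the highest-earning card in the wallet for the merchant's MCC."""
    entry = MERCHANT_MCC.get(merchant.strip().lower())
    if entry is None:
        # Unknown merchant: say so instead of guessing.
        return {"merchant": merchant, "card": None, "confidence": "unknown"}
    mcc, confidence = entry
    candidates = [r for r in RULES if r.mcc == mcc and r.card in wallet]
    if not candidates:
        return {"merchant": merchant, "mcc": mcc, "card": None,
                "confidence": confidence}
    best = max(candidates, key=lambda r: r.mpd)
    return {
        "merchant": merchant, "mcc": mcc, "card": best.card,
        "mpd": best.mpd, "cap_sgd": best.cap_sgd, "confidence": confidence,
    }


print(best_card("Mega Mart", ["Card A", "Card B"]))
```

in this toy data "Mega Mart" looks like a supermarket but codes as a department store, so Card B's 4 mpd supermarket rate never applies and Card A wins at 1.2 mpd. the confidence label travels with the merchant-to-MCC mapping, since that mapping is the part most likely to be wrong.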
🔥8❤4
Would I do this project again?
honestly? probably not.
the data collection alone was brutal - cross-referencing multiple sources that all give you different numbers, none of them clearly wrong. and that's before you get into the permutations: cards with spend windows, cards where the bonus category changes per cardholder, promotional caps that expire without announcement.
got it done. wouldn't sign up for it twice. 😅
❤1
Really good read by YC CEO on how to make your agent better.
He describes how he uses it to ingest books, identify ideas that apply specifically to his own life, and prep for meetings.
Gonna be trying this book-mirror skill out!
Book-mirror skill extracts book chapters then maps every idea specifically to my life context, family history, YC work, and therapy, producing 30,000-word personalized brain pages in 40 minutes.
https://x.com/garrytan/status/2053127519872614419?s=46
X (formerly Twitter)
Garry Tan (@garrytan) on X
Meta-Meta-Prompting: The Secret to Making AI Agents Work
❤2
What Meta used to do with training their algorithms on how something goes viral, by looking at how the brain reacts, is now available for the masses.
Might be slop for its initial release, but I think you can take the same idea and iterate on it.
https://x.com/higgsfield/status/2053139109074657482
X (formerly Twitter)
Higgsfield AI 🧩 (@higgsfield) on X
Higgsfield releases Virality Predictor
What does it mean:
> Upload any clip up to 15s
> Get viral potential, hook score & hold rate
> See a heatmap of brain regions your clip activates
> Pair with Ad Reference for recreated videos
Available via MCP/CLI…
❤1
https://github.com/mvanhorn/printing-press-library/tree/main/library/devices/whoop
If you use WHOOP, install this on your OpenClaw/Hermes agent
Sample prompts:
“Every morning, use pp-whoop and give me a one-paragraph readiness brief plus a green/yellow/red training recommendation.”
“Pull yesterday’s WHOOP data and append a summary to my daily journal.”
“Combine my WHOOP recovery with my calendar and tell me which meeting-heavy days correlate with bad sleep.”
“Compare my WHOOP strain and recovery before and after travel days.”
“If my recovery is below 40%, draft a lighter training plan for today.”
GitHub
printing-press-library/library/devices/whoop at main · mvanhorn/printing-press-library
Official library of CLIs generated by the CLI Printing Press. Endorsed, tested, and community-contributed. - mvanhorn/printing-press-library
What I’m still learning from KiasuMiles…
Credit card rules keep drifting, and merchant codes change.
There is no single source of truth: I still have to comb through various sites and communities for information, collate it, and fact-check the sources.
But on the bright side, it's a lot easier for the wife to know which cards to use, now that she has access to her own agent on Telegram 😛
❤1
Thought of tinkering with this trending repo (Openhuman - A Simple, UI-first & Human Desktop Agent), so I ran the code through Claude, and it did not pass:
HIGH concern — OAuth tokens stored on proprietary cloud
Your Gmail, Slack, GitHub etc. OAuth tokens go through api.tinyhumans.ai. The company holds them encrypted on their backend (Redis), fetches them to your local app on demand. They claim split-key encryption (you hold one key locally), but you cannot audit their backend. If their server is compromised, your integrations are at risk.
MEDIUM — Sentry telemetry on by default
Error data + session context sent to Sentry out of the box. It can be disabled with OPENHUMAN_ANALYTICS_ENABLED=false, but you have to opt out.
MEDIUM — CDP debugger exposed on localhost:19222
Any process running on your machine can attach a Chrome DevTools debugger to the embedded webviews — which contain live WhatsApp/Gmail sessions. This is a local privilege concern.
MEDIUM — JS injection into your accounts
The app injects JavaScript "recipes" into embedded WhatsApp Web, Telegram, Gmail, Slack, Discord webviews to scrape messages. That scraped data flows into local memory and potentially through their backend.
LOW — CEF keychain bypass
Passes --use-mock-keychain to Chromium, so browser credentials in embedded webviews use weaker storage than native OS keychain.
No hardcoded secrets or eval() patterns found. The code is generally clean.
Lesson here: always check and audit before installing things that you see on X, especially on your local computer.
https://github.com/tinyhumansai/openhuman
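If you already have it running and want a quick check on the debugger finding, here is a minimal sketch that tests whether anything is accepting connections on the port named in the audit. The `port_is_open` helper is my own, and an open port only shows that something is listening there, not that it is the CDP debugger.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 0.5) -> bool:
    """Return True if something accepts TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an error code otherwise.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    # 19222 is the port named in the audit above.
    if port_is_open("127.0.0.1", 19222):
        print("Something is listening on 127.0.0.1:19222")
    else:
        print("Nothing is listening on 127.0.0.1:19222")
```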