Stuff
3 subscribers
200K links
Download Telegram
Show HN: AutoThink – Boosts local LLM performance by 43% with adaptive reasoning
19 by codelion | 2 comments on Hacker News.
I built AutoThink, a technique that makes local LLMs reason more efficiently by adaptively allocating computational resources based on query complexity. The core idea: instead of giving every query the same "thinking time," classify queries as HIGH or LOW complexity and allocate thinking tokens accordingly. Complex reasoning gets 70-90% of tokens, simple queries get 20-40%. I also implemented steering vectors derived from Pivotal Token Search (originally from Microsoft's Phi-4 paper) that guide the model's reasoning patterns during generation. These vectors encourage behaviors like numerical accuracy, self-correction, and thorough exploration. Results on DeepSeek-R1-Distill-Qwen-1.5B: - GPQA-Diamond: 31.06% vs 21.72% baseline (+43% relative improvement)- MMLU-Pro: 26.38% vs 25.58% baseline- Uses fewer tokens than baseline approaches Works with any local reasoning model - DeepSeek, Qwen, custom fine-tuned models. No API dependencies. The technique builds on two things I developed: an adaptive classification framework that can learn new complexity categories without retraining, and an open source implementation of Pivotal Token Search. Technical paper: https://ift.tt/WqaK0hF Code and examples: https://ift.tt/PUboNrj... PTS implementation: https://ift.tt/SnpeTvX I'm curious about your thoughts on adaptive resource allocation for AI reasoning. Have you tried similar approaches with your local models?
Ask HN: Is anyone using AI conversation partners?
6 by rickcarlino | 1 comments on Hacker News.
I'm obsessed applying LLMs in language learning software. One thing I am not-so-obsessed with is a wave of conversational chat apps that many startups have begun offering. Having tried them myself, I find them to be quite bland ("Tell me about your day!" ) and often use a speech style that uses direct translation of English phrases into the target language. I see plenty of potential for LLMs in this space, but the conversation bhat bots I have seen so far are too open ended and seem half baked. For users: Is anyone finding these tools helpful? For the people building them: Are people actually returning to the product?
Show HN: Connecting People Through AI-Powered Video Sentiment Matching
5 by armini | 0 comments on Hacker News.
Hi HN, I’d like to share www.kuky.com, a peer support network that connects people through short, self-recorded videos and matches them using sentiment analysis powered by large language models (LLMs). We’re building Kuky to help users find others who genuinely understand their emotional journey—not through swiping or likes, but through shared human stories. In this short Loom demo (link above), I walk through how: Users create a profile by uploading 3 videos: an intro, their mental health journey, and their likes/dislikes LLMs analyze each video for emotional tone, key themes, and psychological markers Based on this, Kuky intelligently connects users with similar experiences and emotional alignment We're passionate about creating a safe, empathetic space for authentic conversation—especially for people dealing with mental health challenges. For more background, here’s a feature on Kuky in Women Love Tech: https://ift.tt/EZoSD3v... Would love your thoughts on the concept, the matching algorithm, and how you’d imagine using something like this. Thanks,Armin
Every wondered how Facebook spoofs Gmail message list snippet text?
19 by chrisjj | 10 comments on Hacker News.
E.g. Gmail inbox shows a message contains "XXX tagged you on Facebook. Take a look about what she said on you." But when you open the message, there's no "Take a look about what she said on you." Answer. The text is present but hidden: Take a look at what she said about you. And unsurprisingly whenever I do click through, I find she hasn't said anything about me.
Vibe coding for teams, thoughts to date
5 by todsacerdoti | 3 comments on Hacker News.
AI: Accelerated Incompetence
120 by stevekrouse | 99 comments on Hacker News.
Show HN: Voiden – a free, offline, Git-native API Client
12 by kiselitza | 8 comments on Hacker News.
Hey HN! Aldin here, a helping hand to Voiden ( https://voiden.md ) Voiden is a free, offline, Git-native API client. Your API definitions, docs, and tests all live together. It came out of years of frustration: cloud sync lock-in, paywalled basics, bloated UIs, and lag on even simple requests. So the team built the opposite: an offline tool with no login, no telemetry, no lock-in. Just markdown and hotkeys. It behaves like code: local files, git branches, no cloud nonsense. Terminal is built-in, so you can commit, diff, and push changes right from the app. Docs stay close to your requests, so that API spec and what the API actually does never drift apart. No more scattered Postman, docs, and test files everywhere. A single source of truth. A minimalist GET request looks something like this: GET https://ift.tt/dJrcfvx Just hit /endpoint, paste the URL, and run it with Cmd/Ctrl + Enter. Not OSS (yet), but 100% local and free.Optional plugins will be coming down the line, but the core stays free. We'd love feedback from folks tired of overcomplicated and bloated API tooling.