Forwarded from Axis of Ordinary
Links for 2025-03-05
AI
1. Why do some LMs self-improve their reasoning while others hit a wall. Four key cognitive behaviors enable successful learning: Verification (checking work), Backtracking (trying new approaches), Subgoal Setting, and Backward Chaining (working backwards from a goal). https://arxiv.org/abs/2503.01307
2. A Three-Layer Model of LLM Psychology https://www.lesswrong.com/posts/zuXo9imNKYspu9HGv/a-three-layer-model-of-llm-psychology
3. Chain of Draft: Thinking Faster by Writing Less—80% fewer tokens per response yet maintains accuracy on math, commonsense, and other benchmarks. On GSM8k math problems, CoD achieved 91% accuracy with an 80% token reduction compared to CoT. https://arxiv.org/abs/2502.18600
4. Reasoning models will enable superhuman capabilities in “pure reasoning tasks” such as mathematics and abstract problem-solving https://epoch.ai/gradient-updates/the-promise-of-reasoning-models
5. SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers — “Our findings highlight the potential of LLMs to push the boundaries of mathematical reasoning and tackle NP-hard problems.” https://arxiv.org/abs/2502.20545
6. LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction https://arxiv.org/abs/2502.17925
7. The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models https://arxiv.org/abs/2503.02875
8. How Much Are LLMs Actually Boosting Real-World Programmer Productivity? https://www.lesswrong.com/posts/tqmQTezvXGFmfSe7f/how-much-are-llms-actually-boosting-real-world-programmer
9. New results on AI and lawyer productivity https://marginalrevolution.com/marginalrevolution/2025/03/new-results-on-ai-and-lawyer-productivity.html
10. German nuclear fusion startup Proxima Fusion works on a smart AI-assisted stellarator concept https://www.proximafusion.com/press-news/proxima-fusion-and-partners-publish-stellaris-fusion-power-plant-concept-to-bring-limitless-safe-clean-energy-to-the-grid
11. Alexa+: the next generation of Alexa—it uses Amazon's own Nova models as well as Claude, and will dynamically switch to the best model for each task. https://www.aboutamazon.com/news/devices/new-alexa-generative-artificial-intelligence
12. Opera's new Al-powered Operator browser can surf the web for you https://blogs.opera.com/news/2025/03/opera-browser-operator-ai-agentics/
AI politics
1. “The Government Knows A.G.I. is Coming” https://www.nytimes.com/2025/03/04/opinion/ezra-klein-podcast-ben-buchanan.html [no paywall: https://archive.is/cj6G1]
2. Scale AI announces multimillion-dollar defense deal, a major step in U.S. military automation https://www.cnbc.com/2025/03/05/scale-ai-announces-multimillion-dollar-defense-military-deal.html
3. Alibaba's CEO: They’re going all-in on AGI development as their primary focus. https://www.bloomberg.com/news/articles/2025-02-20/alibaba-ceo-wu-says-agi-is-now-company-s-primary-objective [no paywall: https://archive.is/0S4H9]
Brains
1. New minimally-invasive neural interface can be placed almost anywhere in the brain through a single spinal tap. https://www.nature.com/articles/s41551-024-01281-9
2. Can we compare subjective experiences (qualia) between individuals? https://www.cell.com/iscience/fulltext/S2589-0042(25)00289-5
Biotech and Security
1. Roche next generation sequencing https://www.youtube.com/watch?v=G8ECt04qPos
2. Delivering therapeutics to the brain through intranasal application of engineered commensal bacteria https://www.cell.com/cell/fulltext/S0092-8674(25)00046-7
3. Methods for strong human germline engineering https://www.lesswrong.com/posts/2w6hjptanQ3cDyDw7/methods-for-strong-human-germline-engineering
Technology
1. Amazon announces Ocelot quantum chip https://www.amazon.science/blog/amazon-announces-ocelot-quantum-chip
2. As of today, you can fit an ENTIRE COMPUTER into a single piece of thread. Analog sensing, LEDs, bluetooth comms, processing, digital memory - it's all there https://www.nature.com/articles/s41586-024-08568-6
AI
1. Why do some LMs self-improve their reasoning while others hit a wall. Four key cognitive behaviors enable successful learning: Verification (checking work), Backtracking (trying new approaches), Subgoal Setting, and Backward Chaining (working backwards from a goal). https://arxiv.org/abs/2503.01307
2. A Three-Layer Model of LLM Psychology https://www.lesswrong.com/posts/zuXo9imNKYspu9HGv/a-three-layer-model-of-llm-psychology
3. Chain of Draft: Thinking Faster by Writing Less—80% fewer tokens per response yet maintains accuracy on math, commonsense, and other benchmarks. On GSM8k math problems, CoD achieved 91% accuracy with an 80% token reduction compared to CoT. https://arxiv.org/abs/2502.18600
4. Reasoning models will enable superhuman capabilities in “pure reasoning tasks” such as mathematics and abstract problem-solving https://epoch.ai/gradient-updates/the-promise-of-reasoning-models
5. SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers — “Our findings highlight the potential of LLMs to push the boundaries of mathematical reasoning and tackle NP-hard problems.” https://arxiv.org/abs/2502.20545
6. LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction https://arxiv.org/abs/2502.17925
7. The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models https://arxiv.org/abs/2503.02875
8. How Much Are LLMs Actually Boosting Real-World Programmer Productivity? https://www.lesswrong.com/posts/tqmQTezvXGFmfSe7f/how-much-are-llms-actually-boosting-real-world-programmer
9. New results on AI and lawyer productivity https://marginalrevolution.com/marginalrevolution/2025/03/new-results-on-ai-and-lawyer-productivity.html
10. German nuclear fusion startup Proxima Fusion works on a smart AI-assisted stellarator concept https://www.proximafusion.com/press-news/proxima-fusion-and-partners-publish-stellaris-fusion-power-plant-concept-to-bring-limitless-safe-clean-energy-to-the-grid
11. Alexa+: the next generation of Alexa—it uses Amazon's own Nova models as well as Claude, and will dynamically switch to the best model for each task. https://www.aboutamazon.com/news/devices/new-alexa-generative-artificial-intelligence
12. Opera's new Al-powered Operator browser can surf the web for you https://blogs.opera.com/news/2025/03/opera-browser-operator-ai-agentics/
AI politics
1. “The Government Knows A.G.I. is Coming” https://www.nytimes.com/2025/03/04/opinion/ezra-klein-podcast-ben-buchanan.html [no paywall: https://archive.is/cj6G1]
2. Scale AI announces multimillion-dollar defense deal, a major step in U.S. military automation https://www.cnbc.com/2025/03/05/scale-ai-announces-multimillion-dollar-defense-military-deal.html
3. Alibaba's CEO: They’re going all-in on AGI development as their primary focus. https://www.bloomberg.com/news/articles/2025-02-20/alibaba-ceo-wu-says-agi-is-now-company-s-primary-objective [no paywall: https://archive.is/0S4H9]
Brains
1. New minimally-invasive neural interface can be placed almost anywhere in the brain through a single spinal tap. https://www.nature.com/articles/s41551-024-01281-9
2. Can we compare subjective experiences (qualia) between individuals? https://www.cell.com/iscience/fulltext/S2589-0042(25)00289-5
Biotech and Security
1. Roche next generation sequencing https://www.youtube.com/watch?v=G8ECt04qPos
2. Delivering therapeutics to the brain through intranasal application of engineered commensal bacteria https://www.cell.com/cell/fulltext/S0092-8674(25)00046-7
3. Methods for strong human germline engineering https://www.lesswrong.com/posts/2w6hjptanQ3cDyDw7/methods-for-strong-human-germline-engineering
Technology
1. Amazon announces Ocelot quantum chip https://www.amazon.science/blog/amazon-announces-ocelot-quantum-chip
2. As of today, you can fit an ENTIRE COMPUTER into a single piece of thread. Analog sensing, LEDs, bluetooth comms, processing, digital memory - it's all there https://www.nature.com/articles/s41586-024-08568-6
🐳1
Axis of Ordinary
In a smarter and more rational world...
Musings of a human mind that has never seen India/China Liveleak Kino footage before
🐳1
Not to shouldpost but I should post more. Will keep you all updated. Keep your eyes peeled for a substack.
🐳1
Forwarded from Hacker News
Phoronix
AMD Announces "Instella" Fully Open-Source 3B Language Models
Another announcement at AMD today beyond the open-source Linux driver fun for the Radeon RX 9070 series is announcing the open-sourcing of Instella as their new fully open 3B parameter language models.
🐳1
Hacker News
Age and cognitive skills: Use it or lose it Article, Comments
Essentially another W for gamers
🐳1
/g/‘s Tech Memes
Photo
Just to let you guys know that this Machead is beyond retarded. You can escape insert mode in vim with C-c (AKA ctrl+c)
🤓2✍1🐳1
Forwarded from Hacker News
LocalThunk
The Balatro Timeline — LocalThunk
It’s been approximately 3 years since I began work on Balatro - and in that time I have personally documented almost nothing about the journey. This is something that has bothered me since the game launched. I am constantly forgetting major moments in development…
🐳1
Forwarded from Hacker News
X (formerly Twitter)
Thomas Wolf (@Thom_Wolf) on X
I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a "compressed 21st century".
The "compressed 21st century" comes from Dario's "Machine of Loving Grace" and if you haven’t…
The "compressed 21st century" comes from Dario's "Machine of Loving Grace" and if you haven’t…
🐳1
ヒマワリ会 Sunflower Society (App Banned)
Photo
S&P500 was the only instrument that hedged against inflation.
🐳1
I don't know how to get through to people. How many times do I have to tap the fucking sign... iykyk... ywgi... ngmi... io.... mmd... tnd...
🐳1🤓1
placeholder
must've been destiny's culture of debating, the wealth generated during biden or COVID isolation
It's parasite stress.
👍1🐳1