Byte by Byte

Since 20/09/2024, Linux has been officially an RTOS:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=baeb9a7d8b60b021d907127509c44507539c15e5

94 views14:08

Byte by Byte

Valve will support Arch Linux working on critical projects on a freelance basis.

141 views11:39

Byte by Byte

ZLUDA Takes On Third Life: Open-Source Multi-GPU CUDA Implementation Focused On AI - Phoronix
https://www.phoronix.com/news/ZLUDA-Third-Life

Phoronix

ZLUDA Takes On Third Life: Open-Source Multi-GPU CUDA Implementation Focused On AI

The open-source ZLUDA project began life as a drop-in CUDA replacement that ran atop Intel GPUs using the Level Zero API

134 views20:39

Byte by Byte

in the meantime, Nvidia are about to open-source and release their multimodal model

https://nvlm-project.github.io/

133 views20:43

Byte by Byte

LLMs don't do any formal reasoning...

A new research on LLMs by six Apple researchers found “no evidence of formal reasoning in language models …. Their behavior is better explained by sophisticated pattern matching—so fragile, in fact, that changing names can alter results by ~10%!”.

👍1

685 views10:34

Byte by Byte

LLMs don't do any formal reasoning... A new research on LLMs by six Apple researchers found “no evidence of formal reasoning in language models …. Their behavior is better explained by sophisticated pattern matching—so fragile, in fact, that changing names…

2410.05229v1.pdf

1.5 MB

Full article:

629 viewsedited 10:34

Byte by Byte

0:13

This media is not supported in your browser

VIEW IN TELEGRAM

Microsoft just opensourced a blazing fast inference framework for 1-bit LLMS. Now you can run up to 100B LLMS locally on your CPU (GPU and NPU support coming in the future) and get 5-7 tokens/second.

https://github.com/microsoft/BitNet

609 views08:59

Byte by Byte

Brilliant article on Arm's approach for the future, challenges (AI, Qualcomm and Apple turning into competitors, AMD and Intel joining forces to improve x86) and mini-interview to Christopher Bergey, senior VP and GM of the Client Line of Business at Arm. A very good read.

https://www.theregister.com/AMP/2024/10/22/arm_custom_silicon_interview/

The Register

As Arm rivals cook up custom silicon, Mediatek sticks to tried-and-true Cortex recipe

Exec Chris Bergey tells us what the chip designer is doing to stay competitive

148 views17:49

Byte by Byte

Thomas Eugene Kurtz, father of BASIC, has died peacefully today in his retirement home. He was 96. Thank you, Thomas, for enabling at least two generations of human beings to get to learn and use a personal computer.

123 views13:54

Byte by Byte

This is truly the best time for retro gaming and emulation. Ten years ago I would have sworn we'd never see a working emulator for extremely complex systems such as the PS3 and its IBM Cell, a PowerPC-based CPU. Now RPCS3 runs even on ARM64 CPUs, including Apple Silicon, thus opening a new landscape for PS3 emulation even on extremely portable devices.

For more information on the specific RPCS3 case, read this blog post on their website.

512 viewsedited 10:51

Byte by Byte

Interesting article on AMD's quite incredible feat to run crude C++ code on their GPUs for AI workloads, thus threatening the de-facto monopoly Nvidia has with CUDA and their GPUs (biggest market cap in the world).

https://www.phoronix.com/news/AMD-Standard-C-Code-GPUs

Phoronix

How AMD Is Taking Standard C/C++ Code To Run Directly On GPUs

Back at the 2024 LLVM Developers' Meeting was an interesting presentation by AMD engineer Joseph Huber for how they have been exploring running common, standard C/C++ code directly on GPUs without having to be adapted for any GPU language / programming dialects…

🎄2

523 views10:37

Byte by Byte

Find your favourite telegram channel admin at FOSDEM 2025 (01-02/February)!

521 views07:23

Byte by Byte

The research paper "All You Need Is Attention" from the Google Research Team that came up with transformers (which radically changed and improved artificial intelligence and gave rise to genAI as we know it) might have found its successor.

https://arxiv.org/abs/2501.00663

Some good discussion and interpretation of results can be found in the comments section here.

arXiv.org

Titans: Learning to Memorize at Test Time

Over more than a decade there has been an extensive research effort on how to effectively utilize recurrent models and attention. While recurrent models aim to compress the data into a fixed-size...

523 views21:02

Byte by Byte

Imagine playing a simple game of Hangman with an AI.

You'd think that explaining the game mechanics step-by-step would be enough, but my favorite benchmark test for Large Language Models (LLMs) has consistently highlighted their limitations.

Most LLMs can't successfully play Hangman—even after a detailed explanation. They end up accepting all guesses as correct, fail to understand the turn-based nature of the game, and often mix up who should come up with the blank word or provide guesses. In my recent tests, between the commercially available LLMs, only ChatGPT 4o has been able to play Hangman correctly.

This quirky but telling failure shows that while these models can seem incredibly smart, there's still a big gap between surface-level understanding and actually grasping logical, step-by-step processes. Read more about it in my latest post on Substack.

Substack

Hangman and Circles

How LLMs struggle with Simple Reasoning

85 views18:14

Byte by Byte

New post on Substack. Just in time for a good lunch read, I suppose. Don't miss it!

https://atsetilam.substack.com/p/the-fabled-ai-on-the-edge

140 views12:55

Byte by Byte

Asahi Lina left the Asahi Linux project for “personal reasons”. We lost a fantastic developer who was focused on an incredibly difficult and extremely important work - bringing Linux onto the Apple Silicon platform. Such a pity. She says she is “safe”, luckily.

😢3

87 views11:12

Byte by Byte

We got GIMP 3.0 before GTA 6

🔥1

85 views15:54

Byte by Byte

"Model Context Protocol (MCP) is an AI tool calling standard that has been rapidly gaining adoption over the past few months. MCP tools give LLMs a standardized way to call functions, look up data, and interact with the world. Anthropic created the protocol and built the first GitHub MCP server, which grew to be one of the most popular MCP servers in the expanding ecosystem. We are excited to take ownership of the server and continue its development."

https://github.blog/changelog/2025-04-04-github-mcp-server-public-preview/

The GitHub Blog

github-mcp-server is now available in public preview - GitHub Changelog

Today we’re releasing a new open source, official, local GitHub MCP Server. We’ve worked with Anthropic to rewrite their reference server in Go and improve its usability. The new server…

88 views09:41

Byte by Byte

Microsoft has released a tool under MIT license to convert your docx into markdown.

https://github.com/microsoft/markitdown

GitHub

GitHub - microsoft/markitdown: Python tool for converting files and office documents to Markdown.

Python tool for converting files and office documents to Markdown. - microsoft/markitdown

77 views11:54

Byte by Byte

OpenAI has just released a FOSS CLI tool for “developers who already live in the terminal and want ChatGPT‑level reasoning plus the power to actually run code, manipulate files, and iterate – all under version control”.

https://github.com/openai/codex

GitHub

GitHub - openai/codex: Lightweight coding agent that runs in your terminal

Lightweight coding agent that runs in your terminal - openai/codex

75 views22:49

Byte by Byte

Babe wake up, Microsoft released a 1-bit LLM under MIT that is optimized for running on CPUs: microsoft/bitnet-b1.58-2B-4T

94 views11:19

About

Blog

Apps

Platform