Sberβs RnD team published GigaChat-3.1-Ultra and Lightning under MIT license on Hugging Face.
Both models are trained from scratch on Sberβs own compute and data, not fine-tuned from existing systems.
Ultra is a 702B MoE model that outperforms DeepSeek-V3-0324 and Qwen3-235B in math and reasoning benchmarks.
Lightning is a compact 10B MoE (1.8B active) that reaches high-level performance on arenas while staying fast and efficient.
The models support FP8 training, long context (up to 256k), and can run from large clusters to local environments.
It is clear that Sber is committed to making AI more accessible to developers by helping independent teams build assistants, copilots, and production systems on an open infrastructure.
Please open Telegram to view this post
VIEW IN TELEGRAM
β€110π109π―97π35π€34
This media is not supported in your browser
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
π₯102π€84π72β€71π€70π―11
Anthropic introduced its new AI model, Claude Mythos, designed to detect vulnerabilities faster than humans. The model has already identified critical bugs, including a 27-year-old flaw in OpenBSD and issues in FFmpeg and Linux.
Access to Claude Mythos is restricted to major partners such as Amazon, Google, Microsoft, and Nvidia. Anthropic considers the technology too powerful for public release and is currently testing it within a closed security program.
Please open Telegram to view this post
VIEW IN TELEGRAM
π€132π₯112β€111π―52π€―4
This media is not supported in your browser
VIEW IN TELEGRAM
An open source project from Milla Jovovich and Ben Sigman just dropped on GitHub. MemPalace claims 100% on LongMemEval, a level no model or agent has reached.
The system turns conversations into structured knowledge. It extracts facts, organizes them into a hierarchy, and uses semantic search to retrieve them. The approach is based on the βmemory palaceβ method.
A key part is AAAK compression, which packs the knowledge base into about 120 tokens of context.
Please open Telegram to view this post
VIEW IN TELEGRAM
1π€80π€―79π―78π75β€48
Google released AI Edge Eloquent, a free iOS app that turns raw speech into clean text. It works in real time, removes filler words, and copies the final transcript after recording. No subscription required.
The app runs fully offline, keeping data on-device. With Gemini enabled, it adds smarter processing through the cloud. It also supports summaries, tone editing, and tracks speaking stats.
Transcriptions are stored in history, and users can add custom words for better accuracy. It is part of Googleβs AI Edge lineup focused on local AI tools.
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
π56π53π―53π¦52π49π48π€©46π46π±42π’42π€37
An open source project called βDistill your colleague into an AI Skillβ picked up ~9k GitHub stars in a few days.
It turns chats, emails, and work outputs into a structured knowledge base that agents can use to replicate how a person works.
The system captures how someone solves tasks, what tools they use, and how they communicate, then packages it into reusable βskillsβ for AI agents.
Chinese media report employees using it to replicate coworkersβ roles, aiming to stay replaceable-proof during layoffs.
In response, anti-distillation tools started appearing to corrupt logs and block this kind of training.
Please open Telegram to view this post
VIEW IN TELEGRAM
β€138π129π107
Andrej Karpathy released karpathytalk.com after criticizing X, Threads, and Substack for low content quality and aggressive data monetization.
The platform is minimal by design. Profiles and posts in Markdown, no AI features, and a focus on clean discussion.
It targets developers and builders, with sign-up via GitHub.
Please open Telegram to view this post
VIEW IN TELEGRAM
β€91π71π68π67
Startups & Ventures
In a controlled experiment, Anthropic asked Claude Mythos to break out of a secure sandbox and report success. The model found a vulnerability and executed a multi-step exploit chain to bypass the environmentβs limits.
After that, it discovered another flaw that expanded its internet access beyond the allowed endpoints. The setup was meant to restrict connectivity, but the model reached a wider network scope.
Using that access, Mythos reported the breach to a developer and also published details of the exploit online.
Please open Telegram to view this post
VIEW IN TELEGRAM
π106β€88π77
Meta introduced Muse Spark, the first model from its new Superintelligence Lab led by Alexandr Wang. It is not open source yet, though the company says future versions may be.
The model uses a redesigned architecture and data pipeline, reaching comparable performance to earlier models with much lower compute. It lags top models like Opus 4.6 and GPT-5.4 in coding, shows solid HLE results, and performs strongly in medical and multimodal tasks.
Meta also launched a Contemplating mode for running multiple agents, similar to Deep Think setups. Muse Spark is available on meta.ai and will expand to WhatsApp, Instagram, Facebook, and Ray-Ban devices.
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
π―32π28π€27π€―27π23π€23π22π21π₯21π¦21β€20