HPC & Quantum
HPC Guru (Twitter)

Democratizing #AI: #Opensource scalable #LLM training on #GPU-based #supercomputers

Research team led by @UofMaryland nominated for the #GBPrize for AxoNN, a scalable distributed training framework that leverages GPUs to train LLMs

https://www.olcf.ornl.gov/2024/11/12/gordon-bell-prize-nomination-recognizes-efforts-to-train-extreme-scale-large-language-models-using-frontier/

#HPC #SC24
HPC Guru (Twitter)

RT @NVIDIAAIDev: Get over a 3x improvement in #LLM throughput - TensorRT-LLM Multiblock Attention on NVIDIA HGX H200 boosts long-sequence performance without impacting time-to-first-token.

Read how ➡️ nvda.ws/3CE6hgS
HPC Guru (Twitter)

China's "Global Scheduling Ethernet" appears to have the same goals as @ultraethernet

GSE has been deployed in a large cluster in China, with claims of substantial network performance improvements during #LLM training

https://www.theregister.com/2024/11/26/global_scheduling_ethernet_china_uec/

#AI #HPC via @TheRegister
HPC Guru (Twitter)

RT @thoefler: Great talking to LIU Bin, the Deputy President of @NUSingapore, about #AI, #HPC, and #Health

I'm looking forward to talking about efficient #GenAI and the past and future of #LLM computing on Jan 10 at @NUSComputing!

https://events.comp.nus.edu.sg/view/23280 🤖

We're in the Age of Computation!
HPC Guru (Twitter)

#AI and the future of everything: Five ways #AI will change our world as we know it

By Kirk Bresniker, @HPE Fellow and Chief Architect at Hewlett Packard Labs

https://www.hpe.com/us/en/newsroom/blog-post/2025/01/ai-and-the-future-of-everything-five-ways-ai-will-change-our-world-as-we-know-it.html

#HPC #LLM #GenerativeAI
HPC Guru (Twitter)

Claim: 100x Faster* at 1/10th the cost

* Decode tok/s, versus (a cluster of) H100 GPUs with 8-bit quantisation and TensorRT-LLM, on Llama 2 70B

Their website is: fractile.ai

#LLM #AI #Inference via @PGelsinger https://twitter.com/PGelsinger/status/1882159997167251812#m
HPC Guru (Twitter)

RT @thoefler: 🚀 Excited to kick off the 2025 @adia_lab seminar series on Tue!

I'll explore the role of computation in the history of LLMs and the rise of fascinating reasoning models (that authored this post 🤖)

Joining me are two inspiring speakers - can't wait for their insights! #AI #LLM https://twitter.com/adia_lab/status/1882035491790586121#m
HPC Guru (Twitter)

El Reg digs its claws into Middle Kingdom's latest chain-of-thought model - results from @TheRegister's tests on DeepSeek's R1

It can tell you how many Rs are in "strawberry", but nothing about the Tiananmen Square massacre

https://www.theregister.com/2025/01/26/deepseek_r1_ai_cot/

#AI #GenAI #LLM
HPC Guru (Twitter)

RT @thoefler: From #LLMs 🤖 to Reasoning Language Models 🧠 Three Eras in the Age of Computation!

🔥 Progress in #AI and #Computing 🎥 https://www.youtube.com/watch?v=NFwZi94S8qc

💡 Combining the best knowledge databases (#LLM) with the best strategy play (#RL) will be limited only by computational cost 🚀 #HPC
HPC Guru (Twitter)

RT @thoefler: Do you wonder how Reasoning Language Models like #DeepSeek R1 are made?

A fascinating mix of #ReinforcementLearning, #MCTS, and #LLM training and finetuning on #HPC supercomputers.

Check our thinking framework to derive new exciting #RLM implementations: buff.ly/4hA6GQq
HPC Guru (Twitter)

RT @hpcnotes: Rio Yokota of the Institute of Science Tokyo (the new name for Titech) giving an update on #LLM work using #supercomputers in Japan at #MW25NZ, mixed with an excellent broader discussion of international #AI evolution and advances

#HPC
HPCwire (Twitter)

A recent xAI headline seemed out of place. We take a closer look here: ow.ly/5YtN50VPrvt #AIdatacenter #LLM
HPC Guru (Twitter)

ICYMI: Microsoft and the University of Science and Technology of China trained LLMs using #FP4 for matrix multiplications and achieved accuracy comparable to LLMs trained in the popular #BF16 format

arxiv.org/abs/2501.17116

#AI #LLM via @AndrewYNg
HPCwire (Twitter)

RT @alex_woodie: During his talk at #TPC25, Satoshi Matsuoka of @RIKEN says there is much debate within the AI community about the size of models @HPCwire #LLM #AIforScience
HPCwire (Twitter)

Beijing's Moonshot AI (backed by Alibaba) just dropped Kimi K2, a 1-trillion-parameter open-weight LLM built for serious coding and agentic reasoning.

Learn more: ow.ly/yR9w50WwHTi

#LLM #MoonshotAI