HPC Guru (Twitter)
#ML-based methodology for HPC facilities supervision
Finding a Needle in an #HPC Haystack: Using #AI to Monitor Anomalies at Scale
By Thomas Leibovici, Head of laboratory at CEA
https://www.hpcwire.com/2024/01/11/finding-a-needle-in-an-hpc-haystack-using-ai-to-monitor-anomalies-at-scale/
#ML-based methodology for HPC facilities supervision
Finding a Needle in an #HPC Haystack: Using #AI to Monitor Anomalies at Scale
By Thomas Leibovici, Head of laboratory at CEA
https://www.hpcwire.com/2024/01/11/finding-a-needle-in-an-hpc-haystack-using-ai-to-monitor-anomalies-at-scale/
HPC Guru (Twitter)
RT @AyarLabs: Term of the Week:
Pipeline Parallelism – A technique in AI, particularly in deep learning, that divides a model into stages, each executed on a different device, to improve efficiency.
bit.ly/3UkcPI9
#optical #photonics #semiconductor #AI #ML
RT @AyarLabs: Term of the Week:
Pipeline Parallelism – A technique in AI, particularly in deep learning, that divides a model into stages, each executed on a different device, to improve efficiency.
bit.ly/3UkcPI9
#optical #photonics #semiconductor #AI #ML
Ayar Labs
Pipeline Parallelism - Ayar Labs
Pipeline Parallelism — A technique in AI that divides a model into stages, each executed on a different device, to improve efficiency.
HPC Guru (Twitter)
RT @EvanKirstel: China's top AI accelerator and CPU makers are bleeding tens of millions -- Longsoon and Cambricon losses continue despite billions in government subsidies https://www.tomshardware.com/pc-components/cpus/chinas-top-ai-accelerator-and-cpu-designers-are-bleeding-tens-of-millions-longsoon-and-cambricon-losses-continue-despite-billions-in-government-subsidies #AI #ML #ArtificialIntelligence #MachineLearning #GenAI @HPC_Guru
RT @EvanKirstel: China's top AI accelerator and CPU makers are bleeding tens of millions -- Longsoon and Cambricon losses continue despite billions in government subsidies https://www.tomshardware.com/pc-components/cpus/chinas-top-ai-accelerator-and-cpu-designers-are-bleeding-tens-of-millions-longsoon-and-cambricon-losses-continue-despite-billions-in-government-subsidies #AI #ML #ArtificialIntelligence #MachineLearning #GenAI @HPC_Guru
insideHPC.com (Twitter)
Updated MLPerf AI Inference Benchmark Results Released
wp.me/p3RLHQ-onT
@MLCommons #HPC #AI #ML #MachineLearning #AIinference #inference
Updated MLPerf AI Inference Benchmark Results Released
wp.me/p3RLHQ-onT
@MLCommons #HPC #AI #ML #MachineLearning #AIinference #inference
High-Performance Computing News Analysis | insideHPC
Updated MLPerf AI Inference Benchmark Results Released
Today, MLCommons announced new results from its MLPerf Inference v4.0 benchmark suite with machine learning (ML) system performance benchmarking. To view [...]
HPC Guru (Twitter)
Google announced its 6th generation #TPU (Trillium) at its annual I/O event and there are still more questions than answers about them at this point
https://www.nextplatform.com/2024/06/10/lots-of-questions-on-googles-trillium-tpu-v6-a-few-answers/
#AI #ML via @TDaytonPM
Google announced its 6th generation #TPU (Trillium) at its annual I/O event and there are still more questions than answers about them at this point
https://www.nextplatform.com/2024/06/10/lots-of-questions-on-googles-trillium-tpu-v6-a-few-answers/
#AI #ML via @TDaytonPM
X (formerly Twitter)
#TPU - Search / X
See posts about #TPU on X. See what people are saying and join the conversation.
HPC Guru (Twitter)
Join ALCF for an in-person hands-on #HPC Workshop Oct 29-31 @argonne
o Hands-on time on Polaris and AI Testbeds
o Includes a tour of Argonne facilities, including #Aurora
Register by Sept 16
https://www.alcf.anl.gov/events/2024-alcf-hands-hpc-workshop
#HPC #AI #ML via @argonne_lcf
Join ALCF for an in-person hands-on #HPC Workshop Oct 29-31 @argonne
o Hands-on time on Polaris and AI Testbeds
o Includes a tour of Argonne facilities, including #Aurora
Register by Sept 16
https://www.alcf.anl.gov/events/2024-alcf-hands-hpc-workshop
#HPC #AI #ML via @argonne_lcf
HPC Guru (Twitter)
RT @mgg_ch: “Switzerland should really make energy cheaper! We should build an energy infrastructure that can lead us in the future” @thoefler @spcl_eth @cscsch keynote at the scientific symposium for the Inauguration of #Alps #AI #ML #HPC #weareALPS
RT @mgg_ch: “Switzerland should really make energy cheaper! We should build an energy infrastructure that can lead us in the future” @thoefler @spcl_eth @cscsch keynote at the scientific symposium for the Inauguration of #Alps #AI #ML #HPC #weareALPS
HPC Guru (Twitter)
RT @mgg_ch: #HPC is necessary to discover new materials and is essential if we want to carry on cutting-edge research for a new advancement of the civilization @NicolaSpaldin @ETH keynote at the Scientific Symposium for the Inauguration of #Alps @cscsch #ML #weareALPS
RT @mgg_ch: #HPC is necessary to discover new materials and is essential if we want to carry on cutting-edge research for a new advancement of the civilization @NicolaSpaldin @ETH keynote at the Scientific Symposium for the Inauguration of #Alps @cscsch #ML #weareALPS
HPC Guru (Twitter)
Addressing the deluge of big data generated by big science
Standing up the nation’s #supercomputing pipeline for streaming #bigdata in real time
https://www.ornl.gov/news/standing-nations-supercomputing-pipeline-streaming-big-data-real-time
#HPC #AI #ML via @ORNL
Addressing the deluge of big data generated by big science
Standing up the nation’s #supercomputing pipeline for streaming #bigdata in real time
https://www.ornl.gov/news/standing-nations-supercomputing-pipeline-streaming-big-data-real-time
#HPC #AI #ML via @ORNL
HPC Guru (Twitter)
Revisiting Reliability in Large-Scale #ML Clusters
@glennklockwood's notes on the @Meta paper
#AI community is arriving at the same conclusions around quantifying reliability as the #HPC community
👏👏@Meta for being open about how they operate
https://glennklockwood.com/garden/papers/revisiting-reliability-in-large-scale-machine-learning-research-clusters
Revisiting Reliability in Large-Scale #ML Clusters
@glennklockwood's notes on the @Meta paper
#AI community is arriving at the same conclusions around quantifying reliability as the #HPC community
👏👏@Meta for being open about how they operate
https://glennklockwood.com/garden/papers/revisiting-reliability-in-large-scale-machine-learning-research-clusters
HPC Guru (Twitter)
Countdown to #SC24: Keynote speaker on Nov 19 will be Dr. Nicola Fox (@SolarGirl2018), @NASAScienceAA
@NASA’s Vision for High Impact Science and Exploration -the importance of #HPC, #AI, and #ML technologies
https://sc24.supercomputing.org/program/keynote/
Countdown to #SC24: Keynote speaker on Nov 19 will be Dr. Nicola Fox (@SolarGirl2018), @NASAScienceAA
@NASA’s Vision for High Impact Science and Exploration -the importance of #HPC, #AI, and #ML technologies
https://sc24.supercomputing.org/program/keynote/
HPC Guru (Twitter)
Build on Trainium: @amazon invests $110 million to support #AI research at universities using #Trainium chips
AWS Trainium is the #ML chip that @awscloud built for the purposes of deep learning training and inference
https://www.aboutamazon.com/news/aws/amazon-trainium-investment-university-ai-research
Build on Trainium: @amazon invests $110 million to support #AI research at universities using #Trainium chips
AWS Trainium is the #ML chip that @awscloud built for the purposes of deep learning training and inference
https://www.aboutamazon.com/news/aws/amazon-trainium-investment-university-ai-research
X (formerly Twitter)
HPC Guru (@HPC_Guru) on X
Build on Trainium: @amazon invests $110 million to support #AI research at universities using #Trainium chips
AWS Trainium is the #ML chip that @awscloud built for the purposes of deep learning training and inference
https://t.co/T2pfDbbBOK
AWS Trainium is the #ML chip that @awscloud built for the purposes of deep learning training and inference
https://t.co/T2pfDbbBOK
HPC Guru (Twitter)
Engineers at NXP have developed a #ML algorithm that learns the patterns of test results and figures out the subset of tests that are really needed and those that they could safely do without
https://spectrum.ieee.org/semiconductor-testing
#AI via @IEEESpectrum
Engineers at NXP have developed a #ML algorithm that learns the patterns of test results and figures out the subset of tests that are really needed and those that they could safely do without
https://spectrum.ieee.org/semiconductor-testing
#AI via @IEEESpectrum
HPC Guru (Twitter)
RT @NVIDIAHPCDev: 🌌 Great to see everyone at #SC24. 🙌
We are excited to show how GPU cloud computing, high-performance networking, #ML, and quantum computing are transforming computer science and #AI ➡️ https://www.nvidia.com/en-us/events/supercomputing/?ncid=so-twit-670367
Don't miss our #generativeAI Super Heros of Supercomputing 🦸♀️ 🦸♂️ ✨
RT @NVIDIAHPCDev: 🌌 Great to see everyone at #SC24. 🙌
We are excited to show how GPU cloud computing, high-performance networking, #ML, and quantum computing are transforming computer science and #AI ➡️ https://www.nvidia.com/en-us/events/supercomputing/?ncid=so-twit-670367
Don't miss our #generativeAI Super Heros of Supercomputing 🦸♀️ 🦸♂️ ✨
HPC Guru (Twitter)
RT @DDNStorage: ✨ Day 2 at #SC24 was incredible! We had insightful conversations with customers, partners, and the media about driving innovation in AI & HPC.
These exchanges fuel our passion to push boundaries and deliver solutions that matter.
Let’s keep the momentum going! 💪
Want to chat? Book some time with our SMEs: https://www.ddn.com/sc24-meeting-registration
#AI #ArtificialIntelligence #ML #MachineLearning #LLMs #tech #data #DataStorage #DataCenters #DataAnalytics #innovation #SC24 @AlexbAlex @jswaroop
RT @DDNStorage: ✨ Day 2 at #SC24 was incredible! We had insightful conversations with customers, partners, and the media about driving innovation in AI & HPC.
These exchanges fuel our passion to push boundaries and deliver solutions that matter.
Let’s keep the momentum going! 💪
Want to chat? Book some time with our SMEs: https://www.ddn.com/sc24-meeting-registration
#AI #ArtificialIntelligence #ML #MachineLearning #LLMs #tech #data #DataStorage #DataCenters #DataAnalytics #innovation #SC24 @AlexbAlex @jswaroop
HPC Guru (Twitter)
Amazon, Apple and NVIDIA compete for #GPU / #ML talent with salaries (in some cases) of $300k or more
https://www.rdworldonline.com/amazon-apple-and-nvidia-compete-for-gpu-talent-with-salaries-of-300k-or-more/
#AI #HPC via @RandDWorld
Amazon, Apple and NVIDIA compete for #GPU / #ML talent with salaries (in some cases) of $300k or more
https://www.rdworldonline.com/amazon-apple-and-nvidia-compete-for-gpu-talent-with-salaries-of-300k-or-more/
#AI #HPC via @RandDWorld
HPC Guru (Twitter)
Ford’s David Kepczynski outlined how #HPC, #GPU acceleration, and #AI/#ML are reshaping the company’s approach to everything from safety, performance, and connected vehicles
https://www.nextplatform.com/2024/11/22/ford-lead-says-hpc-gpus-ai-are-keys-to-driving-progress/
#SC24 via @TheNextPlatform
Ford’s David Kepczynski outlined how #HPC, #GPU acceleration, and #AI/#ML are reshaping the company’s approach to everything from safety, performance, and connected vehicles
https://www.nextplatform.com/2024/11/22/ford-lead-says-hpc-gpus-ai-are-keys-to-driving-progress/
#SC24 via @TheNextPlatform
X (formerly Twitter)
HPC Guru (@HPC_Guru) on X
Ford’s David Kepczynski outlined how #HPC, #GPU acceleration, and #AI/#ML are reshaping the company’s approach to everything from safety, performance, and connected vehicles
https://t.co/Oa9bx4PDjq
#SC24 via @TheNextPlatform
https://t.co/Oa9bx4PDjq
#SC24 via @TheNextPlatform
HPC Guru (Twitter)
RT @EmadBarsoumPi: Want to try R1 on MI300X? We are keeping improving the performance at fast pace, in two weeks we went from 1472 tok/sec to 5921 tok/sec!!! #AMD #ROCm #ML #AI #sglang #AIG https://twitter.com/AnushElangovan/status/1893377685721858375#m
RT @EmadBarsoumPi: Want to try R1 on MI300X? We are keeping improving the performance at fast pace, in two weeks we went from 1472 tok/sec to 5921 tok/sec!!! #AMD #ROCm #ML #AI #sglang #AIG https://twitter.com/AnushElangovan/status/1893377685721858375#m
X (formerly Twitter)
Emad Barsoum (@EmadBarsoumPi) on X
Want to try R1 on MI300X? We are keeping improving the performance at fast pace, in two weeks we went from 1472 tok/sec to 5921 tok/sec!!! #AMD #ROCm #ML #AI #sglang #AIG