HPC Guru (Twitter)
๐๐งต by @CerebrasSystems on #SlimPajama โ the largest deduplicated, multi-corpora, open-source, dataset for training LLMs
Cute mascot!
Cerebras built #SlimPajama to train a #LLaMA style model for partner @Opentensor
https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama
#AI #GenerativeAI
-----------
@CerebrasSystems:
๐ฃ New dataset drop!
Introducing SlimPajama-627B: the largest extensively deduplicated, multi-corpora, open-source dataset for training large language models. ๐งตhttps://t.co/bwsSz4d9hs https://t.co/3Ow1OAEcS9
๐๐งต by @CerebrasSystems on #SlimPajama โ the largest deduplicated, multi-corpora, open-source, dataset for training LLMs
Cute mascot!
Cerebras built #SlimPajama to train a #LLaMA style model for partner @Opentensor
https://www.cerebras.net/blog/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama
#AI #GenerativeAI
-----------
@CerebrasSystems:
๐ฃ New dataset drop!
Introducing SlimPajama-627B: the largest extensively deduplicated, multi-corpora, open-source dataset for training large language models. ๐งตhttps://t.co/bwsSz4d9hs https://t.co/3Ow1OAEcS9