SlimPajama: A 627B token cleaned and deduplicated version of RedPajama
www.cerebras.net SlimPajama: A 627B token, cleaned and deduplicated version of RedPajama - Cerebras
Cerebras has built a platform for push-button training of large language models that can accelerate time to insights without having to orchestrate across a large cluster of small devices.
Sounds like Cerebras will soon be training a model on this dataset, one that will likely rival the OpenLLaMA and RedPajama models. Thoughts?