SmolLM baselines trained from scratch
This is a model with the same specifications as SmolLM2-135M, trained from scratch on the Icelandic portion of FineWeb-2. It is intended as a baseline for my research and is probably rather bad for most purposes :)
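As a rough illustration of what "the same specifications as SmolLM2-135M" means in terms of size, the sketch below counts the parameters of a Llama-style decoder. The hyperparameters are my reading of the published SmolLM2-135M configuration (hidden size 576, 30 layers, 9 attention heads, 3 key/value heads, MLP size 1536, vocabulary 49152, tied input/output embeddings) and are assumptions here; the function name `count_params` is hypothetical, and this model may differ, for example if it uses a different tokenizer.

```python
def count_params(
    hidden=576,        # assumed hidden size
    layers=30,         # assumed number of decoder layers
    heads=9,           # assumed number of attention heads
    kv_heads=3,        # assumed number of key/value heads (grouped-query attention)
    mlp=1536,          # assumed MLP intermediate size
    vocab=49152,       # assumed vocabulary size
    tied=True,         # assumed: output head shares the embedding matrix
):
    """Count parameters of a Llama-style decoder without bias terms."""
    head_dim = hidden // heads
    kv_dim = kv_heads * head_dim

    embed = vocab * hidden
    # q_proj and o_proj are hidden x hidden; k_proj and v_proj are hidden x kv_dim
    attn = 2 * hidden * hidden + 2 * hidden * kv_dim
    # gate_proj, up_proj, and down_proj
    ffn = 3 * hidden * mlp
    # two RMSNorm weight vectors per layer
    norms = 2 * hidden

    total = embed + layers * (attn + ffn + norms) + hidden  # + final norm
    if not tied:
        total += vocab * hidden  # separate output projection
    return total


if __name__ == "__main__":
    print(f"{count_params():,} parameters")
```

Under these assumptions the count comes out at roughly 135M, with about a fifth of it in the embedding matrix, so a tokenizer with a different vocabulary size would shift the total noticeably.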
Training: