Datasets and checkpoints from the paper LlamaTales: Studying the Effects of Developmentally Inspired Training Data on Small Language Models
Ivan Lee
ivnle
AI & ML interests
None yet
Organizations
Collections
1
models
24
ivnle/tinystories-lay4-hs384-hd6-9M
Text Generation
•
Updated
•
4
ivnle/tinystories-lay8-hs384-hd6-18M
Text Generation
•
Updated
•
4
ivnle/tinystories-lay8-hs512-hd8-33M
Text Generation
•
Updated
•
4
ivnle/llamatales_jr_8b-lay1-hs128-hd2-262K
Text Generation
•
Updated
•
4
ivnle/llamatales_jr_8b-lay2-hs128-hd2-524K
Text Generation
•
Updated
•
4
ivnle/llamatales_jr_8b-lay4-hs128-hd2-1M
Text Generation
•
Updated
•
4
ivnle/llamatales_jr_8b-lay4-hs384-hd6-9M
Text Generation
•
Updated
•
4
ivnle/llamatales_jr_8b-lay8-hs384-hd6-18M
Text Generation
•
Updated
•
8
ivnle/llamatales_jr_8b-lay8-hs512-hd8-33M
Text Generation
•
Updated
•
8
ivnle/fineweb-lay1-hs128-hd2-262K
Text Generation
•
Updated
•
4
datasets
6
ivnle/llamatales-jr-70b
Updated
•
2
ivnle/llamatales-gre-70b
Viewer
•
Updated
•
2M
•
4
ivnle/fineweb
Viewer
•
Updated
•
2.03M
•
46
ivnle/tinystories
Viewer
•
Updated
•
4.97M
•
30
ivnle/llamatales-jr
Viewer
•
Updated
•
3.59M
•
24
ivnle/llamatales-gre
Viewer
•
Updated
•
2.02M
•
39