Model Card for karpathy/gpt2_1558M_final2_hf

This is a GPT-2 model trained with llm.c for 32K steps, at a batch size of 1M tokens per step, on FineWeb-EDU.
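
The step count and batch size above imply a rough training-token budget. The sketch below works through that arithmetic. It assumes "32K" means 32,000 steps and that the 1M batch size is counted in tokens and equals 2**20; neither detail is spelled out in this card, so treat the result as an estimate (about 33.6 billion tokens under these assumptions).

```python
# Rough training-token budget implied by the card.
# Assumptions (not stated explicitly in the card):
#   - "32K steps" is taken as 32,000 optimizer steps
#   - "1M batch size" is taken as 2**20 = 1,048,576 tokens per step
STEPS = 32_000
TOKENS_PER_STEP = 2**20


def total_training_tokens(steps: int = STEPS,
                          tokens_per_step: int = TOKENS_PER_STEP) -> int:
    """Return the number of tokens processed over the whole run."""
    return steps * tokens_per_step


total_tokens = total_training_tokens()
print(f"{total_tokens:,} tokens (~{total_tokens / 1e9:.1f}B)")
```

If the actual run used slightly different values, pass them to `total_training_tokens` directly.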

Much more detail is available here: https://github.com/karpathy/llm.c/discussions/677

Bias, Risks, and Limitations

Eagerly generates disinformation about English-speaking unicorns in the Andes mountains.

Model size: 1.56B params (Safetensors)
Tensor type: BF16
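
The parameter count and tensor type together give a rough lower bound on the memory needed just to hold the weights. The sketch below is a back-of-the-envelope estimate, assuming the rounded figure of 1.56B parameters and 2 bytes per BF16 value. It ignores activations, the KV cache, and any framework overhead, so real usage will be higher.

```python
# Back-of-the-envelope size of the weights alone.
# Assumptions: 1.56B is the rounded parameter count reported on the card,
# and every parameter is stored as BF16 (2 bytes each).
NUM_PARAMS = 1_560_000_000
BYTES_PER_BF16 = 2


def weight_bytes(num_params: int = NUM_PARAMS,
                 bytes_per_param: int = BYTES_PER_BF16) -> int:
    """Return the number of bytes needed to store the weights."""
    return num_params * bytes_per_param


size_bytes = weight_bytes()
print(f"{size_bytes / 1e9:.2f} GB ({size_bytes / 2**30:.2f} GiB)")
```

Loading the same weights in FP32 would double this figure, which `weight_bytes(bytes_per_param=4)` illustrates.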
