- The GPT-2 model was trained on the BookCorpus dataset for 60K steps.
- No positional embeddings were used (NoPE); see the usage sketch below.
- Here is the wandb report.
- This is for educational purposes only.
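Below is a minimal sketch of how a NoPE variant of GPT-2 can be set up with the `transformers` library: the learned position-embedding table is zeroed out and frozen so only token embeddings feed the transformer. This is an illustration under assumed defaults (stock `GPT2Config`), not the exact configuration or training code behind this checkpoint.

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# Illustrative default GPT-2 small config, not necessarily this model's settings.
config = GPT2Config()
model = GPT2LMHeadModel(config)

# Zero out and freeze the position-embedding table so no positional signal
# is added to the token embeddings (NoPE-style behavior).
with torch.no_grad():
    model.transformer.wpe.weight.zero_()
model.transformer.wpe.weight.requires_grad = False

# Forward pass: the model now relies on token embeddings alone.
input_ids = torch.randint(0, config.vocab_size, (1, 16))
logits = model(input_ids).logits
print(logits.shape)  # (1, 16, vocab_size)
```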