This is a GPT2
model uploaded using the KerasNLP library and can be used with JAX, TensorFlow, and PyTorch backends.
This model is related to a CausalLM
task.
Model config:
- name: gpt2_backbone
- trainable: True
- vocabulary_size: 50257
- num_layers: 12
- num_heads: 12
- hidden_dim: 768
- intermediate_dim: 3072
- dropout: 0.1
- max_sequence_length: 1024
This model card has been generated automatically and should be completed by the model author. See Model Cards documentation for more information.
- Downloads last month
- 16
Inference API (serverless) does not yet support keras-hub models for this pipeline type.