File size: 836 Bytes
ed08b45 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 |
---
license: cc0-1.0
datasets:
- Lichess/standard-chess-games
pipeline_tag: text2text-generation
tags:
- chess
---
# Model card for chessPT
A pretrained Decoder only transformer model for chess move prediction.
## Intended use
Predict new moves in a chess game based on PGN tokens.
## Implementation
The model implementation is based on Andrej Karpathy's [nanoGPT](https://github.com/karpathy/nanoGPT) following the webseries "Zero to Hero" on [youtube](https://www.youtube.com/playlist?list=PLAqhIrjkxbuWI23v9cThsA9GvCAUhRvKZ).
## Training
You can find the training script in the repositories files under `train.py`.
This also contains the used parameters
```python
context_size = 256
batch_size = 128
max_iters = 30_000
learning_rate = 3e-5
eval_interval = 100
eval_iters = 20
n_embed = 384
n_layer = 6
n_head = 6
dropout = 0.2
``` |