Nicohst commited on
Commit
4ca6659
1 Parent(s): 2d3261c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -6
README.md CHANGED
@@ -22,17 +22,15 @@ This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4b
22
 
23
  ## Model description
24
 
25
- More information needed
26
-
27
- ## Intended uses & limitations
28
-
29
- More information needed
30
 
31
  ## Training and evaluation data
32
 
33
- More information needed
34
 
35
  ## Training procedure
 
36
 
37
  ### Training hyperparameters
38
 
 
22
 
23
  ## Model description
24
 
25
+ The Model is trained on all successful episodes of the clembench-benchmark versions 0.9 and 1.0.
26
+ The Dataset contains approximately 3700 Successfully player episodes of all non-multi-modal games
 
 
 
27
 
28
  ## Training and evaluation data
29
 
30
+ Dataset: D10001
31
 
32
  ## Training procedure
33
+ One Episode QLoRa Finetuning with 4bit quantization
34
 
35
  ### Training hyperparameters
36