End of training

Files changed (7)

README.md ADDED

+---
+tags:
+- generated_from_trainer
+model-index:
+- name: GPT_Neo_llmcs
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# GPT_Neo_llmcs
+This model was trained from scratch on an unknown dataset.
+It achieves the following results on the evaluation set:
+- eval_loss: 1.8919
+- eval_runtime: 626.193
+- eval_samples_per_second: 61.805
+- eval_steps_per_second: 7.726
+- step: 0
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 3.0
+### Framework versions
+- Transformers 4.39.3
+- Pytorch 2.2.2
+- Datasets 2.18.0
+- Tokenizers 0.15.1
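
The evaluation figures in the model card above can be cross-checked with a little arithmetic. The sketch below is a sanity check under two assumptions that the card does not state: that `eval_loss` is a mean per-token cross-entropy in nats (so its exponential is a perplexity), and that evaluation ran on a single device, so `eval_batch_size` is the global batch size. The variable names are illustrative only.

```python
import math

# Values copied from the model card in this commit.
eval_loss = 1.8919
eval_runtime = 626.193            # seconds
eval_samples_per_second = 61.805
eval_steps_per_second = 7.726
eval_batch_size = 8

# Assumption: eval_loss is mean per-token cross-entropy in nats,
# so exp(loss) can be read as a perplexity.
perplexity = math.exp(eval_loss)

# Throughput times runtime recovers the approximate evaluation set size
# and the number of evaluation steps.
approx_samples = round(eval_runtime * eval_samples_per_second)
approx_steps = round(eval_runtime * eval_steps_per_second)

# Assumption: a single device, so each step consumes eval_batch_size samples.
expected_steps = math.ceil(approx_samples / eval_batch_size)

print(f"perplexity (assuming nats): {perplexity:.2f}")
print(f"approx. eval samples: {approx_samples}")
print(f"approx. eval steps: {approx_steps} (expected from batch size: {expected_steps})")
```

If the two step counts agree to within rounding, the reported throughput figures are at least internally consistent with a batch size of 8. Note that the card reports `step: 0`, so these numbers describe an evaluation pass rather than a point along a training curve.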

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "EleutherAI/gpt-neo-125m",
+  "_name_or_path": "GPT_Neo_llmcs/checkpoint-62000",
   "activation_function": "gelu_new",
   "architectures": [
     "GPTNeoForCausalLM"

generation_config.json ADDED

+{
+  "_from_model_config": true,
+  "bos_token_id": 50256,
+  "eos_token_id": 50256,
+  "transformers_version": "4.39.3"
+}

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1d40b01fdd3b82539498cfdd20908bfd1a4d535bfe0a2ba395a1f4bd2389b2c
+oid sha256:fc7cdf75b5059f296dcc621ba39b628a3ff0b4bc2a389fcf64c9fc2c854d86f5
 size 500811336
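
The `model.safetensors` entry is a Git LFS pointer file, not the weights themselves: it records the spec version, the SHA-256 of the real file, and its size in bytes. As a minimal sketch of how to read this hunk, the snippet below parses the old and new pointers and compares them. The helper `parse_lfs_pointer` is a hypothetical name introduced here for illustration and is not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is given in bytes
    return fields


# Pointer contents copied from the two sides of the diff.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:a1d40b01fdd3b82539498cfdd20908bfd1a4d535bfe0a2ba395a1f4bd2389b2c
size 500811336
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:fc7cdf75b5059f296dcc621ba39b628a3ff0b4bc2a389fcf64c9fc2c854d86f5
size 500811336
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

weights_changed = old["oid"] != new["oid"]
same_size = old["size"] == new["size"]

print(f"content hash changed: {weights_changed}")
print(f"file size unchanged: {same_size} ({new['size']} bytes)")
```

A changed hash with an identical byte size is what one would expect when the tensor shapes and dtypes stay the same and only the weight values differ, although the pointer alone cannot confirm that.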

runs/Apr14_17-18-09_capycoony-m16/events.out.tfevents.1713086291.capycoony-m16.2220.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:985b65ba5365d49b2429ca453aecd6cd31da7460d432920404d219182633018e
-size 9406
+oid sha256:e515de73b5ef7cd0f7a48a59d7cd1b50220a03609daf77ea6aa53b8fae28624b
+size 32695

runs/Apr15_09-04-32_capycoony-m16/events.out.tfevents.1713143701.capycoony-m16.20472.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:627436c972ca09ce144083d850831b0a176a14d60e013da0896244a642417f95
+size 249

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7c79ea4ef0e1e7d1908ff537f3fb0941576b5457fea198195c427499a39f83b0
+oid sha256:6eb0ccd89899ad5de0771147e48b98f60e01f5f3f25ed075dd37f2f17e519a4b
 size 4920