Model save

Browse files

Files changed (3) hide show

README.md +78 -0
model.safetensors +1 -1
runs/Dec26_22-13-46_srvrocgpu011.uct.ac.za/events.out.tfevents.1735244097.srvrocgpu011.uct.ac.za +2 -2

README.md ADDED Viewed

	@@ -0,0 +1,78 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: facebook/wav2vec2-xls-r-1b
+tags:
+- generated_from_trainer
+metrics:
+- wer
+model-index:
+- name: xls-r-1b-bigcgen-combined-5hrs
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# xls-r-1b-bigcgen-combined-5hrs
+This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.6944
+- Wer: 0.6600
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 3e-05
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 8
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 500
+- num_epochs: 30.0
+### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer    |
+|:-------------:|:------:|:----:|:---------------:|:------:|
+| No log        | 0.4228 | 100  | 3.7787          | 1.0    |
+| No log        | 0.8457 | 200  | 2.8642          | 1.0    |
+| No log        | 1.2664 | 300  | 1.1326          | 0.9986 |
+| No log        | 1.6892 | 400  | 0.8747          | 0.8281 |
+| 5.818         | 2.1099 | 500  | 0.7865          | 0.8027 |
+| 5.818         | 2.5328 | 600  | 0.6710          | 0.6954 |
+| 5.818         | 2.9556 | 700  | 0.7234          | 0.7939 |
+| 5.818         | 3.3763 | 800  | 0.6657          | 0.6706 |
+| 5.818         | 3.7992 | 900  | 0.6836          | 0.7246 |
+| 1.2021        | 4.2199 | 1000 | 0.6894          | 0.6897 |
+| 1.2021        | 4.6427 | 1100 | 0.6464          | 0.6642 |
+| 1.2021        | 5.0634 | 1200 | 0.6663          | 0.6777 |
+| 1.2021        | 5.4863 | 1300 | 0.6701          | 0.6704 |
+| 1.2021        | 5.9091 | 1400 | 0.6834          | 0.6899 |
+| 0.8307        | 6.3298 | 1500 | 0.6944          | 0.6600 |
+### Framework versions
+- Transformers 4.47.1
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5ffb6fcda29b32e0019f29eea2a9aa934a2f5c077bb59433f281cdb98fc51507
 size 3850239596

 version https://git-lfs.github.com/spec/v1
+oid sha256:cb95d0b6acf612f3ec328f238417ebc2d72b2c366627e97cfdef46b345fb4f4a
 size 3850239596

runs/Dec26_22-13-46_srvrocgpu011.uct.ac.za/events.out.tfevents.1735244097.srvrocgpu011.uct.ac.za CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:301179d5ef4635dc5bed96452e129485c9a23780c823042b18846ba832ebaaf6
-size 10485

 version https://git-lfs.github.com/spec/v1
+oid sha256:b33706c790a1fdff41c36ae730645ed15ec2ae87b22cf99047065b6439d472a3
+size 12322