SzegedAI/hubertusz-tiny-wiki-seq128


hubert-tiny-wiki-seq128

The fully trained model, which includes the second phase of training, is available here: SzegedAI/hubert-tiny-wiki

This model was trained from scratch on the Wikipedia subset of Hungarian Webcorpus 2.0 with the masked language modeling (MLM) and sentence order prediction (SOP) objectives.
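As a rough illustration of how such a checkpoint might be used, the sketch below loads the encoder with the Transformers Auto classes and returns hidden states for one sentence. It assumes the repository ships TensorFlow weights and a tokenizer, which this card does not confirm. The helper names `load_encoder` and `embed` and the sample sentence are hypothetical, and the pre-training heads (MLM, SOP) are not loaded by `TFAutoModel`.

```python
MODEL_ID = "SzegedAI/hubertusz-tiny-wiki-seq128"  # repository id from this page
MAX_LENGTH = 128  # pre-training sequence length stated in this card


def load_encoder(model_id: str = MODEL_ID):
    """Download the tokenizer and TensorFlow encoder (requires network access).

    Assumption: the repository contains TensorFlow weights and tokenizer files.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoTokenizer, TFAutoModel

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = TFAutoModel.from_pretrained(model_id)
    return tokenizer, model


def embed(text: str, tokenizer, model):
    """Return the last hidden states for one text, truncated to MAX_LENGTH tokens."""
    inputs = tokenizer(
        text,
        truncation=True,
        max_length=MAX_LENGTH,
        return_tensors="tf",
    )
    return model(**inputs).last_hidden_state


if __name__ == "__main__":
    tok, enc = load_encoder()
    hidden = embed("Szeged a Tisza partján fekszik.", tok, enc)
    print(hidden.shape)  # (1, number_of_tokens, hidden_size)
```

Truncating to 128 tokens keeps inputs within the length seen during this training phase; longer inputs are probably better served by the fully trained model.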

Pre-Training Parameters:

Training steps: 500,000
Sequence length: 128 (the model supports sequences of up to 512 tokens)
Batch size: 1024
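To put these parameters in perspective, the short calculation below derives the number of sequences and token positions processed in this phase. It is only an upper-bound sketch: it assumes every step consumes one full batch of 1024 sequences, each padded or packed to exactly 128 tokens, which the card does not state. The function names are illustrative.

```python
TRAINING_STEPS = 500_000
SEQUENCE_LENGTH = 128
BATCH_SIZE = 1024


def tokens_per_step(batch_size: int, sequence_length: int) -> int:
    """Token positions in one batch, counting padding as tokens."""
    return batch_size * sequence_length


def total_sequences(steps: int, batch_size: int) -> int:
    """Sequences processed over the whole run (with repetition across epochs)."""
    return steps * batch_size


def total_tokens(steps: int, batch_size: int, sequence_length: int) -> int:
    """Upper bound on token positions processed over the whole run."""
    return steps * tokens_per_step(batch_size, sequence_length)


if __name__ == "__main__":
    print(tokens_per_step(BATCH_SIZE, SEQUENCE_LENGTH))               # 131072
    print(total_sequences(TRAINING_STEPS, BATCH_SIZE))                # 512000000
    print(total_tokens(TRAINING_STEPS, BATCH_SIZE, SEQUENCE_LENGTH))  # 65536000000
```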

Framework versions

Transformers 4.21.3
TensorFlow 2.10.0
Datasets 2.4.0
Tokenizers 0.12.1

Acknowledgement
