fpadovani/french_childes_sent_42

Generated from Trainer

childes_mlm_sent

This model is a fine-tuned version of an unspecified base model, trained on an unknown dataset. It achieves the following results on the evaluation set:

Loss: 3.0607
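
The evaluation loss can also be read as a perplexity, which is often easier to compare across models. A minimal sketch, assuming the reported loss is a mean per-token cross-entropy in nats (the usual convention for Trainer language-modeling losses, but not stated in this card):

```python
import math

# Evaluation loss reported in this card.
eval_loss = 3.0607

# Assumption: the loss is a mean per-token cross-entropy measured in nats,
# so perplexity is exp(loss).
perplexity = math.exp(eval_loss)

print(f"perplexity ~ {perplexity:.1f}")
```

Under that assumption the value comes out at roughly 21.3. If the model is a masked language model, as the name childes_mlm_sent suggests, the figure would cover only the masked positions.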

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5e-05
train_batch_size: 16
eval_batch_size: 16
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 100000
training_steps: 400000
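
The relationship between these values can be checked with a few lines of plain Python. The sketch below assumes a single training device (which is what makes 16 x 2 = 32). It also assumes the linear scheduler ramps the learning rate from 0 to its peak over the warmup steps, then decays it linearly to 0 at the final step. `lr_at_step` and `num_devices` are hypothetical names written for illustration, not part of any library.

```python
# Hyperparameters copied from the list above.
learning_rate = 5e-05
train_batch_size = 16
gradient_accumulation_steps = 2
warmup_steps = 100_000
training_steps = 400_000

# Assumption: one device, so the effective batch is batch size x accumulation.
num_devices = 1
total_train_batch_size = train_batch_size * gradient_accumulation_steps * num_devices


def lr_at_step(step: int) -> float:
    """Hypothetical helper: linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        return learning_rate * step / max(1, warmup_steps)
    remaining = max(0, training_steps - step)
    return learning_rate * remaining / max(1, training_steps - warmup_steps)


print(total_train_batch_size)  # effective batch size
for step in (2_000, 38_000, 100_000, 250_000, 400_000):
    print(step, lr_at_step(step))
```

Under these assumptions the peak rate of 5e-05 is only reached at step 100000. At step 38000, the last step logged in the training results, the rate would still be in warmup at about 1.9e-05.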

Training results

Training Loss	Epoch	Step	Validation Loss
7.2398	0.2170	2000	5.5312
5.1243	0.4340	4000	4.7005
4.5492	0.6510	6000	4.2433
4.1988	0.8680	8000	3.9411
3.8698	1.0850	10000	3.7078
3.6366	1.3020	12000	3.5168
3.5082	1.5190	14000	3.3858
3.4068	1.7360	16000	3.2682
3.3061	1.9530	18000	3.2611
3.2313	2.1700	20000	3.1608
3.2031	2.3870	22000	3.1178
3.1597	2.6040	24000	3.0884
3.1589	2.8210	26000	3.0969
3.1130	3.0380	28000	3.0653
3.1082	3.2550	30000	3.0646
3.1023	3.4720	32000	3.0227
3.0771	3.6890	34000	3.0531
3.1092	3.9060	36000	3.0444
3.0432	4.1230	38000	3.0607
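
The table also allows a few quantities to be derived that the card does not state directly. A minimal sketch, with the rows transcribed from the table above; the estimate of sequences per epoch assumes every optimizer step consumes the full effective batch of 32 sequences:

```python
# (training loss, epoch, step, validation loss), transcribed from the table.
rows = [
    (7.2398, 0.2170, 2000, 5.5312),
    (5.1243, 0.4340, 4000, 4.7005),
    (4.5492, 0.6510, 6000, 4.2433),
    (4.1988, 0.8680, 8000, 3.9411),
    (3.8698, 1.0850, 10000, 3.7078),
    (3.6366, 1.3020, 12000, 3.5168),
    (3.5082, 1.5190, 14000, 3.3858),
    (3.4068, 1.7360, 16000, 3.2682),
    (3.3061, 1.9530, 18000, 3.2611),
    (3.2313, 2.1700, 20000, 3.1608),
    (3.2031, 2.3870, 22000, 3.1178),
    (3.1597, 2.6040, 24000, 3.0884),
    (3.1589, 2.8210, 26000, 3.0969),
    (3.113, 3.0380, 28000, 3.0653),
    (3.1082, 3.2550, 30000, 3.0646),
    (3.1023, 3.4720, 32000, 3.0227),
    (3.0771, 3.6890, 34000, 3.0531),
    (3.1092, 3.9060, 36000, 3.0444),
    (3.0432, 4.1230, 38000, 3.0607),
]

# Logged evaluation point with the lowest validation loss.
best = min(rows, key=lambda r: r[3])
best_step, best_val_loss = best[2], best[3]

# Steps per epoch, estimated from the last logged row.
_, last_epoch, last_step, _ = rows[-1]
steps_per_epoch = last_step / last_epoch

# Assumption: each step processes total_train_batch_size = 32 sequences.
sequences_per_epoch = steps_per_epoch * 32

print(best_step, best_val_loss)
print(round(steps_per_epoch), round(sequences_per_epoch))
```

The lowest validation loss in the table is 3.0227 at step 32000, slightly below the 3.0607 at the last logged step. One epoch works out to roughly 9,200 steps, or about 295,000 sequences under the stated assumption.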

Framework versions

Transformers 4.45.2
Pytorch 2.5.1+cu124
Datasets 3.0.1
Tokenizers 0.20.1

Safetensors

Model size: 10.7M params
Tensor type: F32
