Andrija/SRoBERTa-base

---
datasets:
- oscar
- leipzig
language:
- hr
- sr
tags:
- masked-lm
widget:
- text: "Ovo je početak <mask>."
license: apache-2.0
---
# Transformer language model for Croatian and Serbian
Trained for two epochs on about 3 GB of Croatian and Serbian text drawn from the Leipzig Corpus and OSCAR datasets.

# Model and training data

| Model | #params | Arch. | Training data |
|--------------------|---------|-------|-----------------------------------------|
| `Andrija/SRoBERTa` | 80M | First | Leipzig Corpus and OSCAR (3 GB of text) |
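
The card tags the model as `masked-lm` and its widget uses the prompt "Ovo je početak <mask>.", so one plausible way to try it is the `fill-mask` pipeline from the `transformers` library. The sketch below is an illustration rather than the author's documented usage. It assumes the repository id is `Andrija/SRoBERTa-base` (taken from the page header; the table lists `Andrija/SRoBERTa`), that `<mask>` is the tokenizer's mask token as the widget suggests, and that `transformers` plus a backend such as PyTorch are installed. The helper name `fill_mask` is hypothetical.

```python
from typing import List, Tuple

# Assumption: repository id as shown in the page header, not confirmed by the README text.
MODEL_ID = "Andrija/SRoBERTa-base"
# Assumption: mask token as it appears in the card's widget example.
MASK_TOKEN = "<mask>"


def fill_mask(text: str, model_id: str = MODEL_ID, top_k: int = 5) -> List[Tuple[str, float]]:
    """Return up to top_k (token, score) candidates for the single masked position in text."""
    # Validate the prompt first so obvious mistakes fail before any model download.
    if text.count(MASK_TOKEN) != 1:
        raise ValueError(f"expected exactly one {MASK_TOKEN} in the input text")

    # Imported lazily so the validation above works without transformers installed.
    from transformers import pipeline

    # Downloads the model and tokenizer from the Hugging Face Hub on first use.
    unmasker = pipeline("fill-mask", model=model_id)
    results = unmasker(text, top_k=top_k)

    # For a single mask, each result is a dict with "token_str" and "score" keys.
    return [(r["token_str"], float(r["score"])) for r in results]
```

Calling `fill_mask("Ovo je početak <mask>.")` should return candidate completions with their scores. The actual tokens depend on the model weights, so none are shown here.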