Hugging Face
Jia Guo
Jiiiia
2 followers · 6 following
AI & ML interests
None yet
Recent Activity
liked a model 6 days ago
inclusionAI/Ling-lite
upvoted a paper 11 months ago
Sailor: Open Language Models for South-East Asia
reacted to SivilTaram's post 11 months ago
Sailor: A New Multilingual Open LLM for South-East Asia

Last month we released a new family of multilingual language models called **Sailor**, ranging from 0.5B to 7B parameters and continually pre-trained from the Qwen1.5 models. Based on our extensive benchmarking, the Sailor models demonstrate exceptional performance on South-East Asian languages, taking us one step closer to multilingual LLMs that can serve the diverse needs of the region and beyond. Today, we're excited to share the key technical details behind the Sailor models!

**Key highlights**:
- Data curation: merging short examples, document-level code-switching, aggressive data cleaning and deduplication.
- Tokenization robustness: we find that BPE dropout is very effective for dealing with prompt variations.
- Optimizing the data mixture: we propose a new approach to automatically balance capabilities across different languages.
- Recipe for continual pre-training: we discover a powerful metric that can help predict how well the Sailor models will perform on the original domain (e.g., English) after continual pre-training.

We are thrilled to share these technical details with the community and invite you to explore the Sailor models. We hope the Sailor models take us one step closer to multilingual LLMs for the world! To learn more, please read our research paper or reach out to our team.

Paper: https://huggingface.co/papers/2404.03608
Model: https://huggingface.co/collections/sail/sailor-language-models-65e19a749f978976f1959825
Code: https://github.com/sail-sg/sailor-llm
Organizations
Jiiiia's activity
liked a model 6 days ago
inclusionAI/Ling-lite • Updated 3 days ago • 41 • 8
liked 3 models 12 months ago
sail/Sailor-1.8B • Text Generation • Updated Dec 21, 2024 • 213 • 8
sail/Sailor-4B • Text Generation • Updated Dec 21, 2024 • 210 • 6
sail/Sailor-7B • Text Generation • Updated Dec 21, 2024 • 238 • 28
liked a model about 1 year ago
sail/Sailor-0.5B • Text Generation • Updated Dec 21, 2024 • 239 • 9