Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1.9
TFLOPS
2
10
120
Leonard Püttmann
PRO
puettmann
Follow
anakin87's profile picture
eugrug-60's profile picture
HenrikWenck's profile picture
5 followers
·
7 following
leonard-puettmann
AI & ML interests
None yet
Recent Activity
reacted
to
anakin87
's
post
with 👍
3 days ago
𝐍𝐞𝐰 𝐈𝐭𝐚𝐥𝐢𝐚𝐧 𝐒𝐦𝐚𝐥𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬: 𝐆𝐞𝐦𝐦𝐚 𝐍𝐞𝐨𝐠𝐞𝐧𝐞𝐬𝐢𝐬 𝐜𝐨𝐥𝐥𝐞𝐜𝐭𝐢𝐨𝐧 💎🌍🇮🇹 I am happy to release two new language models for the Italian Language! 💪 Gemma 2 9B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-9b-neogenesis-ita Building on the impressive work by VAGO Solutions, I applied Direct Preference Optimization with a mix of Italian and English data. Using Spectrum, I trained 20% of model layers. 📊 Evaluated on the Open ITA LLM leaderboard (https://huggingface.co/spaces/mii-llm/open_ita_llm_leaderboard), this model achieves strong performance. To beat it on this benchmark, you'd need a 27B model 😎 🤏 Gemma 2 2B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-2b-neogenesis-ita This smaller variant is fine-tuned from the original Gemma 2 2B it by Google. Through a combination of Supervised Fine-Tuning and Direct Preference Optimization, I trained 25% of the layers using Spectrum. 📈 Compared to the original model, it shows improved Italian proficiency, good for its small size. Both models were developed during the recent #gemma competition on Kaggle. 📓 Training code: https://www.kaggle.com/code/anakin87/post-training-gemma-for-italian-and-beyond 🙏 Thanks @FinancialSupport and mii-llm for the help during evaluation.
liked
a model
7 days ago
anakin87/gemma-2-2b-neogenesis-ita
liked
a model
7 days ago
anakin87/gemma-2-9b-neogenesis-ita
View all activity
Organizations
puettmann
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
2 models
7 days ago
anakin87/gemma-2-2b-neogenesis-ita
Text Generation
•
Updated
8 days ago
•
1.25k
•
5
anakin87/gemma-2-9b-neogenesis-ita
Text Generation
•
Updated
8 days ago
•
1.47k
•
7
liked
2 models
8 days ago
naist-nlp/mitre_466m
Translation
•
Updated
18 days ago
•
846
•
13
hexgrad/Kokoro-82M
Text-to-Speech
•
Updated
40 minutes ago
•
33.9k
•
2.34k
liked
a model
11 days ago
LSX-UniWue/LLaMmlein_1B_chat_selected
Updated
11 days ago
•
104
•
1
liked
3 models
12 days ago
microsoft/Phi-3-mini-4k-instruct
Text Generation
•
Updated
Sep 20, 2024
•
792k
•
•
1.12k
Aleph-Alpha/Pharia-1-Embedding-4608-control-hf
Updated
Dec 20, 2024
•
77
•
1
microsoft/phi-4
Text Generation
•
Updated
16 days ago
•
182k
•
1.54k
liked
a Space
13 days ago
Running
43
⚡
Phi-3.5 WebGPU
A powerful AI chatbot that runs locally in your browser
liked
a model
15 days ago
microsoft/Phi-3.5-mini-instruct
Text Generation
•
Updated
Sep 18, 2024
•
917k
•
•
767
liked
a model
20 days ago
google/t5-efficient-tiny
Text2Text Generation
•
Updated
Jan 24, 2023
•
15k
•
21
liked
a Space
25 days ago
Running
31
🚀
Open Translate
liked
a model
26 days ago
Helsinki-NLP/opus-mt-en-it
Translation
•
Updated
Aug 16, 2023
•
178k
•
17
liked
5 models
about 1 month ago
bigscience/mt0-small
Text2Text Generation
•
Updated
Sep 26, 2023
•
8.96k
•
28
DeepMount00/Alireo-400m-instruct-v0.1
Text Generation
•
Updated
Dec 17, 2024
•
2.8k
•
13
gsarti/it5-small
Text2Text Generation
•
Updated
Jun 17, 2024
•
485
•
2
DeepMount00/Llama-3-8b-Ita
Text Generation
•
Updated
Aug 13, 2024
•
204k
•
24
mii-llm/maestrale-chat-v0.4-beta
Text Generation
•
Updated
Jun 6, 2024
•
4.99k
•
5
liked
2 datasets
about 1 month ago
gsarti/clean_mc4_it
Updated
Jun 17, 2024
•
217
•
14
kaitchup/opus-Italian-to-English
Viewer
•
Updated
Nov 1, 2023
•
962k
•
62
•
1
Load more