I created a Capybara-inspired Italian dataset by translating the initial instructions and running them through a pipeline to generate the conversations. I used Claude Sonnet for translation and instruction generation, and Claude Opus for generating the answers.
I hope this dataset proves useful for people working on 🇮🇹 language models.
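A minimal sketch of the two-stage pipeline described above, using the Anthropic Python SDK. The model snapshot IDs, prompts, and seed instruction are illustrative assumptions, not the actual pipeline code:

```python
# Two-stage sketch: Sonnet translates the seed instruction, Opus answers it.
# Model IDs and prompts are assumptions for illustration only.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

def translate_instruction(instruction_en: str) -> str:
    """Stage 1: translate the seed instruction into Italian with Claude Sonnet."""
    response = client.messages.create(
        model="claude-3-sonnet-20240229",  # assumed Sonnet snapshot
        max_tokens=1024,
        messages=[{
            "role": "user",
            "content": f"Translate the following instruction into Italian, "
                       f"preserving its intent:\n\n{instruction_en}",
        }],
    )
    return response.content[0].text

def generate_answer(instruction_it: str) -> str:
    """Stage 2: generate the Italian answer with Claude Opus."""
    response = client.messages.create(
        model="claude-3-opus-20240229",  # assumed Opus snapshot
        max_tokens=2048,
        messages=[{"role": "user", "content": instruction_it}],
    )
    return response.content[0].text

seed = "Explain the difference between a list and a tuple in Python."
print(generate_answer(translate_instruction(seed)))
```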
@mik3ml just released ReDiX/wikipediaQA-ita, an interesting synthetic dataset generated from Wikipedia using a version of Mistral-7B fine-tuned specifically for the Italian language 🇮🇹.
While evaluating fine-tuned 7B Italian open-source LLMs I have collected many data points and put together a very simple exploratory analysis (a toy sketch follows the list below). My hypotheses, based on the data, are:
- MMLU is hard to improve when fine-tuning a base model on a different language.
- Fine-tuning, even on a single GPU, can improve the base model by 5% to 10% on common tasks, and by much more on specific cases given the right training time and data.
- Fine-tuning can specialize a model well, but at the cost of losing some foundational knowledge.
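As a toy version of that exploratory analysis, here is how the relative gains could be computed with pandas. The scores below are entirely hypothetical placeholders, not my actual data points:

```python
# Toy exploratory analysis: relative gain of fine-tuned models over the base.
# All score values are hypothetical, used only to show the computation.
import pandas as pd

scores = pd.DataFrame({
    "model": ["base-7b", "ft-7b-general", "ft-7b-specialized"],
    "mmlu_it": [0.42, 0.43, 0.40],        # hypothetical: MMLU barely moves
    "task_specific": [0.35, 0.39, 0.52],  # hypothetical: large gain when specialized
})

base = scores.iloc[0]  # the base model is the reference row
for col in ["mmlu_it", "task_specific"]:
    scores[f"{col}_gain_%"] = 100 * (scores[col] - base[col]) / base[col]

print(scores.to_string(index=False))
```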
Based on the work of @mrinaldi and @ruggsea, we just released the biggest ready-for-training conversational dataset based on Usenet data in the Italian language 🇮🇹. It contains about 9 million conversations between real humans.
It is based on lm-evaluation-harness and at the moment focuses mainly on 7-billion-parameter models. In the coming weeks we will add more models. If you have suggestions or need explanations, join our community Discord: https://discord.gg/a26cRkBCNH
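For reference, a hedged sketch of how a score like those above could be produced with the lm-evaluation-harness Python API (v0.4+). The model ID and the Italian task names are assumptions; check the task list shipped with your install for what is actually available:

```python
# Sketch of an evaluation run with lm-evaluation-harness (v0.4+ Python API).
# Model ID and task names are assumptions, not the leaderboard's exact config.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=mistralai/Mistral-7B-v0.1",  # assumed 7B model
    tasks=["xcopa_it", "belebele_ita_Latn"],            # assumed Italian tasks
    batch_size=8,
)
print(results["results"])
```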
The dataset contributes to the https://huggingface.co/mii-community project, aimed at advancing the creation of Italian open-source Large Language Models (LLMs). 🇮🇹 🤗 At about 10-20 billion tokens, it is probably the best conversational open-source dataset in the Italian language. 🇮🇹
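A dataset of this size is best consumed in streaming mode. A minimal sketch with the `datasets` library follows; the repository ID is an assumption, so browse the mii-community organization on the Hub for the actual dataset name:

```python
# Streaming sketch: inspect a few Usenet conversations without a full download.
# The repo ID below is a guess; replace it with the real dataset name.
from datasets import load_dataset

ds = load_dataset("mii-community/UsenetArchiveIT", split="train", streaming=True)
for conversation in ds.take(3):
    print(conversation)
```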