archit's picture

archit

archit11

AI & ML interests

small language models, looking for work please reachout [email protected]

Recent Activity

liked a model 8 days ago
NovaSearch/stella_en_400M_v5
liked a dataset 15 days ago
open-thoughts/OpenThoughts-114k
liked a dataset 20 days ago
simplescaling/s1K
View all activity

Organizations

Literally Me FRFR Research Society's profile picture Blog-explorers's profile picture ZeroGPU Explorers's profile picture IndiaBuild's profile picture Hugging Face Discord Community's profile picture

archit11's activity

upvoted an article 20 days ago
view article
Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

By Pclanglais β€’
β€’ 29
New activity in ubermenchh/SmolLM2-DPO 24 days ago

details pls

1
#1 opened 24 days ago by
archit11
upvoted an article 25 days ago
view article
Article

How to deploy and fine-tune DeepSeek models on AWS

β€’ 46
upvoted an article 27 days ago
view article
Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

By davanstrien β€’
β€’ 8
upvoted an article about 1 month ago
view article
Article

Train 400x faster Static Embedding Models with Sentence Transformers

β€’ 149