AI & ML interests

LLMs, NLP, Alignment, DPO, RLHF, data labeling, text-classification, text-generation, token-classification

Recent Activity

argilla's activity

burtenshaw 
posted an update 6 days ago
People are flexing their end-of-year stats, so I made this app to show Hub stats in a tidy design!

Thanks @Ameeeee and @jfcalvo for the feature from Argilla!
burtenshaw/recap
davidberenstein1957 
posted an update 6 days ago
nataliaElv 
posted an update 8 days ago
If you are still wondering how the FineWeb2 annotations are done, how to follow the guidelines, or how Argilla works, this is your video!

I go through a few samples of the FineWeb2 dataset and classify them based on their educational content. Check it out!

https://www.youtube.com/watch?v=_-ORB4WAVGU
davidberenstein1957 
posted an update 9 days ago
Introducing the Synthetic Data Generator, a user-friendly application that takes a no-code approach to creating custom datasets with Large Language Models (LLMs). The best part: a simple step-by-step process that lets anyone create datasets and models in minutes.

Blog: https://huggingface.co/blog/synthetic-data-generator
Space: argilla/synthetic-data-generator
Leiyre 
updated a Space 12 days ago