Carlos Fonseca

carlfm01

AI & ML interests

None yet

Recent Activity

reacted to Jaward's post with 👀 10 days ago

Finally, here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params. It can train on a CPU with 8 GB of RAM and also supports GPU for sanity's sake (including support for vLLM + flash attention). It uses SmolLM2-135M/360M-Instruct as the ref & base models. Experience your own “aha” moment 🐳 on 8 GB of RAM. Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

Organizations

None yet

carlfm01's activity

liked a dataset 5 days ago

unsloth/RLAIF-V-Dataset

Viewer • Updated Sep 26, 2024 • 2.49k • 125 • 5

liked a model 10 days ago

HuggingFaceTB/SmolLM2-360M

Text Generation • Updated 21 days ago • 16.9k • 39

liked 2 datasets 30 days ago

ylacombe/cml-tts

Viewer • Updated Nov 24, 2023 • 1.34M • 30.7k • 19

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated 28 days ago • 16.7k • 97.8k • 283

liked 2 models about 1 month ago

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • Updated 4 days ago • 480k • 601

deepseek-ai/DeepSeek-R1

Text Generation • Updated 4 days ago • 4.64M • 10.4k

liked a dataset about 1 month ago

microsoft/PEACE

Viewer • Updated Jan 26 • 7.73k • 1.18k • 13

liked a dataset 2 months ago

microsoft/MAGIC

Viewer • Updated Dec 17, 2024 • 48.1k • 191 • 11

liked 3 models 3 months ago

liked a dataset 3 months ago

TIGER-Lab/OmniEdit-Filtered-1.2M

Viewer • Updated Dec 6, 2024 • 1.2M • 5.85k • 71

liked a model 3 months ago

unsloth/Llama-3.3-70B-Instruct

Text Generation • Updated Jan 7 • 373k • 38

liked 7 datasets 3 months ago

Xkev/LLaVA-CoT-100k

Viewer • Updated Nov 27, 2024 • 98.6k • 3.41k • 74

5CD-AI/LLaVA-CoT-o1-Instruct

Viewer • Updated Nov 27, 2024 • 58.5k • 290 • 92

unsloth/Radiology_mini

Viewer • Updated Nov 21, 2024 • 2.31k • 2.15k • 16

eltorio/ROCOv2-radiology

Viewer • Updated Nov 13, 2024 • 79.8k • 1.23k • 45

HuggingFaceTB/smoltalk

Viewer • Updated 17 days ago • 2.2M • 7.34k • 307

TIGER-Lab/WebInstructFull

Viewer • Updated Dec 21, 2024 • 13.5M • 908 • 21

TIGER-Lab/Fineweb-Instruct

Viewer • Updated Nov 16, 2024 • 10.8M • 1.08k • 5