Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
61
Carlos Fonseca
carlfm01
Follow
shimmyshimmer's profile picture
1 follower
·
18 following
carlfm01
AI & ML interests
None yet
Recent Activity
liked
a dataset
5 days ago
unsloth/RLAIF-V-Dataset
liked
a model
10 days ago
HuggingFaceTB/SmolLM2-360M
reacted
to
Jaward
's
post
with 👀
10 days ago
Finally here it is: a faster, custom, scalable GRPO trainer for smaller models with < 500M params, can train on 8gb ram cpu, also supports gpu for sanity sake (includes support for vllm + flash attention). Using smolLM2-135M/360M-instructs as ref & base models. Experience your own “aha” moment 🐳 on 8gb ram. Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb
View all activity
Organizations
None yet
carlfm01
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
5 days ago
unsloth/RLAIF-V-Dataset
Viewer
•
Updated
Sep 26, 2024
•
2.49k
•
125
•
5
liked
a model
10 days ago
HuggingFaceTB/SmolLM2-360M
Text Generation
•
Updated
21 days ago
•
16.9k
•
•
39
liked
2 datasets
30 days ago
ylacombe/cml-tts
Viewer
•
Updated
Nov 24, 2023
•
1.34M
•
30.7k
•
19
bespokelabs/Bespoke-Stratos-17k
Viewer
•
Updated
28 days ago
•
16.7k
•
97.8k
•
283
liked
2 models
about 1 month ago
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation
•
Updated
4 days ago
•
480k
•
•
601
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
4 days ago
•
4.64M
•
•
10.4k
liked
a dataset
about 1 month ago
microsoft/PEACE
Viewer
•
Updated
Jan 26
•
7.73k
•
1.18k
•
13
liked
a dataset
2 months ago
microsoft/MAGIC
Viewer
•
Updated
Dec 17, 2024
•
48.1k
•
191
•
11
liked
3 models
3 months ago
Qwen/Qwen2.5-1.5B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
824k
•
•
339
deepseek-ai/DeepSeek-V2.5-1210
Text Generation
•
Updated
Dec 11, 2024
•
3.88k
•
251
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
Text Generation
•
Updated
Jul 3, 2024
•
86.9k
•
•
397
liked
a dataset
3 months ago
TIGER-Lab/OmniEdit-Filtered-1.2M
Viewer
•
Updated
Dec 6, 2024
•
1.2M
•
5.85k
•
71
liked
a model
3 months ago
unsloth/Llama-3.3-70B-Instruct
Text Generation
•
Updated
Jan 7
•
373k
•
38
liked
7 datasets
3 months ago
Xkev/LLaVA-CoT-100k
Viewer
•
Updated
Nov 27, 2024
•
98.6k
•
3.41k
•
74
5CD-AI/LLaVA-CoT-o1-Instruct
Viewer
•
Updated
Nov 27, 2024
•
58.5k
•
290
•
92
unsloth/Radiology_mini
Viewer
•
Updated
Nov 21, 2024
•
2.31k
•
2.15k
•
16
eltorio/ROCOv2-radiology
Viewer
•
Updated
Nov 13, 2024
•
79.8k
•
1.23k
•
45
HuggingFaceTB/smoltalk
Viewer
•
Updated
17 days ago
•
2.2M
•
7.34k
•
307
TIGER-Lab/WebInstructFull
Viewer
•
Updated
Dec 21, 2024
•
13.5M
•
908
•
21
TIGER-Lab/Fineweb-Instruct
Viewer
•
Updated
Nov 16, 2024
•
10.8M
•
1.08k
•
5
Load more