Nicky's picture

Nicky

NickyNicky

·

AI & ML interests

None yet

Recent Activity

new activity 1 day ago

Qwen/QwQ-32B:What languages were you trained in?

liked a model 1 day ago

Qwen/QwQ-32B

liked a model 2 days ago

KRLabsOrg/lettucedect-large-modernbert-en-v1

View all activity

Organizations

NickyNicky's activity

New activity in Qwen/QwQ-32B 1 day ago

What languages were you trained in?

#7 opened 1 day ago by

liked a model 1 day ago

Qwen/QwQ-32B

Text Generation • Updated about 12 hours ago • 8.74k • • 1.11k

liked 2 models 2 days ago

KRLabsOrg/lettucedect-large-modernbert-en-v1

Token Classification • Updated 7 days ago • 256 • 16

KRLabsOrg/lettucedect-base-modernbert-en-v1

Token Classification • Updated 7 days ago • 2.4k • 11

liked a model 3 days ago

lightblue/DeepSeek-R1-Distill-Qwen-1.5B-Multilingual

Updated Jan 31 • 1.12k • 19

reacted to rizavelioglu's post with 🚀 4 days ago

Post

3071

Comparing reconstruction quality of various VAEs with an interactive demo
rizavelioglu/vae-comparison

1 reply

·

liked a model 5 days ago

pankajmathur/orca_mini_v9_6_1B-Instruct

Text Generation • Updated Jan 23 • 188 • 6

reacted to Jaward's post with 🚀 5 days ago

Post

4893

made a few improvements on custom grpo trainer:
- added sequence similarity reward (seems to work)
- improved vllm support (5x inference speed)
- adjusted reward scores (this helped with format/accuracy)
- can now push to hf hub (already pushed mine lol: Jaward/smollm2_360m_grpo_gsm8k_reasoner)

Code: https://github.com/Jaykef/ai-algorithms/blob/main/smollm2_360M_135M_grpo_gsm8k.ipynb

New activity in microsoft/Phi-4-multimodal-instruct 8 days ago

thanks , how to fine tune?

#1 opened 8 days ago by

liked a model 8 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 2 days ago • 71.2k • 932

liked 3 models 9 days ago

microsoft/Magma-8B

Image-Text-to-Text • Updated 1 day ago • 8.99k • 308

prithivMLmods/Guard-Against-Unsafe-Content-Siglip2

Image Classification • Updated 9 days ago • 67 • 10

samchain/econo-sentence-v2

Sentence Similarity • Updated 15 days ago • 35 • 2

liked 2 datasets 9 days ago

samchain/econo-pairs-v2

Viewer • Updated about 6 hours ago • 56.3k • 87 • 2

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated 8 days ago • 251k • 3.7k • 126

liked 2 models 9 days ago

Undi95/MistralThinker-GGUF

Updated 9 days ago • 261 • 3

perplexity-ai/r1-1776

Text Generation • Updated 8 days ago • 35.8k • • 2.03k

liked 3 models 10 days ago

NexaAIDev/DeepSeek-R1-Distill-Qwen-1.5B-NexaQuant

Updated 15 days ago • 6.58k • 86

moonshotai/Moonlight-16B-A3B

Text Generation • Updated 9 days ago • 1.78k • 72

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 3 days ago • 3.92k • 125