Josè Juan flores MEndez's picture

1 116

Josè Juan flores MEndez

joseEjmendez

·

Jose_jMdz

AI & ML interests

Lerning

Recent Activity

liked a Space 8 days ago

MiniMaxAI/MiniMax-Text-01

liked a model 9 days ago

MiniMaxAI/MiniMax-VL-01

liked a model 9 days ago

bytedance-research/UI-TARS-7B-SFT

View all activity

Organizations

None yet

joseEjmendez's activity

liked a Space 8 days ago

MiniMaxText01

liked 4 models 9 days ago

MiniMaxAI/MiniMax-VL-01

Image-Text-to-Text • Updated 6 days ago • 2.04k • 226

bytedance-research/UI-TARS-7B-SFT

Image-Text-to-Text • Updated 6 days ago • 2.65k • 125

lmstudio-community/DeepSeek-R1-Distill-Qwen-1.5B-GGUF

Text Generation • Updated 11 days ago • 18.6k • 5

tensorblock/DeepSeek-R1-Distill-Qwen-1.5B-GGUF

Updated 11 days ago • 2.21k • 2

reacted to chansung's post with 🔥 9 days ago

Post

1981

Simple Summarization on DeepSeek-R1 from DeepSeek AI

The RL stage is very important.
↳ However, it is difficult to create a truly helpful AI for people solely through RL.
↳ So, we applied a learning pipeline consisting of four stages: providing a good starting point, reasoning RL, SFT, and safety RL, and achieved performance comparable to o1.
↳ Simply fine-tuning other open models with the data generated by R1-Zero (distillation) resulted in performance comparable to o1-mini.

Of course, this is just a brief overview and may not be of much help. All models are accessible on Hugging Face, and the paper can be read through the GitHub repository.

Model: https://huggingface.co/deepseek-ai
Paper: https://github.com/deepseek-ai/DeepSeek-R1

1 reply

·

liked a model 10 days ago

bartowski/SmallThinker-3B-Preview-GGUF

Text Generation • Updated Dec 30, 2024 • 81.5k • 26

liked 7 models 11 days ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 8 days ago • 804k • • 2.94k

unsloth/DeepSeek-R1-Distill-Qwen-1.5B-GGUF

Updated 7 days ago • 35.3k • 61

unsloth/DeepSeek-R1-Distill-Qwen-1.5B-unsloth-bnb-4bit

Text Generation • Updated 9 days ago • 5.8k • 7

unsloth/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated 9 days ago • 1.41k • 5

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated 6 days ago • 260k • • 589

unsloth/DeepSeek-R1-Distill-Qwen-14B

Text Generation • Updated 9 days ago • 5.95k • 9

deepseek-ai/deepseek-vl2-tiny

Image-Text-to-Text • Updated Dec 18, 2024 • 24.1k • 85

liked 2 models 13 days ago

mlx-community/helium-1-preview-2b-4bit

Text Generation • Updated 13 days ago • 31 • 1

mlx-community/helium-1-preview-2b-8bit

Text Generation • Updated 13 days ago • 21 • 1

New activity in kyutai/helium-1-preview-2b 13 days ago

GGUF format

#4 opened 13 days ago by

liked 3 models 14 days ago

calcuis/ltxv-gguf

Text-to-Video • Updated about 5 hours ago • 51.1k • 34

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated 15 days ago • 6.05k • 491

prithivMLmods/Omni-Reasoner-Merged

Text Generation • Updated 15 days ago • 53 • 9