quyettv

AI & ML interests

None yet

Organizations

None yet

quyettv's activity

liked a Space about 2 months ago

Open Ko-LLM Leaderboard

Explore and filter language model benchmark results

upvoted a paper about 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 260

reacted to cfahlgren1's post with 🚀 2 months ago

Post • 3337

The deepseek-ai/DeepSeek-V3 is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page.

You can play with it here: https://deepseek-artifacts.vercel.app

All the responses get saved in the cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest quality frontend datasets on the hub 💪

liked a dataset 2 months ago

HuggingFaceFW/fineweb-2

Viewer • Updated Jan 8 • 12.5B • 74.6k • 437

upvoted a collection 2 months ago

DeepSeek-V3

3 items • Updated Jan 6 • 193

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-Base

Updated 8 days ago • 657k • 1.58k

upvoted a paper 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 348

updated 4 models 3 months ago

quyettv/SmolLM2-FT-DPO

Text Generation • Updated Dec 17, 2024 • 47

quyettv/SmolLM2-FT-the-stack-smol

Text Generation • Updated Dec 17, 2024 • 59

quyettv/SmolLM2-FT-o1

Text Generation • Updated Dec 17, 2024 • 54

quyettv/SmolLM2-FT-everyday

Text Generation • Updated Dec 16, 2024 • 61

liked 2 Spaces 3 months ago

Qwen2.5 Coder Artifacts

Generate application code with Qwen2.5-Coder-32B

QwQ-32B-Preview

upvoted a paper 4 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 128

commented on a paper 5 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

upvoted 3 papers 5 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 171

FAN: Fourier Analysis Networks

Paper • 2410.02675 • Published Oct 3, 2024 • 26

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted a paper 7 months ago

MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24, 2024 • 55

liked a model 7 months ago

THUDM/LongWriter-glm4-9b

Text Generation • Updated Oct 2, 2024 • 1.04k • 118