Gunisetti Gokul's picture

Gunisetti Gokul

gokul9

·

gokulcoder7

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago

gokul9/phi-4-Q4_K_M-GGUF

published a model 2 days ago

gokul9/phi-4-Q4_K_M-GGUF

liked a Space 2 days ago

webml-community/deepseek-r1-webgpu

View all activity

Organizations

None yet

gokul9's activity

upvoted a paper 17 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248

upvoted a collection 20 days ago

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 28

upvoted a paper about 1 month ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

upvoted 4 collections about 2 months ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 88

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 192

best vision model

1 item • Updated Dec 6, 2024 • 1

best text to video model

1 item • Updated Dec 4, 2024 • 1

upvoted an article about 2 months ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 132

upvoted 2 collections about 2 months ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 56

Qwen2.5-Math

Math-specific model series based on Qwen2.5 • 11 items • Updated 12 days ago • 65

upvoted a collection 2 months ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 7 items • Updated 20 days ago • 33

upvoted an article 2 months ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

By

•

Jul 27, 2024

• 30

upvoted an article 3 months ago

Article

Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique

By

•

Nov 30, 2023

• 31

upvoted 3 collections 3 months ago

useful for academics

1 item • Updated Nov 2, 2024 • 1

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 355

image editing

2 items • Updated Nov 2, 2024 • 1