4 20 84

Richard Lian

richardlian

dachenlian

AI & ML interests

None yet

Recent Activity

liked a model about 8 hours ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

liked a model 21 days ago

unsloth/DeepSeek-R1-GGUF

upvoted a paper about 1 month ago

Evolving Deeper LLM Thinking

View all activity

Organizations

richardlian's activity

liked a model about 8 hours ago

KVCache-ai/DeepSeek-R1-GGML-FP8-Hybrid

Updated 1 day ago • 3

liked a model 21 days ago

unsloth/DeepSeek-R1-GGUF

Text Generation • Updated 13 days ago • 4.2M • 925

upvoted 2 papers about 1 month ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 106

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 16 • 37

upvoted an article about 1 month ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

Jan 15

• 149

liked a Space about 2 months ago

1.16k

Big Code Models Leaderboard

📈

Submit code models for evaluation on benchmarks

upvoted an article about 2 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 17

upvoted a collection 2 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 139

liked a model 3 months ago

nyrahealth/CrisperWhisper

Automatic Speech Recognition • Updated Dec 19, 2024 • 23.6k • • 233