1 28 52

罗杰斯

rojasdiego

https://rojasdiego.com

AI & ML interests

LLMs for Code Generation

Recent Activity

upvoted a paper 3 days ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

upvoted a paper 3 days ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

upvoted a paper 12 days ago

The Curse of Depth in Large Language Models

View all activity

Organizations

rojasdiego's activity

upvoted 2 papers 3 days ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published 10 days ago • 18

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published 3 days ago • 16

upvoted a paper 12 days ago

The Curse of Depth in Large Language Models

Paper • 2502.05795 • Published 29 days ago • 35

liked a model 23 days ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated Jan 30 • 29.1k • 226

upvoted 2 papers about 1 month ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 108

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 14 days ago • 3.64M • • 11.1k

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 14 days ago • 11.6k • 860

liked a dataset about 2 months ago

bigcode/the-stack-v2-train-smol-ids

Viewer • Updated Apr 23, 2024 • 40.1M • 961 • 32

liked a model 2 months ago

numind/NuExtract-1.5

Text Generation • Updated Nov 18, 2024 • 12.3k • • 217

updated a collection 2 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models 2 months ago

infly/OpenCoder-1.5B-Base

Text Generation • Updated Nov 11, 2024 • 11.3k • 21

infly/OpenCoder-8B-Instruct

Text Generation • Updated Nov 14, 2024 • 1.77k • 187

updated a collection 2 months ago

CoT Models

Collection

2 items • Updated Jan 1

liked a model 2 months ago

PowerInfer/SmallThinker-3B-Preview

Text Generation • Updated Jan 16 • 108k • • 389

updated a collection 2 months ago

Code LLMs

Collection

6 items • Updated Jan 3 • 1

liked 2 models 2 months ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Jan 12 • 258k • • 1.7k

deepseek-ai/DeepSeek-V3-Base

Updated 14 days ago • 765k • 1.59k

liked a model 3 months ago

meta-llama/Llama-3.3-70B-Instruct

Text Generation • Updated Dec 21, 2024 • 740k • • 2.11k

upvoted a paper 4 months ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 35