Lee Junbum PRO

beomi

https://junbuml.ee

AI & ML interests

AI/ML GDE. Advancing Low-Resource Language Open Access LLM

Recent Activity

updated a collection 2 days ago

Korean Instruction Dataset

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

beomi's activity

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 4 days ago • 185

upvoted a paper 8 days ago

Do generative video models learn physical principles from watching videos?

Paper • 2501.09038 • Published 12 days ago • 30

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 12 days ago • 268

upvoted a paper about 2 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 75

upvoted 2 collections about 2 months ago

EXAONE-3.5

Collection

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B. • 10 items • Updated Dec 10, 2024 • 88

Llama 3.3 (All Versions)

Collection

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 6 days ago • 32

upvoted 4 papers about 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 124

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 31

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

upvoted a collection about 2 months ago

OLMo 2

Collection

Artifacts for the second set of OLMo models. • 22 items • Updated 20 days ago • 75

upvoted a paper about 2 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 11

upvoted 5 papers 2 months ago

upvoted 2 articles 3 months ago

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Article • Aug 26, 2024 • 48

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Article • Sep 18, 2024 • 216

upvoted a paper 3 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145