InHo Won

kotmul

AI & ML interests

None yet

kotmul's activity

updated a model about 10 hours ago

WildBoar-LM/wildboar-6B-0.3epoch

Text Generation • Updated about 10 hours ago • 7

published a model about 10 hours ago

WildBoar-LM/wildboar-6B-0.3epoch

Text Generation • Updated about 10 hours ago • 7

liked a Space 1 day ago

523

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

updated a dataset 7 days ago

kotmul/toxicity_kr

Viewer • Updated 7 days ago • 62.2k • 22

published a dataset 7 days ago

kotmul/toxicity_kr

Viewer • Updated 7 days ago • 62.2k • 22

liked 2 models 8 days ago

UNIVA-Bllossom/DeepSeek-llama3.3-Bllossom-70B

Text Generation • Updated 11 days ago • 1.31k • 46

UNIVA-Bllossom/DeepSeek-llama3.1-Bllossom-8B

Text Generation • Updated 13 days ago • 3.8k • 33

updated a dataset 14 days ago

TEL-LLM/pubmed-inst-synthesizer-100k

Viewer • Updated 14 days ago • 100k • 55

published a dataset 14 days ago

TEL-LLM/pubmed-inst-synthesizer-100k

Viewer • Updated 14 days ago • 100k • 55

updated a dataset 16 days ago

TEL-LLM/bloomberg-inst-synthesizer-100k

Viewer • Updated 16 days ago • 99.9k • 69

published a dataset 16 days ago

TEL-LLM/bloomberg-inst-synthesizer-100k

Viewer • Updated 16 days ago • 99.9k • 69

liked a dataset 19 days ago

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 10.8k • 587

upvoted a paper 23 days ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published 26 days ago • 106

liked a dataset 29 days ago

BAAI/IndustryCorpus_finance

Viewer • Updated Jul 26, 2024 • 32.6M • 1.04k • 11

upvoted a paper about 1 month ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 332

updated 2 datasets about 1 month ago

MLP-SEMO/sentence_recon

Viewer • Updated Jan 14 • 244M • 101 • 1

MLP-SEMO/sentence_recon

Viewer • Updated Jan 14 • 244M • 101 • 1

upvoted a paper 2 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

liked a model 2 months ago

Bllossom/llama-3.1-Korean-Bllossom-Vision-8B

Image-Text-to-Text • Updated Jan 4 • 610 • 108

liked a dataset 2 months ago

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4, 2024 • 442k • 1.55k • 167