40 4 177

kaeru39 PRO

ryota39

AI & ML interests

LLM × RL

Recent Activity

updated a dataset about 7 hours ago

preference-team/dataset-for-annotation-v2-annotated

updated a dataset about 7 hours ago

preference-team/progress

published a Space about 16 hours ago

ryota39/sake-sonar

View all activity

Organizations

Collections 7

spaces 2

Running

ICLR2025 Sonar-v1

🪄

Search for ICLR2025 papers using keywords

Running

Sake Sonar

🍶

Ask questions about brewing industry data

models 17

datasets 29

ryota39/test

Viewer • Updated Jan 23 • 5 • 49

ryota39/wild_chat_ja

Viewer • Updated Jan 23 • 3.49k • 96

ryota39/aya-evol-instruct

Viewer • Updated Jan 6 • 29.2k • 59

ryota39/JCommonsenseMorality

Viewer • Updated Nov 29, 2024 • 9.98k • 101

ryota39/hh-rlhf

Viewer • Updated Nov 26, 2024 • 169k • 70

ryota39/preference-en-ja-100k

Viewer • Updated Nov 19, 2024 • 101k • 68 • 1

ryota39/preference_test

Viewer • Updated Nov 16, 2024 • 29.6k • 64

ryota39/preference_test_annotated

Viewer • Updated Nov 10, 2024 • 5 • 59

ryota39/open_preference_v0.4

Viewer • Updated Aug 26, 2024 • 202k • 69 • 1

ryota39/webgpt_comparisons-ja

Viewer • Updated Aug 16, 2024 • 17.4k • 63 • 1

kaeru39 PRO

AI & ML interests

Recent Activity

Organizations

Collections 7

ryota39/Phi-3-mini-4k-instruct-dpo

ryota39/truthy-dpo-ja

ryota39/hh-rlhf-12k-ja_orpo

ryota39/dpo-ja-45k

ryota39/Tora-7B-v0.1

ryota39/Tora-7B-v0.2

ryota39/Tora-12B

spaces 2

ICLR2025 Sonar-v1

Sake Sonar

models 17

ryota39/gemma-2-2b-jpn-it-q8

ryota39/Tora-12B

ryota39/Tora-7B-v0.1

ryota39/mluke-large-lite-reward

ryota39/retriva-bert-preference-classifier

ryota39/Tora-7B-v0.2

ryota39/llm-jp-1b-sft-100k-LoRA-dpo-12k

ryota39/Phi-3-mini-4k-instruct-dpo

ryota39/llm-jp-1b-sft-15k

ryota39/llm-jp-1b-sft-100k-LoRA

datasets 29

ryota39/test

ryota39/wild_chat_ja

ryota39/aya-evol-instruct

ryota39/JCommonsenseMorality

ryota39/hh-rlhf

ryota39/preference-en-ja-100k

ryota39/preference_test

ryota39/preference_test_annotated

ryota39/open_preference_v0.4

ryota39/webgpt_comparisons-ja

kaeru39 PRO

AI & ML interests

Recent Activity

Organizations

Collections 7

spaces 2 Sort: Recently updated

ICLR2025 Sonar-v1

Sake Sonar

models 17 Sort: Recently updated

datasets 29 Sort: Recently updated

spaces 2

models 17

datasets 29