Mixture of Rewards

university

AI & ML interests

None defined yet.

MoR-labeling's activity

seungone

authored a paper 21 days ago

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published Dec 10, 2024 • 2

seungone

authored a paper 25 days ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 8

seungone

authored a paper about 2 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 46

seungone

authored 3 papers 3 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

Better Instruction-Following Through Minimum Bayes Risk

Paper • 2410.02902 • Published Oct 3, 2024

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published Oct 21, 2024 • 44

seungone

authored 2 papers 5 months ago

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20, 2024 • 12

Can Language Models Evaluate Human Written Text? Case Study on Korean Student Writing for Education

Paper • 2407.17022 • Published Jul 24, 2024

seungone

authored 2 papers 8 months ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 2

Aligning to Thousands of Preferences via System Message Generalization

Paper • 2405.17977 • Published May 28, 2024 • 7

seungone

authored 2 papers 9 months ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

Paper • 2404.10346 • Published Apr 16, 2024 • 1

seungone

authored a paper 10 months ago

Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models

Paper • 2404.02575 • Published Apr 3, 2024 • 48

seungone

authored 2 papers 11 months ago

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Paper • 2402.11548 • Published Feb 18, 2024

Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?

Paper • 2402.11597 • Published Feb 18, 2024 • 1

seungone

authored a paper 12 months ago

LangBridge: Multilingual Reasoning Without Multilingual Supervision

Paper • 2401.10695 • Published Jan 19, 2024 • 5

seungone

authored a paper about 1 year ago

Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation

Paper • 2401.06591 • Published Jan 12, 2024 • 3

wkddydpf

authored a paper about 1 year ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

seungone

authored a paper about 1 year ago

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Paper • 2310.11564 • Published Oct 17, 2023 • 2

seungone

authored a paper over 1 year ago

CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification

Paper • 2303.03628 • Published Mar 7, 2023 • 2

Team members 13
