Dayiheng Liu

Losin94

Recent Activity

authored a paper 5 days ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

authored a paper 5 days ago

ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training

authored a paper 5 days ago

OccuQuest: Mitigating Occupational Bias for Inclusive Large Language Models

Losin94's activity

authored 10 papers 5 days ago

How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition

Paper • 2310.05492 • Published Oct 9, 2023 • 2

Bridging Subword Gaps in Pretrain-Finetune Paradigm for Natural Language Generation

Paper • 2106.06125 • Published Jun 11, 2021

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Paper • 2406.14024 • Published Jun 20

Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement

Paper • 2409.12122 • Published Sep 18 • 3

Language Models can Self-Lengthen to Generate Long Texts

Paper • 2410.23933 • Published Oct 31 • 17

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 326

upvoted a paper 6 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 6 days ago • 326

authored a paper 15 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 16 days ago • 68

upvoted a paper 15 days ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 16 days ago • 68

authored 2 papers 3 months ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 74

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18 • 138

updated a collection 3 months ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 9 items • Updated 28 days ago • 58

authored a paper 4 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4 • 71