Fine tuning - a Testerpce Collection

Testerpce 's Collections

Agent

MoE

RAG

State space LLM

Partial layer training LLMs

Math

Dataset and Data processing

Video understanding

Reinforcement learning

Fine tuning

updated 11 days ago

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27 • 23
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published 14 days ago • 57