ShowAndTell-2025-01-30 - a floom Collection

floom 's Collections

ShowAndTell-2025-01-30

ShowAndTell-2024-12-03

Coding

ICL

RL

Agents

NLU

RAG

Data Efficient Approaches

Personalization

sentence-transformer-models

Tool Use & more

Feedback Analysis

Memory

SSM

Efficient Serving/Inference

Synthetic Data Generation

Frontier research ideas

ShowAndTell-2025-01-30

updated 4 days ago

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 11 days ago • 30
DeepSeek-V3 Technical Report

Paper • 2412.19437 • Published Dec 27, 2024 • 49
Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 10 days ago • 32
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 88
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 16 days ago • 302
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published 18 days ago • 63
RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 15 days ago • 22
Autonomy-of-Experts Models

Paper • 2501.13074 • Published 16 days ago • 40