DataComp

non-profit

https://www.datacomp.ai/dclm/index.html#home

AI & ML interests

None defined yet.

Recent Activity

ranpox authored a paper 5 days ago

Qwen2.5-VL Technical Report

wannaphong authored a paper 6 days ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Lewis-Lau authored a paper 6 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

View all activity

dclm's activity

ranpox

authored a paper 5 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 6 days ago • 143

wannaphong

authored a paper 6 days ago

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published 7 days ago • 11

Lewis-Lau

authored a paper 6 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 97

bencw

authored a paper 11 days ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 12 days ago • 32

AmeyaPrabhu

authored a paper 18 days ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published 19 days ago • 30

thomwolf

authored a paper 19 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 21 days ago • 192

weizechen

authored a paper 21 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 22 days ago • 54

lx865712528

authored a paper 27 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 28 days ago • 36

Wanfq

authored 2 papers about 1 month ago

BlockPruner: Fine-grained Pruning for Large Language Models

Paper • 2406.10594 • Published Jun 15, 2024

ProFuser: Progressive Fusion of Large Language Models

Paper • 2408.04998 • Published Aug 9, 2024

lx865712528

authored a paper about 1 month ago

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published Jan 23 • 44

yentinglin

authored a paper about 1 month ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published Jan 18 • 15

greglindahl

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

thomwolf

authored a paper about 1 month ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 55

Lewis-Lau

authored 2 papers about 1 month ago

T-Rex: Text-assisted Retrosynthesis Prediction

Paper • 2401.14637 • Published Jan 26, 2024

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 84

lx865712528

authored 2 papers about 2 months ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published Jan 8 • 14

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18

orionweller

authored 2 papers 2 months ago

NevIR: Negation in Neural Information Retrieval

Paper • 2305.07614 • Published May 12, 2023 • 1

Learning from Task Descriptions

Paper • 2011.08115 • Published Nov 16, 2020