ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

updated a dataset about 2 hours ago

BAAI/OpenSeek-Pretrain-Data-Examples

published a dataset about 5 hours ago

BAAI/OpenSeek-Pretrain-Data-Examples

upvoted an article about 6 hours ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

View all activity

Organizations

ldwang's activity

updated a dataset about 2 hours ago

BAAI/OpenSeek-Pretrain-Data-Examples

Preview • Updated about 2 hours ago

published a dataset about 5 hours ago

BAAI/OpenSeek-Pretrain-Data-Examples

Preview • Updated about 2 hours ago

upvoted an article about 6 hours ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

14 days ago

• 6

updated a collection about 9 hours ago

MiscBlogs

3 items • Updated about 9 hours ago • 1

liked a Space about 9 hours ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 1 day ago

facebook/natural_reasoning

Viewer • Updated 4 days ago • 1.15M • 2.12k • 195

upvoted a paper 1 day ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 147

updated a collection 2 days ago

MiscModels

5 items • Updated 2 days ago • 1

liked a model 2 days ago

moonshotai/Moonlight-16B-A3B-Instruct

Text Generation • Updated 3 days ago • 1.14k • 97

liked a dataset 2 days ago

OpenCoder-LLM/opc-sft-stage2

Viewer • Updated Nov 24, 2024 • 436k • 1.44k • 56

upvoted a collection 7 days ago

DeepSeek-R1

8 items • Updated Jan 21 • 533

liked a dataset 7 days ago

microsoft/RedStone

Updated Dec 5, 2024 • 31 • 32

updated a Space 7 days ago

Openseek

published a Space 9 days ago

Openseek

liked a dataset 10 days ago

bigcode/the-stack-v2-dedup

Viewer • Updated Apr 23, 2024 • 2.3B • 1.94k • 85

liked a dataset 12 days ago

PleIAs/common_corpus

Viewer • Updated 14 days ago • 470M • 57.3k • 239

upvoted an article 13 days ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023

• 21

liked a dataset 17 days ago

WildEval/ZebraLogic

Viewer • Updated 21 days ago • 4.26k • 227 • 5

updated a collection 17 days ago

MiscModels

5 items • Updated 2 days ago • 1

liked a model 17 days ago

princeton-nlp/QuRater-1.3B

Text Classification • Updated Apr 16, 2024 • 443 • 13