ldwang
ldwang
AI & ML interests
LLM, MLLM, Infra
Recent Activity
updated
a dataset
17 minutes ago
BAAI/OpenSeek-Pretrain-Data-Examples
published
a dataset
about 3 hours ago
BAAI/OpenSeek-Pretrain-Data-Examples
upvoted
an
article
about 4 hours ago
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment