Erfan Miahi
miahi
AI & ML interests
Reinforcement Learning, Representation Learning, Generalization
Recent Activity
upvoted
a
paper
8 days ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR
Summarization
liked
a dataset
10 months ago
iamtarun/code_instructions_120k_alpaca
liked
a dataset
10 months ago
nampdn-ai/tiny-codes
Organizations
None yet
models
None public yet
datasets
None public yet