Andrew Siah PRO
andrewsiah
AI & ML interests
None yet
Recent Activity
Upvoted an article 4 days ago: "FastRTC: The Real-Time Communication Library for Python"
New activity 4 days ago in namkoong-lab/PersonalLLM: "Add task category and link to paper"
New activity 4 days ago in namkoong-lab/PersonalLLM_Eval: "Improve dataset card and add paper link"
Organizations
Collections 1
Spaces 2
Models 10

andrewsiah/Qwen-2.5-1.5B-Instruct-Datamix • Text Generation • 1
andrewsiah/Qwen-2.5-7B-Simple-RL • Text Generation • 2
andrewsiah/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • 9
andrewsiah/Qwen2.5-1.5B-Open-R1-Distill
andrewsiah/Reinforce-1 • Reinforcement Learning
andrewsiah/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning
andrewsiah/taxi-v3 • Reinforcement Learning
andrewsiah/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning
andrewsiah/ppo-Huggy • Reinforcement Learning • 36
andrewsiah/ppo-LunarLander-v2 • Reinforcement Learning
Datasets 1178

andrewsiah/math-mixture-mix_precalculus50_number_theory42_intermediate_algebra05 • 2.7k • 44
andrewsiah/math-mixture-mix_counting___probability47_algebra18_number_theory15 • 2.7k • 27
andrewsiah/math-mixture-mix_number_theory87_precalculus09_counting___probability01 • 2.7k • 40
andrewsiah/math-mixture-mix_algebra76_prealgebra12_number_theory08 • 2.7k • 26
andrewsiah/math-mixture-mix_algebra33_intermediate_algebra32_geometry27 • 2.7k • 40
andrewsiah/math-mixture-mix_geometry52_intermediate_algebra30_prealgebra09 • 2.7k • 37
andrewsiah/math-mixture-mix_intermediate_algebra97_algebra01_prealgebra00 • 2.7k • 29
andrewsiah/math-mixture-mix_geometry54_intermediate_algebra29_prealgebra14 • 2.7k • 37
andrewsiah/math-mixture-mix_precalculus99_algebra00_geometry00 • 2.7k • 28
andrewsiah/math-mixture-mix_precalculus40_algebra33_number_theory14 • 2.7k • 28