Marko Tiosavljevic
magnetoid
Recent Activity
liked
a model
10 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
reacted
to
csabakecskemeti's
post
with an emoji
10 days ago
Testing training on AMD/ROCm for the first time!
I've got my hands on an AMD Instinct MI100. Used, it costs about the same as a V100, but on paper it has more FP32 compute (V100 14 TFLOPS vs. MI100 23 TFLOPS). Its HBM also runs at a faster clock, so the memory bandwidth is 1.2 TB/s.
For quantized inference it's a beast (the MI50 was also surprisingly fast).
For LoRA training, in this quick test I could not get the bnb config to work, so I'm running the fine-tuning on the full-size model.
I will share the install steps, setup, and settings I've learned in a blog post, together with the 3D design for the cooling shroud.
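The bandwidth and compute figures in the post can be sanity-checked with a little arithmetic. The sketch below is not from the post. It assumes a 4096-bit HBM2 interface on both cards and per-pin data rates of about 1.754 Gbit/s (V100) and 2.4 Gbit/s (MI100), taken from public spec sheets. The function name `hbm_bandwidth_gb_s` is hypothetical.

```python
def hbm_bandwidth_gb_s(bus_width_bits: int, pin_rate_gbps: float) -> float:
    """Peak bandwidth in GB/s = bus width (bits) * per-pin rate (Gbit/s) / 8."""
    return bus_width_bits * pin_rate_gbps / 8


# Assumed spec-sheet values (not stated in the post).
v100_bw = hbm_bandwidth_gb_s(4096, 1.754)  # 877 MHz HBM2, double data rate
mi100_bw = hbm_bandwidth_gb_s(4096, 2.4)   # 1200 MHz HBM2, double data rate

# Compute figures quoted in the post (V100 14 vs. MI100 23).
compute_ratio = 23 / 14

print(f"V100  bandwidth: {v100_bw:.0f} GB/s")
print(f"MI100 bandwidth: {mi100_bw:.0f} GB/s")
print(f"MI100 / V100 compute ratio: {compute_ratio:.2f}x")
```

Under these assumptions the MI100 comes out at roughly 1.2 TB/s, which matches the number in the post, and its on-paper compute advantage over the V100 is about 1.6x.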
liked
a model
14 days ago
perplexity-ai/r1-1776