Gradient Checkpointing
#5, opened by amadalincostea2
Does the model support gradient checkpointing?
The OLMo codebase supports activation checkpointing.
But since you're here on Hugging Face, and not on GitHub, you probably want to know whether the Hugging Face version of OLMo supports it?
Same person, different account. Yes, I meant the Hugging Face version.
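For anyone landing here with the same question, one way to check this yourself for any model loaded through the transformers library is sketched below. It relies on the `supports_gradient_checkpointing` attribute, the `gradient_checkpointing_enable()` method, and the `is_gradient_checkpointing` property of `PreTrainedModel`. The tiny GPT-2 model is only a stand-in so the snippet runs without downloading anything, and the helper name `try_enable_checkpointing` is made up for illustration. It says nothing about what the Hugging Face version of OLMo reports; you would need to load that model and run the same check.

```python
from transformers import GPT2Config, GPT2LMHeadModel

# Stand-in model: a tiny, randomly initialised GPT-2 built from a config,
# so nothing is downloaded. This is NOT OLMo; swap in the model you care about.
config = GPT2Config(n_layer=2, n_head=2, n_embd=32, vocab_size=128, n_positions=64)
model = GPT2LMHeadModel(config)


def try_enable_checkpointing(model):
    """Hypothetical helper: return True if gradient checkpointing was enabled."""
    # Model classes that implement checkpointing set this class attribute to True.
    if not getattr(model, "supports_gradient_checkpointing", False):
        return False
    # Turns checkpointing on for the submodules that support it.
    model.gradient_checkpointing_enable()
    # True once at least one submodule has checkpointing switched on.
    return bool(model.is_gradient_checkpointing)


enabled = try_enable_checkpointing(model)
print("gradient checkpointing enabled:", enabled)
```

Checking the attribute first avoids the `ValueError` that `gradient_checkpointing_enable()` raises on model classes that do not support it.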
dirkgr changed discussion status to closed