Memory scaling on input length

#7
by marksverdhei - opened

Will it be possible to add some sort of memory scaling estimate?
either like a widget where you enter n-tokens or a list of typical lengths like 512, 1k, 2k, 4k, 8k

accelerate org

We're looking into this, definitely!

Sign up or log in to comment