add device=cuda; initialize tokenizer and model outside of function call; add inputs to device
60ffe71
Allen Park
commited on