hongyin
/

chat-llama2-70b-4bit

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

chat-llama2-70b-4bit

1 contributor

History: 10 commits

hongyin's picture

Upload pytorch_model.bin.index.json

24d0fcd 12 months ago