# llama-3.2-1B-FastApi / requirements.txt
fastapi # For the web API framework
transformers # For loading and running the Llama model
torch # PyTorch backend required by transformers
uvicorn # ASGI server for serving the FastAPI app
python-dotenv # For loading configuration from a .env file
optimum[onnxruntime] # For CPU optimizations with ONNX Runtime
accelerate # For managing multi-device setup (CPU/GPU)
gunicorn # For running multiple workers
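With gunicorn and uvicorn both installed, the usual way to run multiple workers is gunicorn with the uvicorn worker class. A minimal sketch of the launch command, assuming the FastAPI instance is exposed as `app` in a `main.py` (adjust the module path to match this repo's actual entrypoint):

```shell
# Run 2 worker processes, each a uvicorn ASGI worker, listening on port 8000.
# `main:app` is an assumed module path, not confirmed by this repo.
gunicorn main:app \
  --workers 2 \
  --worker-class uvicorn.workers.UvicornWorker \
  --bind 0.0.0.0:8000
```

Note that each worker loads its own copy of the model into memory, so the worker count should be chosen with available RAM in mind.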