fastapi
uvicorn
pydantic
transformers
boto3
huggingface_hub
torch
optimum
onnxruntime
onnx
bitsandbytes
accelerate>=0.26.0