fastapi uvicorn pydantic transformers boto3 huggingface_hub torch optimum onnxruntime onnx bitsandbytes