YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Resnet50 Test Quantization for accelerated int8-int8 compute for Apple Neural Engine

Resnet From Code:

Int8ANE.ipynb - test notebook to create


CoreMLTools 8.01b, M4 iPad Pro 16GB, iPad OS18.1 beta

LUT 4-bit FP16 1.03ms
A4W8 0.58 ms
LUT 4-bit (A8W8) 0.92ms - no Accell!

Tourch Model: 8-bit
resnet50-LUT8-iOS17.mlpackage 1.08 ms
resnet50-W8A8-iOS17.mlpackage 0.81 ms

4-bit
resnet50-LUT4-iOS17.mlpackage 0.93 ms
resnet50-W4A8-iOS18.mlpackage 0.68 ms

Downloads last month
12
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.