IdolSankaku SwinV2 Tagger v1
Supports ratings, characters and general tags.
Trained using https://github.com/SmilingWolf/JAX-CV.
TPUs used for training kindly provided by the TRC program.
Dataset
Trained on a human annotated dataset of real world photos.
Validation results
v1.0: P=R: threshold = 0.3094, F1 = 0.6161
What's new
Model v1.0/Dataset v1:
First version of the dataset, tags updated on 2024-08-31.timm
compatible! Load it up and give it a spin using the canonical one-liner!
ONNX model is compatible with code developed for the v3 series of WD tagger models.
The batch dimension of the ONNX model is not fixed to 1 anymore. Now you can go crazy with batch inference.
Switched to Macro-F1 to measure model performance since it gives me a better gauge of overall training progress.
Runtime deps
ONNX model requires onnxruntime >= 1.17.0
Inference code examples
For timm: https://github.com/neggles/wdv3-timm
For ONNX: https://huggingface.co/spaces/SmilingWolf/wd-tagger
For JAX: https://github.com/SmilingWolf/wdv3-jax
Final words
Subject to change and updates.
Downstream users are encouraged to use tagged releases rather than relying on the head of the repo.
Thanks
Thanks to the whole DeepGHS team for data gathering and encouraging me to push the models much further than they had any reason to attempt to reach, much less succeed.
- Downloads last month
- 8