--- license: apache-2.0 base_model: google/vit-base-patch16-224 tags: - generated_from_trainer datasets: - imagefolder metrics: - accuracy model-index: - name: vit-base-patch16-224-base-patch16-224 results: - task: name: Image Classification type: image-classification dataset: name: imagefolder type: imagefolder config: default split: train args: default metrics: - name: Accuracy type: accuracy value: 0.8138537787800912 --- # Caracam (gen 2) This model is a fine-tuned version of [google/vit-base-patch16-224](https://huggingface.co/google/vit-base-patch16-224) on the imagefolder dataset. It achieves the following results on the evaluation set: - Loss: 0.7005 - Accuracy: 0.8139 ## Model description ***This is the model the Caracam mobile app will be is using upon release*** \ Second Generation of [Caracam](https://huggingface.co/Takekazuchi/Caracam)! with a tested accuracy of 92% and a reported accuracy of 81% this makes this model of Caracam at least 1.5x more accurate than gen 1!\ If you wish to support this project please head over to my [Patreon](https://www.patreon.com/Caracam) or my [Cashapp](https://cash.app/$Clippayy) \ even if you don't want to support us financially thank you for even just coming to this page in the first place as the traffic helps more people find this project! ## Intended uses & limitations ***NOT FOR COMMERCIAL USE OUTSIDE OF OFFICIAL CARACAM MOBILE APP*** \ ***Limitations*** \ This model assumes that its input image contains a well-cropped car. \ If a non-car image is given or if the car is not well-cropped, the output of the model may be meaningless. ## Expected Updates EXPECTED RELEASE DATE OF CARACAM: 12/01 (Date pushed back due to complications with tflite support for ViT model architecture) \ Future versions of Caracam mobile app to have price-prediction with links via web-integration as well as more general information on the predicted car model. \ User-Polls and comments with like/dislike system to be addded for feedback on new cars and anything else users want added to the app/model. ## Non-Suitable Usecases Do NOT use this model to determine whether an object is a car or not. ## Suitable Usecases DO use this model to determine the make/model/year of a car \ DO take pictures of your animals to see what car they look like and send the results to my [Twitter](https://twitter.com/caracam0) so i can retweet them! ### Training hyperparameters The following hyperparameters were used during training: - learning_rate: 0.0001 - train_batch_size: 32 - eval_batch_size: 32 - seed: 42 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08 - lr_scheduler_type: linear - lr_scheduler_warmup_ratio: 0.1 - num_epochs: 10 - mixed_precision_training: Native AMP ### Training results | Training Loss | Epoch | Step | Validation Loss | Accuracy | |:-------------:|:-----:|:------:|:---------------:|:--------:| | 2.9008 | 1.0 | 21451 | 2.0266 | 0.5235 | | 1.747 | 2.0 | 42902 | 1.2724 | 0.6764 | | 1.4453 | 3.0 | 64353 | 1.0682 | 0.7189 | | 1.1809 | 4.0 | 85804 | 0.9663 | 0.7445 | | 1.2318 | 5.0 | 107255 | 0.8934 | 0.7596 | | 0.8664 | 6.0 | 128706 | 0.8309 | 0.7782 | | 1.0645 | 7.0 | 150157 | 0.7863 | 0.7890 | | 1.1092 | 8.0 | 171608 | 0.7503 | 0.7974 | | 0.8655 | 9.0 | 193059 | 0.7204 | 0.8076 | | 0.6357 | 10.0 | 214510 | 0.7005 | 0.8139 | ### Framework versions - Transformers 4.38.1 - Pytorch 2.2.1+cu121 - Datasets 2.17.1 - Tokenizers 0.15.2