File size: 795 Bytes
cbb0aea
 
 
c3231ac
d96b454
c3231ac
 
 
7df35b8
d96b454
 
 
7a124f1
a81ef09
 
d96b454
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: apache-2.0
---

# ViT Fine-tuned on Stanford Car Dataset

Base model: https://huggingface.co/google/vit-base-patch16-224

This achieves around 86% on the testing set

Dataset Description: 

The Stanford car dataset contains 16,185 images of 196 classes of cars. Classes are typically at the level of Make, Model, Year, e.g. 2012 Tesla Model S or 2012 BMW M3 coupe. The data is split into 8144 training images, 6,041 testing images, and 2000 validation images in this case. 

 <img src="https://ai.stanford.edu/~jkrause/cars/class_montage.jpg"> 

Citations: 
3D Object Representations for Fine-Grained Categorization
Jonathan Krause, Michael Stark, Jia Deng, Li Fei-Fei
4th IEEE Workshop on 3D Representation and Recognition, at ICCV 2013 (3dRR-13). Sydney, Australia. Dec. 8, 2013.