iammartian0
/

vegetation_classification_model

Image Classification

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

vegetation_classification_model / README.md

iammartian0's picture

Update README.md

be5e9cf about 2 years ago

|

history blame contribute delete

2.53 kB

	---
	license: apache-2.0
	language:
	- en
	metrics:
	- accuracy
	pipeline_tag: image-classification
	tags:
	- climate
	---

	## Model description

	This is a transformers based image classification model, implemented using the technique of transfer learning.
	The pretrained model is [Vision transformer](https://huggingface.co/google/vit-base-patch16-224) trained on Imagenet-21k.

	## Datasets

	The dataset used is downloaded from git repo [Agri-Hub/Space2Ground](https://github.com/Agri-Hub/Space2Ground/tree/main).
	I used Street-level image patches folder for this model. It is a dataset containing cropped vegetation parts of
	mapillary street-level images. Further details are on the linked git repo.

	### How to use

	You can use this model directly with help of pipeline class from transformers library of hugging face

	```python

	>>>from transformers import pipeline
	>>>classifier = pipeline("image-classification", model="iammartian0/vegetation_classification_model")
	>>>classifier(image)

	```
	or

	uploading a target image to Hosted inference api.

	## Training procedure



	### Preprocessing

	Assigining labels based on parent folder names

	### Image Transformations

	Applied RandomResizedCrop from torchvision.transforms to all the training images.

	### Finetuning

	Model is finetuned on the dataset for four epochs

	## Evaluation results

	Model acheived an Top-1 accuracy of 0.929.

	## Further exploration to do
	- Trainig a multilabel model where model can find if the image is from left side or right side
	on top of classifying the vegetation
	- Fine grained classification of crop labels using Raw/Initial set of street-level images


	### BibTeX entry and citation info

	```bibtex
	@misc{wu2020visual,
	title={Visual Transformers: Token-based Image Representation and Processing for Computer Vision},
	author={Bichen Wu and Chenfeng Xu and Xiaoliang Dai and Alvin Wan and Peizhao Zhang and Zhicheng Yan and Masayoshi Tomizuka and Joseph Gonzalez and Kurt Keutzer and Peter Vajda},
	year={2020},
	eprint={2006.03677},
	archivePrefix={arXiv},
	primaryClass={cs.CV}
	}
	```
	```bibtex

	@INPROCEEDINGS{9816335,
	author={Choumos, George and Koukos, Alkiviadis and Sitokonstantinou, Vasileios and Kontoes, Charalampos},
	booktitle={2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP)},
	title={Towards Space-to-Ground Data Availability for Agriculture Monitoring},
	year={2022},
	volume={},
	number={},
	pages={1-5},
	doi={10.1109/IVMSP54334.2022.9816335}
	}
	```