TU Berlin	RSiM	DIMA	BigEarth	BIFOLD

Convmixer_768_32 pretrained on BigEarthNet v2.0 using Sentinel-1 bands

This model was trained on the BigEarthNet v2.0 (also known as reBEN) dataset using the Sentinel-1 bands. It was trained using the following parameters:

Number of epochs: up to 100 (with early stopping after 5 epochs of no improvement based on validation average precision macro)
Batch size: 512
Learning rate: 0.001
Dropout rate: 0.15
Drop Path rate: 0.15
Learning rate scheduler: LinearWarmupCosineAnnealing for 1000 warmup steps
Optimizer: AdamW
Seed: 42

The weights published in this model card were obtained after 35 training epochs. For more information, please visit the official BigEarthNet v2.0 (reBEN) repository, where you can find the training scripts.

The model was evaluated on the test set of the BigEarthNet v2.0 dataset with the following results:

Metric	Macro	Micro
Average Precision	0.595479	0.787697
F1 Score	0.549242	0.692228
Precision	0.612713	0.747496

Example

A Sentinel-1 image (VV, VH and VV/VH bands are used for visualization)

Class labels	Predicted scores
Agro-forestry areas Arable land Beaches, dunes, sands ... Urban fabric	0.000299 0.008017 0.001981 ... 0.383937

To use the model, download the codes that define the model architecture from the official BigEarthNet v2.0 (reBEN) repository and load the model using the code below. Note that you have to install configilm to use the provided code.

from reben_publication.BigEarthNetv2_0_ImageClassifier import BigEarthNetv2_0_ImageClassifier

model = BigEarthNetv2_0_ImageClassifier.from_pretrained("path_to/huggingface_model_folder")

e.g.

from reben_publication.BigEarthNetv2_0_ImageClassifier import BigEarthNetv2_0_ImageClassifier

model = BigEarthNetv2_0_ImageClassifier.from_pretrained(
  "BIFOLD-BigEarthNetv2-0/convmixer_768_32-s1-v0.1.1")

If you use this model in your research or the provided code, please cite the following papers:

CITATION FOR DATASET PAPER

@article{hackel2024configilm,
  title={ConfigILM: A general purpose configurable library for combining image and language models for visual question answering},
  author={Hackel, Leonard and Clasen, Kai Norman and Demir, Beg{\"u}m},
  journal={SoftwareX},
  volume={26},
  pages={101731},
  year={2024},
  publisher={Elsevier}
}