README.md · DarthReca/CLOSP-VL at main

CLOSP-VL / README.md

DarthReca

Update README.md

2e3ebdf verified 10 days ago

preview code

raw

history blame contribute delete

2 kB

	---
	license: openrail
	datasets:
	- DarthReca/crisislandmark
	language:
	- en
	library_name: transformers
	tags:
	- remote-sensing
	- text-to-image-retrieval
	- multimodal
	- geospatial
	- SAR
	- multispectral
	- crisis-management
	- earth-observation
	- contrastive-learning
	base_model:
	- sentence-transformers/all-MiniLM-L6-v2
	---
	# CLOSP-VL

	CLOSP (Contrastive Language Optical SAR Pretraining) is a multimodal architecture designed for text-to-image retrieval.
	It creates a unified embedding space for text, Sentinel-2 (MSI), and Sentinel-1 (SAR) data.
	The CLOSP-VL variant uses a ViT-large vision backbone.

	## Model Details
	The model uses three separate encoders: one for text, one for Sentinel-1 (SAR) data, and one for Sentinel-2 (MSI) data.
	During training, it uses a contrastive objective to align the textual embeddings with the corresponding visual embeddings (either SAR or MSI).


	- Developed by: Daniele Rege Cambrin
	- Model type: CLOSP
	- Language(s) (NLP): english
	- License: OpenRAIL
	- Finetuned from model: [More Information Needed]
	- Repository: [GitHub](https://github.com/DarthReca/closp)
	- Paper: [ArXiv](https://arxiv.org/abs/2507.10403)

	## How to Get Started with the Model

	Use the code below to get started with the model.

	```python
	model = AutoModel.from_pretrained("DarthReca/CLOSP-VL", trust_remote_code=True)
	```

	## Citation

	```bibtex
	@misc{cambrin2025texttoremotesensingimageretrievalrgbsources,
	title={Text-to-Remote-Sensing-Image Retrieval beyond RGB Sources},
	author={Daniele Rege Cambrin and Lorenzo Vaiani and Giuseppe Gallipoli and Luca Cagliero and Paolo Garza},
	year={2025},
	eprint={2507.10403},
	archivePrefix={arXiv},
	primaryClass={cs.CV},
	url={https://arxiv.org/abs/2507.10403},
	}
	```

	## Licensing
	The data in this dataset is a compilation of multiple sources, each with its own license. For detailed information on the licensing of each component, please see the [NOTICE.md](NOTICE.md) file.