Spravil
/

hpx_former_b36.westai_in1k

Image Classification

Model card Files Files and versions Community

hpx_former_b36.westai_in1k / README.md

Spravil's picture

Update README.md

e4e1826 verified 5 months ago

|

2.63 kB

	---
	tags:
	- image-classification
	- timm
	library_name: timm
	license: apache-2.0
	datasets:
	- imagenet-1k
	metrics:
	- accuracy
	---
	# Model card for hpx_former_b36

	The model hpx_former_b36 is part of the HyenaPixel model family proposed in the paper ["HyenaPixel: Global Image Context with Convolutions"](https://arxiv.org/abs/2402.19305).
	HyenaPixel uses large convolutions as an attention replacement by extending Hyena ([Paper](https://arxiv.org/abs/2302.10866) and [GitHub](https://github.com/HazyResearch/safari/)) to support bidirectrional and two-dimensional input.
	The operator is integrated in the MetaFormer ([Paper](https://arxiv.org/abs/2210.13452) and [GitHub](https://github.com/sail-sg/metaformer)) framework.

	The official PyTorch implementation of HyenaPixel can be found on [GitHub](https://github.com/spravil/HyenaPixel).

	## Models

	\| Model \| Resolution \| Params \| Top1 Acc \| Download \|
	\| :----------------- \| :--------: \| :----: \| :------: \| :--------------------------------------------------------------------------: \|
	\| hpx_former_s18 \| 224 \| 29M \| 83.2 \| [HuggingFace](https://huggingface.co/Spravil/hpx_former_s18.westai_in1k) \|
	\| hpx_former_s18_384 \| 384 \| 29M \| 84.7 \| [HuggingFace](https://huggingface.co/Spravil/hpx_former_s18.westai_in1k_384) \|
	\| hb_former_s18 \| 224 \| 28M \| 83.5 \| [HuggingFace](https://huggingface.co/Spravil/hb_former_s18.westai_in1k) \|
	\| c_hpx_former_s18 \| 224 \| 28M \| 83.0 \| [HuggingFace](https://huggingface.co/Spravil/c_hpx_former_s18.westai_in1k) \|
	\| hpx_a_former_s18 \| 224 \| 28M \| 83.6 \| [HuggingFace](https://huggingface.co/Spravil/hpx_a_former_s18.westai_in1k) \|
	\| hb_a_former_s18 \| 224 \| 27M \| 83.2 \| [HuggingFace](https://huggingface.co/Spravil/hb_a_former_s18.westai_in1k) \|
	\| hpx_former_b36 \| 224 \| 111M \| 84.9 \| [HuggingFace](https://huggingface.co/Spravil/hpx_former_b36.westai_in1k) \|
	\| hb_former_b36 \| 224 \| 102M \| 85.2 \| [HuggingFace](https://huggingface.co/Spravil/hb_former_b36.westai_in1k) \|

	## Usage

	```
	pip install git+https://github.com/spravil/HyenaPixel.git
	```

	```python
	import timm
	import hyenapixel.models

	model = timm.create_model("hpx_former_b36", pretrained=True)
	```

	# Bibtex

	```
	@article{spravil2024hyenapixel,
	title={HyenaPixel: Global Image Context with Convolutions},
	author={Julian Spravil and Sebastian Houben and Sven Behnke},
	journal={arXiv preprint arXiv:2402.19305},
	year={2024},
	}
	```