---
tags:
- image-classification
- timm
library_name: timm
license: apache-2.0
datasets:
- imagenet-1k
metrics:
- accuracy
---

# Model card for hpx_former_b36

`hpx_former_b36` is part of the HyenaPixel model family proposed in the paper ["HyenaPixel: Global Image Context with Convolutions"](https://arxiv.org/abs/2402.19305).
HyenaPixel replaces attention with large convolutions by extending Hyena ([Paper](https://arxiv.org/abs/2302.10866) and [GitHub](https://github.com/HazyResearch/safari/)) to support bidirectional, two-dimensional input.
The operator is integrated into the MetaFormer ([Paper](https://arxiv.org/abs/2210.13452) and [GitHub](https://github.com/sail-sg/metaformer)) framework.

The official PyTorch implementation of HyenaPixel can be found on [GitHub](https://github.com/spravil/HyenaPixel).

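
To give a feel for what "global image context with convolutions" means, here is a toy sketch in plain Python. It is not the HyenaPixel operator: it is a zero-padded 2D convolution whose kernel is assumed to span `(2H-1) x (2W-1)` positions, so every output pixel can see every input pixel in all directions (non-causal). It omits Hyena's gating and learned filter parameterization, and the function name `global_conv2d` is purely illustrative.

```python
def global_conv2d(image, kernel):
    """Toy non-causal 2D convolution with a kernel that covers the whole image.

    image:  H x W nested list of floats
    kernel: (2H-1) x (2W-1) nested list; its center entry weights the pixel itself
    """
    height, width = len(image), len(image[0])
    if len(kernel) != 2 * height - 1 or len(kernel[0]) != 2 * width - 1:
        raise ValueError("kernel must have shape (2H-1) x (2W-1)")

    center_y, center_x = height - 1, width - 1
    output = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            total = 0.0
            for v in range(height):
                for u in range(width):
                    # The kernel is indexed by the offset from output (y, x) to input (v, u),
                    # so offsets in every direction (up, down, left, right) contribute.
                    total += kernel[center_y + v - y][center_x + u - x] * image[v][u]
            output[y][x] = total
    return output


# A single bright pixel in the bottom-right corner of a 3x3 image ...
image = [[0.0] * 3 for _ in range(3)]
image[2][2] = 1.0
# ... and a 5x5 all-ones kernel, which is large enough to cover the whole image.
kernel = [[1.0] * 5 for _ in range(5)]

response = global_conv2d(image, kernel)  # every entry is 1.0
```

Because the kernel covers the full image, the corner pixel influences every output position, including those above and to the left of it. For the operator actually used by the model, refer to the paper and the official repository.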
## Models

| Model              | Resolution | Params | Top-1 Acc. (%) |                                   Download                                   |
| :----------------- | :--------: | :----: | :------------: | :--------------------------------------------------------------------------: |
| hpx_former_s18     |    224     |  29M   |      83.2      |   [HuggingFace](https://huggingface.co/Spravil/hpx_former_s18.westai_in1k)   |
| hpx_former_s18_384 |    384     |  29M   |      84.7      | [HuggingFace](https://huggingface.co/Spravil/hpx_former_s18.westai_in1k_384) |
| hb_former_s18      |    224     |  28M   |      83.5      |   [HuggingFace](https://huggingface.co/Spravil/hb_former_s18.westai_in1k)    |
| c_hpx_former_s18   |    224     |  28M   |      83.0      |  [HuggingFace](https://huggingface.co/Spravil/c_hpx_former_s18.westai_in1k)  |
| hpx_a_former_s18   |    224     |  28M   |      83.6      |  [HuggingFace](https://huggingface.co/Spravil/hpx_a_former_s18.westai_in1k)  |
| hb_a_former_s18    |    224     |  27M   |      83.2      |  [HuggingFace](https://huggingface.co/Spravil/hb_a_former_s18.westai_in1k)   |
| hpx_former_b36     |    224     |  111M  |      84.9      |   [HuggingFace](https://huggingface.co/Spravil/hpx_former_b36.westai_in1k)   |
| hb_former_b36      |    224     |  102M  |      85.2      |   [HuggingFace](https://huggingface.co/Spravil/hb_former_b36.westai_in1k)    |

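## Usage

Install the HyenaPixel package from GitHub:

```bash
pip install git+https://github.com/spravil/HyenaPixel.git
```

Then create the model with `timm`:

```python
import timm

# Imported for its side effect: makes the HyenaPixel model names available to timm.
import hyenapixel.models

# Build the architecture and load the pretrained ImageNet-1k weights.
model = timm.create_model("hpx_former_b36", pretrained=True)
```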
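
Once the model is created, classifying a single image could look like the sketch below. It assumes `torch` and `Pillow` are installed and that timm can resolve the preprocessing settings from the loaded model via `timm.data.resolve_model_data_config`. The helper names `softmax`, `top_k`, and `classify` are illustrative and not part of HyenaPixel or timm.

```python
import math


def softmax(logits):
    """Convert a list of raw class scores into probabilities."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k(probs, k=5):
    """Return the k most likely (class_index, probability) pairs, best first."""
    ranked = sorted(enumerate(probs), key=lambda pair: pair[1], reverse=True)
    return ranked[:k]


def classify(image_path, model_name="hpx_former_b36", k=5):
    """Classify one image file and return its top-k ImageNet-1k class indices."""
    # Heavy dependencies are imported here so the helpers above work without them.
    import timm
    import torch
    from PIL import Image

    import hyenapixel.models  # noqa: F401  (makes the model names available to timm)

    model = timm.create_model(model_name, pretrained=True).eval()

    # Assumption: timm can derive input size, normalization, and cropping from the model.
    data_config = timm.data.resolve_model_data_config(model)
    transform = timm.data.create_transform(**data_config, is_training=False)

    image = Image.open(image_path).convert("RGB")
    with torch.no_grad():
        logits = model(transform(image).unsqueeze(0))[0]  # add, then drop, the batch axis
    return top_k(softmax(logits.tolist()), k)
```

Calling `classify("example.jpg")`, where the path is a placeholder for your own image, returns a list of `(class_index, probability)` pairs. Mapping the indices to label names requires an ImageNet-1k class list, which is not shown here.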

## BibTeX

```bibtex
@article{spravil2024hyenapixel,
  title={HyenaPixel: Global Image Context with Convolutions},
  author={Julian Spravil and Sebastian Houben and Sven Behnke},
  journal={arXiv preprint arXiv:2402.19305},
  year={2024},
}
```