Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spiral-AI
/
Spiral-RetNet-3b-base
like
5
Follow
Spiral-AI
27
Text Generation
Transformers
Safetensors
Japanese
English
retnet
Inference Endpoints
arxiv:
2307.08621
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
9cae16e
Spiral-RetNet-3b-base
1 contributor
History:
10 commits
ksterx
Update README.md
9cae16e
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
Safe
4.75 kB
Update README.md
10 months ago
config.json
Safe
943 Bytes
Upload RetNetForCausalLM
10 months ago
generation_config.json
Safe
111 Bytes
Upload RetNetForCausalLM
10 months ago
logo.jpg
Safe
121 kB
Upload logo.jpg
10 months ago
logo.png
Safe
25.6 kB
Upload 2 files
10 months ago
loss_comparison.png
Safe
108 kB
Upload 3 files
10 months ago
model-00001-of-00003.safetensors
Safe
4.99 GB
LFS
Upload RetNetForCausalLM
10 months ago
model-00002-of-00003.safetensors
Safe
4.96 GB
LFS
Upload RetNetForCausalLM
10 months ago
model-00003-of-00003.safetensors
Safe
1.47 GB
LFS
Upload RetNetForCausalLM
10 months ago
model.safetensors.index.json
Safe
26.3 kB
Upload RetNetForCausalLM
10 months ago
retention.gif
Safe
82.1 kB
Upload 3 files
10 months ago
special_tokens_map.json
Safe
585 Bytes
Upload tokenizer
10 months ago
tokenizer.json
Safe
3.27 MB
Upload tokenizer
10 months ago
tokenizer_config.json
Safe
674 Bytes
Upload tokenizer
10 months ago