Denis Kuznedelev
SpiridonSunRotator
AI & ML interests
Model compression, computer vision, NLP
Recent Activity
updated
a model
4 days ago
daslab-testing/DeepSeek-R1-GPTQ-1_58b-128g-experts-decompressed
published
a model
4 days ago
daslab-testing/DeepSeek-R1-GPTQ-1_58b-128g-experts-decompressed
new activity
4 days ago
quickjkee/swd_pipeline:Argument and dtype fix
Organizations
SpiridonSunRotator's activity
Argument and dtype fix
#1 opened 4 days ago
by
SpiridonSunRotator

VLLM launch command?
2
#1 opened 7 days ago
by
nfunctor
Fixed typo
1
#29 opened 8 days ago
by
SpiridonSunRotator

ERROR:Gemma3Config' object has no attribute 'vocab_size'
2
#17 opened 14 days ago
by
nexyi
awq version
3
#20 opened 14 days ago
by
rastegar
Updated Model card
#1 opened 4 months ago
by
SpiridonSunRotator

size mismatch for model.layers when loading with from pretrained
8
#35 opened 8 months ago
by
C-Felix
70B Version?
1
#1 opened 11 months ago
by
amrothemich
Fast Mamba kernels are not available
10
#16 opened 12 months ago
by
MohamedRashad

Update README.md
1
#2 opened about 1 year ago
by
SpiridonSunRotator

Updated model description and added evaluation metrics.
1
#1 opened about 1 year ago
by
SpiridonSunRotator

Updated description and the metrics
#1 opened about 1 year ago
by
SpiridonSunRotator

Update README.md
#6 opened about 1 year ago
by
SpiridonSunRotator

Updated 13b model.
#1 opened about 1 year ago
by
SpiridonSunRotator

Update README.md
#1 opened about 1 year ago
by
SpiridonSunRotator

Can use with v100 gpu?
4
#1 opened about 1 year ago
by
jesulo
Upload finetuned Llama-2-7b models
1
#1 opened about 1 year ago
by
SpiridonSunRotator

Upload finetuned Llama-2-7b models
1
#1 opened about 1 year ago
by
SpiridonSunRotator

Update README.md
#1 opened about 1 year ago
by
SpiridonSunRotator
