Matthew Hendrey
mrhendrey
AI & ML interests
None yet
Recent Activity
new activity
25 days ago
neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16:Model only outputs "!!!!!!!!!!"
updated
a model
about 2 months ago
mrhendrey/Florence-2-large-ft-safetensors
new activity
2 months ago
microsoft/Florence-2-large:VRAM consumption when using GPU (CUDA)
Organizations
None yet
mrhendrey's activity
Model only outputs "!!!!!!!!!!"
1
#1 opened 25 days ago
by
mrhendrey
VRAM consumption when using GPU (CUDA)
3
#37 opened 6 months ago
by
Sunjay353
Batch: inefficient memory
1
#50 opened 6 months ago
by
SinanAkkoyun
Any chance your team is working on a 4-bit Llama-3.2-90B-Vision-Instruct-quantized.w4a16 version?
1
#1 opened 4 months ago
by
mrhendrey