Matthew Hendrey

mrhendrey

AI & ML interests

None yet

Recent Activity

new activity 25 days ago

neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16:Model only outputs "!!!!!!!!!!"

updated a model about 2 months ago

mrhendrey/Florence-2-large-ft-safetensors

new activity 2 months ago

microsoft/Florence-2-large:VRAM consumption when using GPU (CUDA)

Organizations

None yet

mrhendrey's activity

New activity in neuralmagic/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 25 days ago

Model only outputs "!!!!!!!!!!"

#1 opened 25 days ago by

New activity in microsoft/Florence-2-large 2 months ago

VRAM consumption when using GPU (CUDA)

#37 opened 6 months ago by

Batch: inefficient memory

#50 opened 6 months ago by

New activity in neuralmagic/Llama-3.2-90B-Vision-Instruct-FP8-dynamic 4 months ago

Any chance your team is working on a 4-bit Llama-3.2-90B-Vision-Instruct-quantized.w4a16 version?

#1 opened 4 months ago by