Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
numen-tech
/
Qwen2.5-3B-Instruct-GPTQ-Int4
like
0
Text Generation
MLC-LLM
English
chat
conversational
4-bit precision
arxiv:
2210.17323
License:
qwen-research
Model card
Files
Files and versions
Community
Use this model
4-bit
GPTQ
quantized version of
Qwen2.5-3B-Instruct
for use with the
Private LLM app
.
Downloads last month
0
Inference Examples
Text Generation
Inference API (serverless) does not yet support mlc-llm models for this pipeline type.
Model tree for
numen-tech/Qwen2.5-3B-Instruct-GPTQ-Int4
Base model
Qwen/Qwen2.5-3B
Finetuned
Qwen/Qwen2.5-3B-Instruct
Quantized
(
79
)
this model
Collection including
numen-tech/Qwen2.5-3B-Instruct-GPTQ-Int4
Qwen2.5
Collection
6 items
•
Updated
22 days ago