Exllama v2 Quantizations of Tess-v2.5.2-Qwen2-72B
Using turboderp's ExLlamaV2 v0.0.21 for quantization.
Original model: https://huggingface.co/migtissera/Tess-v2.5.2-Qwen2-72B
- Downloads last month
- 17
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.