Edit model card

EXL2 Quantization of Undi95's's MM-ReMM-L2-20B.

Model details

Quantized at 3.18bpw with hb 6.

Perplexity:

Base = 6.9504

3.18 h6 = 7.0138

Dataset = wikitext

Prompt Format

Below is an instruction that describes a task. Write a response that appropriately completes the request.

### Instruction:
{prompt}

### Response:
Downloads last month
9
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Collection including R136a1/MM-ReMM-L2-20B-exl2