INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Paper
•
2411.19799
•
Published
•
10
Knowledge Distillation, Pruning, Quantization, KV Cache Compression, Latency, Inference Speed