Language Detection with StaticVectors

This model is an export of this FastText Language Identification model for staticvectors. staticvectors enables running inference in Python with NumPy. This helps it maintain solid runtime performance.

Language detection is an important task and identification with n-gram models is an efficient and highly accurate way to do it.

This model is a quantized version of the base language id model. It's using 2x256 Product Quantization like the original quantized model from FastText. This shrinks this model down to 4MB with only a minor hit on accuracy.

Usage with StaticVectors

from staticvectors import StaticVectors

model = StaticVectors("neuml/language-id-quantized")
model.predict(["What language is this text?"])
Downloads last month
21
Safetensors
Model size
4.09M params
Tensor type
I64
·
F32
·
U8
·
Inference Examples
Inference API (serverless) has been turned off for this model.

Model tree for NeuML/language-id-quantized

Base model

NeuML/language-id
Finetuned
(1)
this model

Collection including NeuML/language-id-quantized