File size: 1,115 Bytes
4cf0f1e
 
 
 
 
 
 
 
 
 
 
 
 
 
1c2a8f1
4cf0f1e
 
 
 
 
 
 
 
 
 
568531a
4cf0f1e
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
---
tags:
  - text-classification
  - language-identification
inference: false
license: cc-by-sa-3.0
language: multilingual
library_name: staticvectors
base_model:
  - NeuML/language-id
---

# Language Detection with StaticVectors

This model is an export of this [FastText Language Identification model](https://fasttext.cc/docs/en/language-identification.html) for [`staticvectors`](https://github.com/neuml/staticvectors). `staticvectors` enables running inference in Python with NumPy. This helps it maintain solid runtime performance.

Language detection is an important task and identification with n-gram models is an efficient and highly accurate way to do it.

_This model is a quantized version of the [base language id model](https://hf.co/neuml/language-id). It's using 2x256 Product Quantization like the original quantized model from FastText. This shrinks this model down to 4MB with only a minor hit on accuracy._

## Usage with StaticVectors

```python
from staticvectors import StaticVectors

model = StaticVectors("neuml/language-id-quantized")
model.predict(["What language is this text?"])
```