Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,12 @@
|
|
1 |
-
---
|
2 |
-
license: mit
|
3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
---
|
4 |
+
|
5 |
+
# akhooli/arabic-colbertv2-711k-norm
|
6 |
+
This is a ColBERT V2 model trained on Arabic mMARCO dataset after removing queries with Latin words (711K queries).
|
7 |
+
It is not fully trained, but is good for many tasks especially ranking.
|
8 |
+
The dataset was normalized before training, so please normalize your query and docs before using it.
|
9 |
+
```python
|
10 |
+
from unicodedata import normalize
|
11 |
+
query_n = normalize('NFKC', query)
|
12 |
+
```
|