Update README.md
README.md
CHANGED
@@ -8,53 +8,49 @@ tags:
 ---
 # Model Catalogue
 
-
-- Contains many boutique AI models
-- Still a work in progress
+[This repository](https://huggingface.co/agentlans) contains a collection of boutique AI models and is organized as follows:
 
-## Pretrained
+## Pretrained Base Models for Text Embedding
+
+### English Models
+These models were finetuned on a subset of [Zyphra/Zyda-2](https://huggingface.co/datasets/Zyphra/Zyda-2):
 
-English models were finetuned on a subset of [Zyphra/Zyda-2](https://huggingface.co/datasets/Zyphra/Zyda-2).
 - [snowflake-arctic-embed-xs-zyda-2](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-zyda-2)
 - [deberta-v3-xsmall-zyda-2](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2)
 - [deberta-v3-base-zyda-2](https://huggingface.co/agentlans/deberta-v3-base-zyda-2)
 
-Multilingual
+### Multilingual Models
+These models were aligned using [agentlans/en-translations](https://huggingface.co/datasets/agentlans/en-translations):
+
 - [multilingual-e5-small-aligned](https://huggingface.co/agentlans/multilingual-e5-small-aligned)
 - [distilbert-base-multilingual-cased-aligned](https://huggingface.co/agentlans/distilbert-base-multilingual-cased-aligned)
 
-## Text
+## Text Statistics Models
 
-
-- **Output:** number
+These models take text as input and output a number.
 
-
-
-| deberta-v3-xsmall-zyda-2
-| deberta-v3-base-zyda-2
+| Base Model | Language | Quality | Readability | Sentiment |
+|------------|----------|---------|-------------|-----------|
+| deberta-v3-xsmall-zyda-2 | English | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-quality) | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-readability) | [Link](https://huggingface.co/agentlans/deberta-v3-xsmall-zyda-2-sentiment) |
+| deberta-v3-base-zyda-2 | English | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-quality) | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-readability) | [Link](https://huggingface.co/agentlans/deberta-v3-base-zyda-2-sentiment) |
 | multilingual-e5-small-aligned | Multilingual | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-quality) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-readability) | [Link](https://huggingface.co/agentlans/multilingual-e5-small-aligned-sentiment) |
-| mdeberta-v3-base
+| mdeberta-v3-base | Multilingual | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-quality) | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-readability) | [Link](https://huggingface.co/agentlans/mdeberta-v3-base-sentiment) |
 
-Note
+**Note:** The `mdeberta-v3-base` models were trained on a previous version of the dataset, not the complete dataset.
 
-## Small
+## Small Text-to-Text Models (English Only)
 
-
-- **Output:** text
+These models take text as input and produce text as output.
 
-
-
-| **Task** | **Model** | **Dataset** |
-|:------------------:|:---------------------------------------------------------------------------------:|:------------------------------------------------------------------------------------------------------:|
+| Task | Model | Dataset |
+|------|-------|---------|
 | Keyword extraction | [flan-t5-small-keywords](https://huggingface.co/agentlans/flan-t5-small-keywords) | [wikipedia-paragraph-keywords](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-keywords) |
-| Title generation
-
-## Natural language inference (NLI) models
+| Title generation | [flan-t5-small-title](https://huggingface.co/agentlans/flan-t5-small-title) | [wikipedia-paragraph-titles](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-titles) |
+| Summarization | [text-summarization](https://huggingface.co/agentlans/text-summarization) | [wikipedia-paragraph-summaries](https://huggingface.co/datasets/agentlans/wikipedia-paragraph-summaries) |
 
-
-- **Output:** label (entailment, neutral, or contradiction)
+## Natural Language Inference (NLI) Models (English Only)
 
-These are English only.
+These models take text as input and output a label (entailment, neutral, or contradiction).
 
 - [all-MiniLM-L6-v2-nli](https://huggingface.co/agentlans/all-MiniLM-L6-v2-nli)
 - [bge-small-en-v1.5-nli](https://huggingface.co/agentlans/bge-small-en-v1.5-nli)
@@ -63,4 +59,4 @@ These are English only.
 - [NoInstruct-small-Embedding-v0-nli](https://huggingface.co/agentlans/NoInstruct-small-Embedding-v0-nli)
 - [snowflake-arctic-embed-s-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-s-nli)
 - [snowflake-arctic-embed-xs-nli](https://huggingface.co/agentlans/snowflake-arctic-embed-xs-nli)
-- [TinyBERT_General_4L_312D-nli](https://huggingface.co/agentlans/TinyBERT_General_4L_312D-nli)
+- [TinyBERT_General_4L_312D-nli](https://huggingface.co/agentlans/TinyBERT_General_4L_312D-nli)
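Every link in the Text Statistics Models table added by this change appears to follow one naming pattern: the base model name, a hyphen, then the task (`quality`, `readability`, or `sentiment`), all under the `agentlans` namespace. The sketch below rebuilds those URLs from that observed pattern. The helper names `statistics_model_url` and `statistics_table` are hypothetical and not part of the README, and the pattern is inferred from the twelve listed links rather than documented by the author.

```python
# Minimal sketch: regenerate the Text Statistics Models links from the
# naming pattern observed in the README table. All helper names here are
# illustrative assumptions, not something the repository provides.

BASE_URL = "https://huggingface.co/agentlans"  # namespace used by every link in the table

# Base models and tasks exactly as they appear in the table.
BASE_MODELS = [
    "deberta-v3-xsmall-zyda-2",
    "deberta-v3-base-zyda-2",
    "multilingual-e5-small-aligned",
    "mdeberta-v3-base",
]
TASKS = ["quality", "readability", "sentiment"]


def statistics_model_url(base_model: str, task: str) -> str:
    """Return the URL for one table cell, e.g. base model + '-quality'."""
    if base_model not in BASE_MODELS:
        raise ValueError(f"unknown base model: {base_model!r}")
    if task not in TASKS:
        raise ValueError(f"unknown task: {task!r}")
    return f"{BASE_URL}/{base_model}-{task}"


def statistics_table() -> dict:
    """Map each base model to a {task: url} dictionary, mirroring the table rows."""
    return {
        base: {task: statistics_model_url(base, task) for task in TASKS}
        for base in BASE_MODELS
    }


if __name__ == "__main__":
    # Print one line per table cell.
    for base, links in statistics_table().items():
        for task, url in links.items():
            print(f"{base:32s} {task:12s} {url}")
```

The last two path segments of each URL (for example `agentlans/mdeberta-v3-base-sentiment`) form the repository id on the Hugging Face Hub. How each model is meant to be loaded is not stated in the README, so check the individual model cards before relying on any particular loader.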