Commit ed201d0 (1 parent: e32c0f9), committed by nicholasKluge
Update README.md
README.md CHANGED
@@ -16,11 +16,11 @@ widget:
 - text: "<s>Uma mulher está misturando ovos.<s>A mulher está bebendo.</s>"
   example_title: Exemplo
 ---
-# TeenyTinyLlama-

 TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.

-This repository contains a version of [TeenyTinyLlama-

 ## Details

@@ -38,7 +38,7 @@ from transformers import pipeline

 text = "<s>Qual a capital do Brasil?<s>A capital do Brasil é Brasília!</s>"

-classifier = pipeline("text-classification", model="nicholasKluge/TeenyTinyLlama-
 classifier(text)

 # >>> [{'label': 'ENTAILED', 'score': 0.9392824769020081}]
@@ -63,13 +63,13 @@ dataset = load_dataset("assin2")

 # Create a `ModelForSequenceClassification`
 model = AutoModelForSequenceClassification.from_pretrained(
-    "nicholasKluge/TeenyTinyLlama-
     num_labels=2,
     id2label={0: "UNENTAILED", 1: "ENTAILED"},
     label2id={"UNENTAILED": 0, "ENTAILED": 1}
 )

-tokenizer = AutoTokenizer.from_pretrained("nicholasKluge/TeenyTinyLlama-

 # Format the dataset
 train = dataset['train'].to_pandas()
@@ -146,7 +146,7 @@ trainer.train()

 | Models | [Assin2](https://huggingface.co/datasets/assin2)|
 |--------------------------------------------------------------------------------------------|-------------------------------------------------|
-| [Teeny Tiny Llama
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 87.45 |
 | [Bert-large-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased)| 88.97 |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese) | 86.11 |
@@ -157,7 +157,7 @@ trainer.train()

 @misc{nicholas22llama,
   doi = {10.5281/zenodo.6989727},
-  url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-
   author = {Nicholas Kluge Corrêa},
   title = {TeenyTinyLlama},
   year = {2023},
@@ -173,4 +173,4 @@ This repository was built as part of the RAIES ([Rede de Inteligência Artificia

 ## License

-TeenyTinyLlama-
@@ -16,11 +16,11 @@ widget:
 - text: "<s>Uma mulher está misturando ovos.<s>A mulher está bebendo.</s>"
   example_title: Exemplo
 ---
+# TeenyTinyLlama-160m-Assin2

 TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.

+This repository contains a version of [TeenyTinyLlama-160m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m) (`TeenyTinyLlama-160m-Assin2`) fine-tuned on the [Assin2](https://huggingface.co/datasets/assin2).

 ## Details

@@ -38,7 +38,7 @@ from transformers import pipeline

 text = "<s>Qual a capital do Brasil?<s>A capital do Brasil é Brasília!</s>"

+classifier = pipeline("text-classification", model="nicholasKluge/TeenyTinyLlama-160m-Assin2")
 classifier(text)

 # >>> [{'label': 'ENTAILED', 'score': 0.9392824769020081}]
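
The example input in this hunk appears to join two sentences as `<s>first<s>second</s>`. A minimal sketch of a helper that builds a string in that layout follows. It assumes this layout is what the fine-tuned model expects; `format_pair` and its parameter names are hypothetical and not part of the repository.

```python
def format_pair(premise: str, hypothesis: str,
                bos: str = "<s>", eos: str = "</s>") -> str:
    """Join a sentence pair in the layout shown in the README example.

    Assumption: the first sentence is the premise and the second the
    hypothesis; the README itself does not name them.
    """
    return f"{bos}{premise.strip()}{bos}{hypothesis.strip()}{eos}"


# Rebuild the string used in the README snippet.
text = format_pair("Qual a capital do Brasil?",
                   "A capital do Brasil é Brasília!")
print(text)
```

The resulting `text` could then be passed to `classifier(text)` as in the hunk above.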
@@ -63,13 +63,13 @@ dataset = load_dataset("assin2")

 # Create a `ModelForSequenceClassification`
 model = AutoModelForSequenceClassification.from_pretrained(
+    "nicholasKluge/TeenyTinyLlama-160m",
     num_labels=2,
     id2label={0: "UNENTAILED", 1: "ENTAILED"},
     label2id={"UNENTAILED": 0, "ENTAILED": 1}
 )

+tokenizer = AutoTokenizer.from_pretrained("nicholasKluge/TeenyTinyLlama-160m")

 # Format the dataset
 train = dataset['train'].to_pandas()
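
The `id2label` and `label2id` arguments in this hunk are what turn a class index into the `ENTAILED` / `UNENTAILED` strings seen in the pipeline output. The sketch below illustrates that mapping in plain Python, assuming a two-class head whose logits are passed through a softmax. The `decode` helper and the logit values are hypothetical, chosen only for illustration.

```python
import math

# Label maps copied from the README snippet.
id2label = {0: "UNENTAILED", 1: "ENTAILED"}
label2id = {"UNENTAILED": 0, "ENTAILED": 1}

# The two maps should be inverses of each other.
assert all(label2id[label] == idx for idx, label in id2label.items())


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode(logits):
    """Map raw two-class logits to a {'label', 'score'} dict (hypothetical helper)."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"label": id2label[best], "score": probs[best]}


# Made-up logits, not produced by the model.
print(decode([-1.0, 2.0]))
```

Keeping the two dictionaries consistent matters because the label names are stored in the model config and reused when predictions are reported.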
@@ -146,7 +146,7 @@ trainer.train()

 | Models | [Assin2](https://huggingface.co/datasets/assin2)|
 |--------------------------------------------------------------------------------------------|-------------------------------------------------|
+| [Teeny Tiny Llama 160m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m) | 85.78 |
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 87.45 |
 | [Bert-large-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased)| 88.97 |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese) | 86.11 |
@@ -157,7 +157,7 @@ trainer.train()

 @misc{nicholas22llama,
   doi = {10.5281/zenodo.6989727},
+  url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-160m},
   author = {Nicholas Kluge Corrêa},
   title = {TeenyTinyLlama},
   year = {2023},
@@ -173,4 +173,4 @@ This repository was built as part of the RAIES ([Rede de Inteligência Artificia

 ## License

+TeenyTinyLlama-160m-Assin2 is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.