basilepp19
/

bloom-1b7_it

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

basilepp19 commited on Oct 5, 2023

Commit

2a5cbf2

•

1 Parent(s): fe48a35

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -20,6 +20,8 @@ To produce a valuable model, we follow the same procedure proposed in: https://a
 We use default script parameters and select a sample of 100,000 examples in the Italian language. We decided to sample data from the Filtered Oscar Dataset for
 the Italian Language released by Sarti.
 - **Developed by:** Pierpaolo Basile, Pierluigi Cassotti, Marco Polignano, Lucia Siciliani, Giovanni Semeraro. Department of Computer Science, University of Bari Aldo Moro, Italy
 - **Model type:** BLOOM
 - **Language(s) (NLP):** Italian

 We use default script parameters and select a sample of 100,000 examples in the Italian language. We decided to sample data from the Filtered Oscar Dataset for
 the Italian Language released by Sarti.
+**It is important to underline that when you use the adapted LLM is necessary to use the tokenizer of the adapted model.**
 - **Developed by:** Pierpaolo Basile, Pierluigi Cassotti, Marco Polignano, Lucia Siciliani, Giovanni Semeraro. Department of Computer Science, University of Bari Aldo Moro, Italy
 - **Model type:** BLOOM
 - **Language(s) (NLP):** Italian