ferran-espuna committed · Commit 70e5ace · Parent(s): f1c1dc8
Update README.md

README.md CHANGED
@@ -61,8 +61,25 @@ This model card corresponds to the gptq-quantized version of Salamandra-2b for s
The entire Salamandra family is released under a permissive [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).

## How to Use

The following example code works under ``Python 3.9.16``, ``vllm==0.6.3.post1``, ``torch==2.4.0`` and ``torchvision==0.19.0``, though it should run on any current version of the libraries. This is an example of how to create a text completion using the model:

```python
from vllm import LLM, SamplingParams

# FP8-quantized Salamandra-2b base model on the Hugging Face Hub
model_name = "BSC-LT/salamandra-2b-base-fp8"
llm = LLM(model=model_name)

# Complete a Catalan prompt ("El mercat del barri " = "The neighbourhood market ")
outputs = llm.generate("El mercat del barri ",
                       sampling_params=SamplingParams(
                           temperature=0.5,
                           max_tokens=200))
print(outputs[0].outputs[0].text)
```
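
For batched use, vLLM's `generate` also accepts a list of prompts and returns one result per prompt. The snippet below is a minimal sketch assuming the same `llm` object from the example above; the prompts are illustrative placeholders:

```python
# Sketch: batched generation with the `llm` object created above.
# The prompts below are illustrative placeholders, not from the model card.
prompts = [
    "El mercat del barri ",
    "La plaça del poble ",
]
params = SamplingParams(temperature=0.5, max_tokens=200)

# vLLM returns one RequestOutput per prompt, in the same order.
outputs = llm.generate(prompts, sampling_params=params)
for out in outputs:
    print(out.prompt, "->", out.outputs[0].text)
```
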
### Author
International Business Machines (IBM).