Commit 81179f4 by andrijdavid (parent: ecc8bf9): Update README.md

README.md CHANGED
@@ -29,12 +29,14 @@ Solidity Llama 3 is a Large Language Model specifically designed for Solidity co
 ### Direct Use
 
 Solidity Llama 3 can be used for code completion and infilling tasks within Solidity code editors. It was trained for this task using the fill-in-the-middle (FIM) objective, where you provide a prefix and a suffix as context for the completion. The following tokens are used to separate the different parts of the input:
-
-
-
+- <|reserved_special_token_11|> precedes the context before the completion we want to run.
+- <|reserved_special_token_10|> precedes the suffix. You must put this token exactly where the cursor would be positioned in an editor, as this is the location that will be completed by the model.
+- <|reserved_special_token_12|> is the prompt that invites the model to run the generation.
 
 
 ```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+
 FIM_SUFFIX = "<|reserved_special_token_10|>"
 FIM_PREFIX = "<|reserved_special_token_11|>"
 FIM_MIDDLE = "<|reserved_special_token_12|>"
@@ -67,6 +69,22 @@ print(tokenizer.decode(outputs[0][prompt_len:]))
 
 ```
 
+You can provide a list of terminators to the generate function, like this:
+
+```python
+
+terminators = tokenizer.convert_tokens_to_ids([FIM_PREFIX, FIM_MIDDLE, FIM_SUFFIX])
+terminators += [tokenizer.eos_token_id]
+
+outputs = model.generate(
+    **inputs,
+    max_new_tokens=1024,
+    eos_token_id=terminators,
+)
+print(tokenizer.decode(outputs[0][prompt_len:]))
+```
+
 ### Out-of-Scope Use
 
 The model may not perform well for tasks outside of Solidity code completion and infilling, and users should be aware of its limitations in these areas.
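For reference, the FIM layout that the changed section describes (prefix token, prefix, suffix token at the cursor, suffix, middle token at the end) can be sketched as plain string assembly. The sentinel strings below are the ones defined in the README; the `build_fim_prompt` helper and the Solidity snippet are hypothetical illustrations, not part of the model card:

```python
# FIM sentinel tokens as defined in the README
FIM_SUFFIX = "<|reserved_special_token_10|>"
FIM_PREFIX = "<|reserved_special_token_11|>"
FIM_MIDDLE = "<|reserved_special_token_12|>"

def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Assemble a fill-in-the-middle prompt: the prefix context comes
    first, the suffix token marks where the editor cursor sits, the
    suffix follows, and the middle token at the end invites the model
    to generate the missing span."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"

# Hypothetical Solidity snippet split at the editor cursor
prefix = (
    "contract Counter {\n"
    "    uint256 public count;\n"
    "    function increment() public {\n"
)
suffix = "\n    }\n}"
prompt = build_fim_prompt(prefix, suffix)
```

Under this layout, the text the model generates is the code that belongs at the cursor position, between the given prefix and suffix.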