Update README.md
README.md
CHANGED
@@ -101,4 +101,11 @@ This is `BEE-spoke-data/smol_llama-220M-GQA` fine-tuned for code generation on:
 
 - filtered version of stack-smol-XL
 - deduped version of 'algebraic stack' from proof-pile-2
-- cleaned and deduped pypi (last dataset)
+- cleaned and deduped pypi (last dataset)
+
+This model (and the base model) were both trained using ctx length 2048.
+
+Example script for inference testing: [here](https://gist.github.com/pszemraj/c7738f664a64b935a558974d23a7aa8c)
+
+
+---
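The README line added in this change states a training context length of 2048. A minimal sketch of how a caller might keep the prompt plus the generated tokens inside that window is shown here. It assumes the prompt is already available as a plain list of token IDs. The helper name `fit_to_context` and the choice to drop the oldest tokens are hypothetical illustrations, not taken from the linked gist or from the model card.

```python
CTX_LENGTH = 2048  # training context length stated in the README


def fit_to_context(prompt_ids, max_new_tokens, ctx_length=CTX_LENGTH):
    """Trim prompt_ids so that prompt + generated tokens fit in ctx_length.

    Hypothetical helper: keeps the most recent tokens (left truncation),
    which is an assumption that suits code completion, where the text
    nearest the cursor usually matters most.
    """
    if max_new_tokens < 0:
        raise ValueError("max_new_tokens must be non-negative")
    if max_new_tokens >= ctx_length:
        raise ValueError("max_new_tokens must be smaller than ctx_length")

    # Number of prompt tokens that still leaves room for generation.
    budget = ctx_length - max_new_tokens

    # A shorter prompt is returned unchanged; a longer one loses its oldest tokens.
    return list(prompt_ids[-budget:])


if __name__ == "__main__":
    fake_prompt = list(range(3000))  # stand-in for real token IDs
    trimmed = fit_to_context(fake_prompt, max_new_tokens=128)
    print(len(trimmed))
```

With a real tokenizer, the trimmed IDs would be passed to generation along with the same `max_new_tokens` value, so the total sequence stays within the 2048-token window.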