Commit a35052c by LLukas22 (1 parent: ead726c)

Generated README.md

Files changed (1)
  1. README.md +93 -1
README.md CHANGED
@@ -1,4 +1,66 @@
  ---
  tags:
  - llm-rs
  - ggml
@@ -28,4 +90,34 @@ BLOOM is an autoregressive Large Language Model (LLM), trained to continue text
  | [bloom-560m-q4_0-ggjt.bin](https://huggingface.co/rustformers/bloom-ggml/blob/main/bloom-560m-q4_0-ggjt.bin) | [bigscience/bloom-560m](https://huggingface.co/bigscience/bloom-560m) | Q4_0 | GGJT | V3 |
  | [bloom-560m-q5_1-ggjt.bin](https://huggingface.co/rustformers/bloom-ggml/blob/main/bloom-560m-q5_1-ggjt.bin) | [bigscience/bloom-560m](https://huggingface.co/bigscience/bloom-560m) | Q5_1 | GGJT | V3 |
 
- ## Usage
  ---
+ license: bigscience-bloom-rail-1.0
+ language:
+ - ak
+ - ar
+ - as
+ - bm
+ - bn
+ - ca
+ - code
+ - en
+ - es
+ - eu
+ - fon
+ - fr
+ - gu
+ - hi
+ - id
+ - ig
+ - ki
+ - kn
+ - lg
+ - ln
+ - ml
+ - mr
+ - ne
+ - nso
+ - ny
+ - or
+ - pa
+ - pt
+ - rn
+ - rw
+ - sn
+ - st
+ - sw
+ - ta
+ - te
+ - tn
+ - ts
+ - tum
+ - tw
+ - ur
+ - vi
+ - wo
+ - xh
+ - yo
+ - zh
+ - zu
+ programming_language:
+ - C
+ - C++
+ - C#
+ - Go
+ - Java
+ - JavaScript
+ - Lua
+ - PHP
+ - Python
+ - Ruby
+ - Rust
+ - Scala
+ - TypeScript
  tags:
  - llm-rs
  - ggml
 
  | [bloom-560m-q4_0-ggjt.bin](https://huggingface.co/rustformers/bloom-ggml/blob/main/bloom-560m-q4_0-ggjt.bin) | [bigscience/bloom-560m](https://huggingface.co/bigscience/bloom-560m) | Q4_0 | GGJT | V3 |
  | [bloom-560m-q5_1-ggjt.bin](https://huggingface.co/rustformers/bloom-ggml/blob/main/bloom-560m-q5_1-ggjt.bin) | [bigscience/bloom-560m](https://huggingface.co/bigscience/bloom-560m) | Q5_1 | GGJT | V3 |
 
+ ## Usage
+
+ ### Python via [llm-rs](https://github.com/LLukas22/llm-rs-python):
+
+ #### Installation
+ Via pip: `pip install llm-rs`
+
+ #### Run inference
+ ```python
+ from llm_rs import AutoModel
+
+ #Load the model, define any model you like from the list above as the `model_file`
+ model = AutoModel.from_pretrained("rustformers/bloom-ggml",model_file="bloom-3b-q4_0-ggjt.bin")
+
+ #Generate
+ print(model.generate("The meaning of life is"))
+ ```
+
+ ### Rust via [Rustformers/llm](https://github.com/rustformers/llm):
+
+ #### Installation
+ ```
+ git clone --recurse-submodules https://github.com/rustformers/llm.git
+ cd llm
+ cargo build --release
+ ```
+
+ #### Run inference
+ ```
+ cargo run --release -- bloom infer -m path/to/model.bin -p "Tell me how cool the Rust programming language is:"
+ ```
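
The added Python usage asks the reader to pick a `model_file` from the table. The file names visible in this diff (`bloom-560m-q4_0-ggjt.bin`, `bloom-560m-q5_1-ggjt.bin`, `bloom-3b-q4_0-ggjt.bin`) appear to follow a `bloom-<size>-<quantization>-<container>.bin` pattern. The sketch below assumes that pattern holds for the whole table; `model_file_name`, `model_file_url`, and `generate` are hypothetical helpers, not part of llm-rs, and only `AutoModel.from_pretrained(..., model_file=...)` and `model.generate(...)` are taken from the README itself.

```python
# Hypothetical helpers (not part of llm-rs). They assume the naming pattern
# seen in the table rows: bloom-<size>-<quantization>-<container>.bin
REPO_ID = "rustformers/bloom-ggml"


def model_file_name(size, quantization, container="ggjt"):
    """Build a file name such as 'bloom-560m-q4_0-ggjt.bin'."""
    return f"bloom-{size.lower()}-{quantization.lower()}-{container.lower()}.bin"


def model_file_url(file_name):
    """Build the repository link in the same form the table uses."""
    return f"https://huggingface.co/{REPO_ID}/blob/main/{file_name}"


def generate(prompt, size="3b", quantization="Q4_0"):
    """Load the chosen file and run generation, as in the README example."""
    # Imported lazily so the name helpers work without llm-rs installed.
    from llm_rs import AutoModel

    model = AutoModel.from_pretrained(
        REPO_ID, model_file=model_file_name(size, quantization)
    )
    # Returns whatever model.generate returns; the README prints it directly.
    return model.generate(prompt)


if __name__ == "__main__":
    name = model_file_name("560m", "Q5_1")
    print(name)
    print(model_file_url(name))
```

Not every size and quantization combination is guaranteed to exist in the repository, so it is worth checking a generated name against the table before calling `generate`.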