AncientCatz commited on
Commit
bdbb4c6
1 Parent(s): 25a94c4

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +95 -0
README.md ADDED
@@ -0,0 +1,95 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - fr
5
+ - ro
6
+ - de
7
+ - multilingual
8
+ tags:
9
+ - text2text-generation
10
+ - llama-cpp
11
+ - gguf-my-repo
12
+ widget:
13
+ - text: 'Translate to German: My name is Arthur'
14
+ example_title: Translation
15
+ - text: Please answer to the following question. Who is going to be the next Ballon
16
+ d'or?
17
+ example_title: Question Answering
18
+ - text: 'Q: Can Geoffrey Hinton have a conversation with George Washington? Give the
19
+ rationale before answering.'
20
+ example_title: Logical reasoning
21
+ - text: Please answer the following question. What is the boiling point of Nitrogen?
22
+ example_title: Scientific knowledge
23
+ - text: Answer the following yes/no question. Can you write a whole Haiku in a single
24
+ tweet?
25
+ example_title: Yes/no question
26
+ - text: Answer the following yes/no question by reasoning step-by-step. Can you write
27
+ a whole Haiku in a single tweet?
28
+ example_title: Reasoning task
29
+ - text: 'Q: ( False or not False or False ) is? A: Let''s think step by step'
30
+ example_title: Boolean Expressions
31
+ - text: The square root of x is the cube root of y. What is y to the power of 2, if
32
+ x = 4?
33
+ example_title: Math reasoning
34
+ - text: 'Premise: At my age you will probably have learnt one lesson. Hypothesis: It''s
35
+ not certain how many lessons you''ll learn by your thirties. Does the premise
36
+ entail the hypothesis?'
37
+ example_title: Premise and hypothesis
38
+ datasets:
39
+ - svakulenk0/qrecc
40
+ - taskmaster2
41
+ - djaym7/wiki_dialog
42
+ - deepmind/code_contests
43
+ - lambada
44
+ - gsm8k
45
+ - aqua_rat
46
+ - esnli
47
+ - quasc
48
+ - qed
49
+ license: apache-2.0
50
+ base_model: google/flan-t5-small
51
+ ---
52
+
53
+ # AncientCatz/flan-t5-small-Q2_K-GGUF
54
+ This model was converted to GGUF format from [`google/flan-t5-small`](https://huggingface.co/google/flan-t5-small) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
55
+ Refer to the [original model card](https://huggingface.co/google/flan-t5-small) for more details on the model.
56
+
57
+ ## Use with llama.cpp
58
+ Install llama.cpp through brew (works on Mac and Linux)
59
+
60
+ ```bash
61
+ brew install llama.cpp
62
+
63
+ ```
64
+ Invoke the llama.cpp server or the CLI.
65
+
66
+ ### CLI:
67
+ ```bash
68
+ llama-cli --hf-repo AncientCatz/flan-t5-small-Q2_K-GGUF --hf-file flan-t5-small-q2_k.gguf -p "The meaning to life and the universe is"
69
+ ```
70
+
71
+ ### Server:
72
+ ```bash
73
+ llama-server --hf-repo AncientCatz/flan-t5-small-Q2_K-GGUF --hf-file flan-t5-small-q2_k.gguf -c 2048
74
+ ```
75
+
76
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
77
+
78
+ Step 1: Clone llama.cpp from GitHub.
79
+ ```
80
+ git clone https://github.com/ggerganov/llama.cpp
81
+ ```
82
+
83
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
84
+ ```
85
+ cd llama.cpp && LLAMA_CURL=1 make
86
+ ```
87
+
88
+ Step 3: Run inference through the main binary.
89
+ ```
90
+ ./llama-cli --hf-repo AncientCatz/flan-t5-small-Q2_K-GGUF --hf-file flan-t5-small-q2_k.gguf -p "The meaning to life and the universe is"
91
+ ```
92
+ or
93
+ ```
94
+ ./llama-server --hf-repo AncientCatz/flan-t5-small-Q2_K-GGUF --hf-file flan-t5-small-q2_k.gguf -c 2048
95
+ ```