PuxAI commited on
Commit
0baf28b
1 Parent(s): 04c02e8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +162 -0
README.md ADDED
@@ -0,0 +1,162 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - vi
5
+ license: apache-2.0
6
+ library_name: transformers
7
+ tags:
8
+ - text-generation-inference
9
+ - transformers
10
+ - unsloth
11
+ - mistral
12
+ - trl
13
+ - mergekit
14
+ - llama-cpp
15
+ - gguf-my-repo
16
+ datasets:
17
+ - 1TuanPham/Vietnamese-magpie-ultra-v0.1
18
+ - 1TuanPham/KTO-mix-14k-vietnamese-groq
19
+ - 1TuanPham/T-VisStar-finalphase
20
+ - 1TuanPham/T-VisStar-dataset-uncensored
21
+ pipeline_tag: text-generation
22
+ base_model: 1TuanPham/T-VisStar-7B-v0.1
23
+ model-index:
24
+ - name: T-VisStar-v0.1
25
+ results:
26
+ - task:
27
+ type: text-generation
28
+ name: Text Generation
29
+ dataset:
30
+ name: IFEval (0-Shot)
31
+ type: HuggingFaceH4/ifeval
32
+ args:
33
+ num_few_shot: 0
34
+ metrics:
35
+ - type: inst_level_strict_acc and prompt_level_strict_acc
36
+ value: 36.07
37
+ name: strict accuracy
38
+ source:
39
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
40
+ name: Open LLM Leaderboard
41
+ - task:
42
+ type: text-generation
43
+ name: Text Generation
44
+ dataset:
45
+ name: BBH (3-Shot)
46
+ type: BBH
47
+ args:
48
+ num_few_shot: 3
49
+ metrics:
50
+ - type: acc_norm
51
+ value: 30.24
52
+ name: normalized accuracy
53
+ source:
54
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
55
+ name: Open LLM Leaderboard
56
+ - task:
57
+ type: text-generation
58
+ name: Text Generation
59
+ dataset:
60
+ name: MATH Lvl 5 (4-Shot)
61
+ type: hendrycks/competition_math
62
+ args:
63
+ num_few_shot: 4
64
+ metrics:
65
+ - type: exact_match
66
+ value: 4.53
67
+ name: exact match
68
+ source:
69
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
70
+ name: Open LLM Leaderboard
71
+ - task:
72
+ type: text-generation
73
+ name: Text Generation
74
+ dataset:
75
+ name: GPQA (0-shot)
76
+ type: Idavidrein/gpqa
77
+ args:
78
+ num_few_shot: 0
79
+ metrics:
80
+ - type: acc_norm
81
+ value: 4.7
82
+ name: acc_norm
83
+ source:
84
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
85
+ name: Open LLM Leaderboard
86
+ - task:
87
+ type: text-generation
88
+ name: Text Generation
89
+ dataset:
90
+ name: MuSR (0-shot)
91
+ type: TAUR-Lab/MuSR
92
+ args:
93
+ num_few_shot: 0
94
+ metrics:
95
+ - type: acc_norm
96
+ value: 13.55
97
+ name: acc_norm
98
+ source:
99
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
100
+ name: Open LLM Leaderboard
101
+ - task:
102
+ type: text-generation
103
+ name: Text Generation
104
+ dataset:
105
+ name: MMLU-PRO (5-shot)
106
+ type: TIGER-Lab/MMLU-Pro
107
+ config: main
108
+ split: test
109
+ args:
110
+ num_few_shot: 5
111
+ metrics:
112
+ - type: acc
113
+ value: 24.56
114
+ name: accuracy
115
+ source:
116
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=1TuanPham/T-VisStar-v0.1
117
+ name: Open LLM Leaderboard
118
+ ---
119
+
120
+ # PuxAI/T-VisStar-7B-v0.1-Q3_K_M-GGUF
121
+ This model was converted to GGUF format from [`1TuanPham/T-VisStar-7B-v0.1`](https://huggingface.co/1TuanPham/T-VisStar-7B-v0.1) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
122
+ Refer to the [original model card](https://huggingface.co/1TuanPham/T-VisStar-7B-v0.1) for more details on the model.
123
+
124
+ ## Use with llama.cpp
125
+ Install llama.cpp through brew (works on Mac and Linux)
126
+
127
+ ```bash
128
+ brew install llama.cpp
129
+
130
+ ```
131
+ Invoke the llama.cpp server or the CLI.
132
+
133
+ ### CLI:
134
+ ```bash
135
+ llama-cli --hf-repo PuxAI/T-VisStar-7B-v0.1-Q3_K_M-GGUF --hf-file t-visstar-7b-v0.1-q3_k_m.gguf -p "The meaning to life and the universe is"
136
+ ```
137
+
138
+ ### Server:
139
+ ```bash
140
+ llama-server --hf-repo PuxAI/T-VisStar-7B-v0.1-Q3_K_M-GGUF --hf-file t-visstar-7b-v0.1-q3_k_m.gguf -c 2048
141
+ ```
142
+
143
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
144
+
145
+ Step 1: Clone llama.cpp from GitHub.
146
+ ```
147
+ git clone https://github.com/ggerganov/llama.cpp
148
+ ```
149
+
150
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
151
+ ```
152
+ cd llama.cpp && LLAMA_CURL=1 make
153
+ ```
154
+
155
+ Step 3: Run inference through the main binary.
156
+ ```
157
+ ./llama-cli --hf-repo PuxAI/T-VisStar-7B-v0.1-Q3_K_M-GGUF --hf-file t-visstar-7b-v0.1-q3_k_m.gguf -p "The meaning to life and the universe is"
158
+ ```
159
+ or
160
+ ```
161
+ ./llama-server --hf-repo PuxAI/T-VisStar-7B-v0.1-Q3_K_M-GGUF --hf-file t-visstar-7b-v0.1-q3_k_m.gguf -c 2048
162
+ ```