GGUF
Inference Endpoints
jan-hq commited on
Commit
89c334b
·
verified ·
1 Parent(s): 526eade

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ ## Overview
6
+
7
+ The Mixtral-8x7B Large Language Model (LLM) is a pretrained generative Sparse Mixture of Experts. The Mistral-8x7B outperforms Llama 2 70B on most benchmarks we tested.
8
+
9
+ ## Variants
10
+
11
+ | No | Variant | Cortex CLI command |
12
+ | --- | --- | --- |
13
+ | 1 | [8x7B-gguf](https://huggingface.co/cortexhub/llama3/tree/8B-gguf) | `cortex run mixtral:8x7B-gguf` |
14
+
15
+ ## Use it with Jan (UI)
16
+
17
+ 1. Install **Jan** using [Quickstart](https://jan.ai/docs/quickstart)
18
+ 2. Use in Jan model Hub:
19
+ ```
20
+ cortexhub/llama3
21
+ ```
22
+
23
+ ## Use it with Cortex (CLI)
24
+
25
+ 1. Install **Cortex** using [Quickstart](https://cortex.jan.ai/docs/quickstart)
26
+ 2. Run the model with command:
27
+ ```
28
+ cortex run mixtral
29
+ ```
30
+
31
+ ## Credits
32
+
33
+ - **Author:** Mistralai
34
+ - **Converter:** [Homebrew](https://www.homebrew.ltd/)