numind/NuExtract-tiny

Alexandre-Numind committed on Jun 24, 2024

Commit 9e926fd · verified · 1 Parent(s): cbb8243

Update README.md

Files changed (1)

README.md +28 -7


@@ -3,13 +3,34 @@ license: mit
 language:
 - en
 widget:
-- text: '''<|input|>
-sds ### Template\n:
-sd
-'''
+- text: '<|input|>
+### Template:
+{
+    "Model": {
+        "Name": "",
+        "Number of parameters": "",
+        "Number of max token": "",
+        "Architecture": []
+    },
+    "Usage": {
+        "Use case": [],
+        "Licence": ""
+    }
+}
+### Text:
+We introduce Mistral 7B, a 7–billion-parameter language model engineered for
+superior performance and efficiency. Mistral 7B outperforms the best open 13B
+model (Llama 2) across all evaluated benchmarks, and the best released 34B
+model (Llama 1) in reasoning, mathematics, and code generation. Our model
+leverages grouped-query attention (GQA) for faster inference, coupled with sliding
+window attention (SWA) to effectively handle sequences of arbitrary length with a
+reduced inference cost. We also provide a model fine-tuned to follow instructions,
+Mistral 7B – Instruct, that surpasses Llama 2 13B – chat model both on human and
+automated benchmarks. Our models are released under the Apache 2.0 license.
+Code https://github.com/mistralai/mistral-src
+Webpage https://mistral.ai/news/announcing-mistral-7b/
+<|output|>
+'
 ---
 # Structure Extraction Model by NuMind 🔥
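
The new widget text follows a fixed layout: an `<|input|>` marker, a `### Template:` header followed by an empty JSON schema, a `### Text:` header followed by the passage to extract from, and a closing `<|output|>` marker. A minimal sketch of assembling a prompt in that layout is shown below. The helper name `build_prompt` is hypothetical and not part of this commit, and the 4-space JSON indentation is assumed from the widget example rather than documented as a requirement.

```python
import json


def build_prompt(template: dict, text: str) -> str:
    """Assemble a prompt in the layout used by the widget example.

    Hypothetical helper: the layout is copied from the diff above,
    and the 4-space JSON indentation is an assumption taken from it.
    """
    schema = json.dumps(template, indent=4)
    return f"<|input|>\n### Template:\n{schema}\n### Text:\n{text}\n<|output|>\n"


# Empty schema mirroring the widget example (keys copied verbatim).
template = {
    "Model": {
        "Name": "",
        "Number of parameters": "",
        "Number of max token": "",
        "Architecture": [],
    },
    "Usage": {
        "Use case": [],
        "Licence": "",
    },
}

prompt = build_prompt(
    template,
    "Our models are released under the Apache 2.0 license.",
)
print(prompt)
```

The resulting string can be passed to whatever tokenizer and generation call is used with the model. The text generated after `<|output|>` is presumably the filled-in JSON, though this commit does not show the output format.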