Crystalcareai committed
Commit: 8787869 • Parent(s): fe3e9b4
Update README.md

README.md CHANGED
@@ -16,6 +16,8 @@ datasets:

Llama-3-SEC is a state-of-the-art domain-specific large language model trained on a vast corpus of SEC (Securities and Exchange Commission) data. Built upon the powerful Meta-Llama-3-70B-Instruct model, Llama-3-SEC has been developed to provide unparalleled insights and analysis capabilities for financial professionals, investors, researchers, and anyone working with SEC filings and related financial data.

+GGUFs: https://huggingface.co/arcee-ai/Llama-3-SEC-Chat-GGUF
+
## Model Details

- **Base Model:** Meta-Llama-3-70B-Instruct
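The GGUF build linked in this hunk can be run locally with llama.cpp bindings. A minimal sketch, assuming llama-cpp-python is installed and one of the quantized files from the repo above has been downloaded; the file name and generation settings are hypothetical:

```python
# Minimal sketch: running the linked GGUF build with llama-cpp-python.
# The model_path below is hypothetical; use whichever quantization you downloaded.
from llama_cpp import Llama

llm = Llama(
    model_path="Llama-3-SEC-Chat-Q4_K_M.gguf",  # hypothetical local file
    n_ctx=8192,       # context window; lower this to fit available RAM
    n_gpu_layers=-1,  # offload all layers to GPU if one is available
)

out = llm.create_chat_completion(
    messages=[
        {"role": "user", "content": "Summarize the risk factors in a 10-K filing."}
    ],
    max_tokens=256,
)
print(out["choices"][0]["message"]["content"])
```

Any of the quantizations in that repo should work the same way; larger quants trade memory for output fidelity.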
@@ -94,6 +96,36 @@ generated_ids = [
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
```
+
+## Mergekit Yaml
+
+```yaml
+merge_method: ties
+base_model: meta-llama/Meta-Llama-3-70B
+models:
+  - model: /home/ubuntu/data/cpt
+    parameters:
+      weight:
+        - filter: mlp
+          value: [0.25, 0.5, 0.5, 0.25]
+        - filter: self_attn
+          value: [0.25, 0.5, 0.5, 0]
+        - value: [0.25, 0.5, 0.5, 0.25]
+      density: 0.75
+  - model: meta-llama/Meta-Llama-3-70B-Instruct
+    parameters:
+      weight:
+        - filter: mlp
+          value: [0.75, 0.5, 0.5, 0.75]
+        - filter: self_attn
+          value: [0.75, 0.5, 0.5, 1]
+        - value: [0.75, 0.5, 0.5, 0.75]
+      density: 1.0
+parameters:
+  normalize: true
+  int8_mask: true
+dtype: bfloat16
+```
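To reproduce this merge, the YAML above can be saved to a file and handed to mergekit. A minimal sketch, assuming mergekit's Python entry points (MergeConfiguration, run_merge, MergeOptions) as shown in its README; the config path and output directory are hypothetical, and the CLI equivalent is `mergekit-yaml config.yaml ./out`:

```python
# Minimal sketch of driving the TIES merge above via mergekit's Python API.
# Assumes the YAML shown above was saved as config.yaml; paths are hypothetical.
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("config.yaml", "r", encoding="utf-8") as fp:
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    merge_config,
    "./llama-3-sec-merged",   # hypothetical output directory
    options=MergeOptions(
        cuda=True,            # do the tensor math on GPU
        copy_tokenizer=True,  # write a tokenizer into the output dir
        lazy_unpickle=True,   # reduce peak memory while reading shards
    ),
)
```

Note the per-filter weight schedules: the CPT checkpoint contributes more in the middle layers (0.5) than at the ends (0.25), and its self_attn weight tapers to 0 in the final layers, where the Instruct model's attention takes over entirely.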
## Limitations and Future Work