macadeliccc committed
Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ pipeline_tag: text-generation
 ---
 # Magistrate 3.2 3B
 
-Continued pretraining applied to [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) using no synthetic data. ~250M tokens.
+Continued pretraining applied to [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) using no synthetic legal data. ~250M tokens.
 
 The model achieves the following results on the evaluation set:
 - Loss: 0.6802
@@ -210,7 +210,9 @@ More information needed
 
 ## Training and evaluation data
 
-
+The training data consists of US Supreme Court verdicts, federal regulations, laws and treaties.
+
+Some other resources have been included from institutions like CLL to fill in the gaps in knowledge for industry jargon.
 
 ## Training procedure
 
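The diff retains the reported evaluation loss of 0.6802. Assuming that figure is a mean token-level cross-entropy in nats (the usual convention for causal-LM trainers; not stated explicitly in the card), the corresponding evaluation perplexity can be recovered as exp(loss):

```python
import math

# Reported evaluation loss from the model card
eval_loss = 0.6802

# Assumption: the loss is mean per-token cross-entropy in nats,
# so perplexity is simply its exponential.
perplexity = math.exp(eval_loss)

print(f"{perplexity:.2f}")  # → 1.97
```

If the trainer instead reported the loss in bits, the base would be 2 rather than e; the card does not say, so the nats convention above is an assumption.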