leafspark
/

wikichat-v2

Text Generation

Inference Endpoints

Model card Files Files and versions Community

leafspark commited on Apr 24

Commit

0aefaec

•

1 Parent(s): 9cd9460

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -20,11 +20,13 @@ The GGUFs uploaded are full FP32 precision.
 Using OpenOrca GPT-4 data + cosmopedia for some extra data + dolly15k for instruct
 ## Model Details:
-- 40M parameters
 - 8 attention heads
-- 32 layers
 - 384 embeddings size
-- 2048/8192 context (please use 4x RoPE scaling)
 ## Prompt Format (Alpaca):
 ```

 Using OpenOrca GPT-4 data + cosmopedia for some extra data + dolly15k for instruct
 ## Model Details:
+- 71.7M parameters (71,775,700)
 - 8 attention heads
+- 32 layers (34 layers on final model)
 - 384 embeddings size
+- 2048/8192/16384 context (please use 4x RoPE scaling, may train a 16k finetuned version later)
+- Batch size 16
+- llama.cpp (train-text-from-scratch)
 ## Prompt Format (Alpaca):
 ```