Update README.md
README.md CHANGED
@@ -2,10 +2,8 @@
 license: cc-by-nd-4.0
 base_model: []
 tags:
-
-- merge
+- dpo
 ---
 # This is an experimental model that I made by merging two Llama 2 70B models with mergekit. mergekit is a tool that lets me mix and match different models into one model while keeping the knowledge and skills of the originals. Llama 2 70B is a large language model that can generate text for all kinds of tasks and styles.
 
 The merged model has 55 billion parameters and was trained on a cluster with 640 GB of VRAM.
-
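
For context, mergekit builds merges like this from a declarative YAML config. The sketch below shows a passthrough (layer-stacking) merge of two Llama 2 70B models; the repository names and layer ranges are hypothetical placeholders, not the actual recipe behind this model.

```yaml
# Hypothetical mergekit config sketch: stacks layer slices from two
# Llama 2 70B donor models (80 transformer layers each) into one model.
# The model names and layer ranges below are illustrative assumptions,
# not the recipe used for the model described in this README.
slices:
  - sources:
      - model: example-org/llama2-70b-model-a   # placeholder donor model
        layer_range: [0, 40]                    # first half of its layers
  - sources:
      - model: example-org/llama2-70b-model-b   # placeholder donor model
        layer_range: [40, 80]                   # second half of its layers
merge_method: passthrough   # concatenate the slices without averaging
dtype: float16
```

With mergekit installed, a config like this is typically run with `mergekit-yaml config.yml ./merged-model` (optionally with `--cuda`); holding both donor models' weights in memory during the merge is what drives the large VRAM footprint mentioned above.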