Update README.md
README.md

---
# Joah-Llama-3-KoEn-8B-Coder-v2

<a href="https://ibb.co/2Srsmn7"><img src="https://i.ibb.co/f9WnB1Y/Screenshot-2024-05-11-at-7-15-42-PM.png" alt="Screenshot-2024-05-11-at-7-15-42-PM" border="0"></a>

A merge model for all of you, one that will be a light for one another from today onward.

"Joah (좋아)" by AsianSoul

A multilingual merge based on this model is coming soon, starting with German (Korean / English / German).

Where to use Joah: medical, Korean, English, translation, code, science, and more.

## Merge Details

The performance of this merge model does not seem bad, though that is just my opinion.

This may not be a model that satisfies you, but if we keep overcoming its shortcomings, won't we someday find the answer we want?

Don't worry even if you don't get the results you want; I'll find the answer for you.

Coming soon: real PoSE to extend Llama's context length to 64k, combined with my merge method [reborn](https://medium.com/@puffanddmx82/reborn-elevating-model-adaptation-with-merging-for-superior-nlp-performance-f604e8e307b2).

I have found that most merged models released so far do not actually have 64k context in their configs. I will improve this in the next merge with reborn; if that doesn't work, I guess I'll have to find another way, right?
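
To see why this matters, here is a minimal sketch of how to audit a model's advertised context length from its config via the `transformers` API. The model id shown is the base model named in this card; substitute whatever merge you want to check:

```python
# Minimal sketch: read the context-length fields from a model config
# before trusting a "64k" claim. Substitute any merged model's repo id.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("NousResearch/Meta-Llama-3-8B")

# Llama configs record the trained context window here; a merge that
# advertises 64k but whose config still says 8192 has not been extended.
print("max_position_embeddings:", cfg.max_position_embeddings)
print("rope_scaling:", getattr(cfg, "rope_scaling", None))
```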

256k is not possible for now; my computer runs out of memory.

If you support me, I will try it on a machine with maximum specifications, and I would also like to run thorough tests for you on a network with high-capacity traffic and high-speed 10G links.

### Merge Method

This model was merged using the [DARE](https://arxiv.org/abs/2311.03099) [TIES](https://arxiv.org/abs/2306.01708) merge method, with [NousResearch/Meta-Llama-3-8B](https://huggingface.co/NousResearch/Meta-Llama-3-8B) as the base model.
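
For readers unfamiliar with the method, below is a minimal sketch of DARE + TIES applied to a single weight tensor, following the two papers linked above. The drop rate, weights, and function names are illustrative assumptions, not the settings or code used to build this model (mergekit's `dare_ties` method is the practical implementation):

```python
# Toy DARE + TIES merge on one tensor. Illustrative only; a real merge
# (e.g. mergekit's dare_ties) applies this logic to every parameter.
import numpy as np

rng = np.random.default_rng(0)

def dare(delta, drop_rate):
    """DARE: randomly drop delta entries, rescale survivors by 1/(1-p)."""
    keep = rng.random(delta.shape) >= drop_rate
    return keep * delta / (1.0 - drop_rate)

def dare_ties(base, finetuned, drop_rate=0.5, weights=None):
    """Fold fine-tuned models into `base` via DARE-sparsified task
    vectors combined with TIES sign election."""
    if weights is None:
        weights = [1.0 / len(finetuned)] * len(finetuned)
    deltas = [w * dare(ft - base, drop_rate)
              for w, ft in zip(weights, finetuned)]
    # TIES: elect a per-parameter majority sign, keep only deltas that
    # agree with it, and average the surviving (nonzero) values.
    elected = np.sign(sum(deltas))
    agree = [np.where(np.sign(d) == elected, d, 0.0) for d in deltas]
    survivors = sum((a != 0).astype(float) for a in agree)
    return base + sum(agree) / np.maximum(survivors, 1.0)

# Usage: one base "layer" and two fine-tuned variants of it.
base = rng.normal(size=(4, 4))
fts = [base + rng.normal(scale=0.1, size=(4, 4)) for _ in range(2)]
print(dare_ties(base, fts, drop_rate=0.5))
```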