mirlab
/

AkaLlama-llama3-70b-v0.1-GGUF

Text Generation

Model card Files Files and versions Community

Steamout commited on May 6, 2024

Commit

e23e89c

·

verified ·

1 Parent(s): 393336c

Update README.md

Files changed (1) hide show

README.md +10 -8

README.md CHANGED Viewed

@@ -190,10 +190,12 @@ del AkaLlama-llama3-70b-v0.1.Q8_0.00001-of-00002.gguf AkaLlama-llama3-70b-v0.1.Q
 ## Evaluation
-|               Model              | #Parameter | Qunatized? | LogicKor |
-|:--------------------------------:|:----------:|------------|:--------:|
-| AkaLlama-llama3-70b-v0.1-GGUF.Q4 |     70B    | 4bit       |   6.56   |
-| AkaLlama-llama3-70b-v0.1-GGUF.Q8 |     70B    | 8bit       |   6.34   |
 ## Training Details
 ### Training Procedure
@@ -343,14 +345,14 @@ print(output.shape)
 You can find more examples at [our project page](https://yonsei-mir.github.io/AkaLLaMA-page)
-## Warning
-Although AKALlama-70B has significant potential, its responses can sometimes be inaccurate, biased, or misaligned, presenting risks if used without additional testing and refinement. Furthermore, the quality of the model's output is greatly influenced by the system prompt and decoding strategy. Changes in these areas could result in less precise outputs. Therefore, we strongly recommend handling our model with considerable caution.
 ## Special Thanks
 - Data Center of the Department of Artificial Intelligence and Jeong Mee Koh at Yonsei University for the computation resources
 ## Comments
 - Title image generated by DALL·E 3

 ## Evaluation
+|               Model              | #Parameter | Qunatized? | LogicKor* |
+|:--------------------------------:|:----------:|------------|:---------:|
+| AkaLlama-llama3-70b-v0.1-GGUF.Q4 |     70B    |    4bit    |    6.56   |
+| AkaLlama-llama3-70b-v0.1-GGUF.Q8 |     70B    |    8bit    |    6.34   |
+*mean over 3 random seeds
 ## Training Details
 ### Training Procedure
 You can find more examples at [our project page](https://yonsei-mir.github.io/AkaLLaMA-page)
 ## Special Thanks
 - Data Center of the Department of Artificial Intelligence and Jeong Mee Koh at Yonsei University for the computation resources
+## Warning
+Although AKALlama-70B has significant potential, its responses can sometimes be inaccurate, biased, or misaligned, presenting risks if used without additional testing and refinement. Furthermore, the quality of the model's output is greatly influenced by the system prompt and decoding strategy. Changes in these areas could result in less precise outputs. Therefore, we strongly recommend handling our model with considerable caution.
 ## Comments
 - Title image generated by DALL·E 3