Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
singhsidhukuldeepΒ 
posted an update May 20, 2024
Post
1031
You are happy that @Meta has open-sourced Llama 3 πŸ˜ƒ...

So you jump on @HuggingFace Hub to download the new shiny Llama 3 model only to see a few quintillion Llama 3's! πŸ¦™βœ¨

Which one should you use? πŸ€”

Not all Llamas are created equal! πŸ¦™βš–οΈ

An absolutely crazy comparison experiment by Wolfram Ravenwolf ( @Wolfram ) might answer your question! πŸ§ͺπŸ§™β€β™‚οΈ

- Comprehensive assessment of Llama 3 Instruct 70B and 8B models. πŸ“Š
- Tested 20 versions across HF, GGUF, and EXL2 formats. πŸ”„
- Methodology: The process tested translation capabilities and cross-language understanding, using deterministic generation settings to minimize random factors. Used German data protection training exams to evaluate cross-language understanding. πŸŒπŸ“
- Best performance from EXL2 4.5bpw quant, scoring perfect in all tests. πŸ†βœ…
- GGUF 8-bit to 4-bit quants also performed exceptionally. 🌟
- Llama 3 8B unquantized is best in its size class but not as good as 70B quants. πŸ“πŸ”
- 1-bit quantizations showed significant quality drops. βš οΈβ¬‡οΈ

Best models:
- turboderp/Llama-3-70B-Instruct-exl2
- casperhansen/llama-3-70b-instruct-awq

Blog: https://huggingface.co/blog/wolfram/llm-comparison-test-llama-3