Arcanum-12b / README.md

Adding Evaluation Results

7fff864 verified 4 months ago

4.82 kB

	---
	license: mit
	library_name: transformers
	model-index:
	- name: Arcanum-12b
	results:
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: IFEval (0-Shot)
	type: HuggingFaceH4/ifeval
	args:
	num_few_shot: 0
	metrics:
	- type: inst_level_strict_acc and prompt_level_strict_acc
	value: 29.07
	name: strict accuracy
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: BBH (3-Shot)
	type: BBH
	args:
	num_few_shot: 3
	metrics:
	- type: acc_norm
	value: 31.88
	name: normalized accuracy
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: MATH Lvl 5 (4-Shot)
	type: hendrycks/competition_math
	args:
	num_few_shot: 4
	metrics:
	- type: exact_match
	value: 10.27
	name: exact match
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: GPQA (0-shot)
	type: Idavidrein/gpqa
	args:
	num_few_shot: 0
	metrics:
	- type: acc_norm
	value: 9.4
	name: acc_norm
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: MuSR (0-shot)
	type: TAUR-Lab/MuSR
	args:
	num_few_shot: 0
	metrics:
	- type: acc_norm
	value: 13.53
	name: acc_norm
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	- task:
	type: text-generation
	name: Text Generation
	dataset:
	name: MMLU-PRO (5-shot)
	type: TIGER-Lab/MMLU-Pro
	config: main
	split: test
	args:
	num_few_shot: 5
	metrics:
	- type: acc
	value: 28.74
	name: accuracy
	source:
	url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Xclbr7/Arcanum-12b
	name: Open LLM Leaderboard
	---

	![Arcanum-12b Banner](https://cdn-uploads.huggingface.co/production/uploads/66dcee3321f901b049f48002/SvGSozVAJMaf5PL21dMBb.jpeg)

	# Arcanum-12b 🧙‍♂️


	Arcanum-12b is a merged large language model created by combining TheDrummer/Rocinante-12B-v1.1 and MarinaraSpaghetti/NemoMix-Unleashed-12B using a novel merging technique.

	## Model Details 📊

	- Developed by: Xclbr7
	- Model type: Causal Language Model
	- Language(s): English (primarily), may support other languages
	- License: MIT
	- Repository: https://huggingface.co/Xclbr7/Arcanum-12b

	## Model Architecture 🏗️

	- Base model: MarinaraSpaghetti/NemoMix-Unleashed-12B
	- Parameter count: ~12 billion
	- Architecture specifics: Transformer-based language model

	## Training & Merging 🔄

	Arcanum-12b was created by merging two existing 12B models:

	1. TheDrummer/Rocinante-12B-v1.1
	- Density parameters: [1, 0.8, 0.6]
	- Weight: 0.7

	2. MarinaraSpaghetti/NemoMix-Unleashed-12B
	- Density parameters: [0.5, 0.7, 0.9]
	- Weight: 0.8

	Merging method: Ties
	Additional parameters:
	- Normalization: True
	- Int8 mask: True
	- Data type: float16

	## Intended Use 🎯

	Conversation with different personas.

	## Performance and Limitations ⚖️

	Not tested yet.

	## Ethical Considerations 🤔

	As a merged model based on existing language models, Arcanum-12b may inherit biases and limitations from its parent models. Users should be aware of potential biases in generated content and use the model responsibly.


	## Acknowledgments 🙏

	We acknowledge the contributions of the original model creators:
	- TheDrummer for Rocinante-12B-v1.1
	- MarinaraSpaghetti for NemoMix-Unleashed-12B

	Their work formed the foundation for Arcanum-12b.

	# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
	Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Xclbr7__Arcanum-12b)

	\| Metric \|Value\|
	\|-------------------\|----:\|
	\|Avg. \|20.48\|
	\|IFEval (0-Shot) \|29.07\|
	\|BBH (3-Shot) \|31.88\|
	\|MATH Lvl 5 (4-Shot)\|10.27\|
	\|GPQA (0-shot) \| 9.40\|
	\|MuSR (0-shot) \|13.53\|
	\|MMLU-PRO (5-shot) \|28.74\|