---
license: mit
---
|
|
|
This model is finetuned from HuggingFaceH4/zephyr-7b-gemma-v0.1 on nine Indian languages (Hindi, Tamil, Punjabi, Bengali, Gujarati, Oriya, Telugu, Kannada, Malayalam) plus English.
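For reference, a minimal inference sketch with Transformers is shown below. The repo id `GenVRadmin/AryaBhatta-GemmaOrca` and the availability of the base model's chat template are assumptions, not details confirmed by this card.

```python
# Minimal inference sketch (assumptions: repo id and chat-template support).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "GenVRadmin/AryaBhatta-GemmaOrca"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# A Hindi maths prompt ("If one pencil costs 5 rupees, how much do 12 pencils cost?").
messages = [{"role": "user", "content": "अगर एक पेंसिल 5 रुपये की है, तो 12 पेंसिल कितने की होंगी?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```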
|
To improve reasoning and maths skills, we first SFT-tune Gemma on Microsoft's Orca datasets.
|
|
|
We use the Orca Maths Hindi dataset: GenVRadmin/Aryabhatta-Orca-Maths-Hindi \
and the original Orca Maths dataset: microsoft/orca-math-word-problems-200k (an illustrative SFT sketch follows).
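A rough sketch of this SFT stage with TRL is given below. The column names (`question`/`answer`), the prompt format, and the exact `SFTTrainer`/`SFTConfig` arguments are assumptions (TRL's API also varies across versions), so treat it as illustrative rather than the exact training recipe.

```python
# Illustrative SFT sketch; not the exact recipe used for GemmaOrca.
from datasets import load_dataset, concatenate_datasets
from trl import SFTConfig, SFTTrainer

# Column names "question"/"answer" are assumptions about both dataset schemas.
def to_text(example):
    return {"text": f"### Question:\n{example['question']}\n\n### Answer:\n{example['answer']}"}

hindi = load_dataset("GenVRadmin/Aryabhatta-Orca-Maths-Hindi", split="train")
english = load_dataset("microsoft/orca-math-word-problems-200k", split="train")
train_ds = concatenate_datasets([
    hindi.map(to_text, remove_columns=hindi.column_names),
    english.map(to_text, remove_columns=english.column_names),
])

trainer = SFTTrainer(
    model="HuggingFaceH4/zephyr-7b-gemma-v0.1",  # base model named in this card
    train_dataset=train_ds,
    args=SFTConfig(
        output_dir="gemma-orca-sft",
        dataset_text_field="text",  # TRL-version dependent; older releases take this on SFTTrainer
        per_device_train_batch_size=2,
        num_train_epochs=1,
    ),
)
trainer.train()
```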
|
|
|
This pushes the MATHS score from 24.3 (Gemma-7B) and 25.5 (Zephyr-Gemma) up to 31.6 (GemmaOrca).
|
|
|
The model is then finetuned on GenVR's Samvaad datasets (GenVRadmin/Samvaad-Indic-Positive, GenVRadmin/Samvaad-Tamil-Mixtral, and a subset of GenVRadmin/Samvaad-Mixed-Language-3).
|
|
|
It is then further finetuned on various open-source datasets such as the following (a dataset-mixing sketch follows the list):
|
|
|
Telugu-LLM-Labs/yahma_alpaca_cleaned_telugu_filtered_and_romanized \
Telugu-LLM-Labs/teknium_GPTeacher_general_instruct_telugu_filtered_and_romanized \
abhinand/tamil-alpaca \
Tensoic/airoboros-3.2_kn \
Tensoic/gpt-teacher_kn \
Tensoic/Alpaca-Gujarati \
HydraIndicLM/bengali_alpaca_dolly_67k \
Open-Orca/OpenOrca \
pankajmathur/alpaca_orca \
OdiaGenAI/Odia_Alpaca_instructions_52k \
OdiaGenAI/gpt-teacher-roleplay-odia-3k \
GenVRadmin/Samvaad-Punjabi-Mini \
pankajmathur/WizardLM_Orca
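These corpora ship with differing schemas, so a hedged sketch of how a handful of them could be normalized and interleaved for this stage is shown below. The column names (`instruction`/`output`) and the subset of datasets chosen are assumptions for illustration only.

```python
# Hedged dataset-mixing sketch; column names below are assumptions, not verified schemas.
from datasets import load_dataset, interleave_datasets

def normalize(example, instruction_col="instruction", output_col="output"):
    # Map each corpus onto a single "text" field for SFT-style training.
    return {"text": f"### Instruction:\n{example[instruction_col]}\n\n### Response:\n{example[output_col]}"}

sources = [
    "abhinand/tamil-alpaca",
    "Tensoic/Alpaca-Gujarati",
    "GenVRadmin/Samvaad-Punjabi-Mini",
]

normalized = []
for name in sources:
    ds = load_dataset(name, split="train")
    normalized.append(ds.map(normalize, remove_columns=ds.column_names))

# Interleave so each language is sampled at a similar rate during training.
mixed = interleave_datasets(normalized, stopping_strategy="all_exhausted")
print(mixed[0]["text"][:200])
```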
|
|
|
The model achieves the following scores on benchmarks:
|
|
|
| Model | AGIEval | GPT4All | TruthfulQA | BigBench | Average |
|---|---|---|---|---|---|
| AryaBhatta-GemmaOrca | 35.9 | 72.26 | 53.85 | 40.35 | 50.59 |
| zephyr-7b-beta | 37.52 | 71.77 | 55.26 | 39.77 | 51.08 |
| zephyr-7b-gemma-v0.1 | 34.22 | 66.37 | 52.19 | 37.10 | 47.47 |
| mlabonne/Gemmalpaca-7B | 21.6 | 40.87 | 44.85 | 30.49 | 34.45 |
| google/gemma-7b-it | 21.33 | 40.84 | 41.70 | 30.25 | 33.53 |
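The Average column appears to be the plain mean of the four benchmark scores; a quick arithmetic check for the AryaBhatta-GemmaOrca row:

```python
# Sanity-check the Average column for the AryaBhatta-GemmaOrca row.
scores = {"AGIEval": 35.9, "GPT4All": 72.26, "TruthfulQA": 53.85, "BigBench": 40.35}
print(round(sum(scores.values()) / len(scores), 2))  # -> 50.59, matching the table
```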