Qwen2.5-3B Fine-Tuned on BBH Dataset - Model Card

Model Overview

Model Name: Qwen2.5-3B Fine-Tuned on BBH
Base Model: Qwen2.5-3B-Instruct
Fine-Tuned Dataset: BBH (BigBench Hard)
Task: Causal Language Modeling (CLM)
Fine-Tuning Objective: Improve performance on reasoning and knowledge-based multiple-choice tasks

Dataset Information

The model was fine-tuned on BigBench Hard (BBH), a dataset designed to evaluate complex reasoning tasks. Key subsets used for training:
Causal Judgement: Evaluating causality understanding
Date Understanding: Temporal reasoning and date manipulation
Boolean Expressions: Logical reasoning

Dataset characteristics:
Format: Multiple-choice questions
Domains: Logic, Mathematics, Commonsense Reasoning
Label Mapping: Answers converted into numerical classes (e.g., A → 0, B → 1)
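
The label mapping described above can be sketched as follows. This is a minimal illustration, not the preprocessing code used for this model: the helper names `letter_to_class` and `class_to_letter` are hypothetical, and the sketch assumes answers arrive as a single letter, optionally wrapped in parentheses such as `(B)`. The card does not say how non-letter targets are handled, so they are rejected here.

```python
import string


def letter_to_class(letter: str) -> int:
    """Map an answer letter such as 'A' or '(B)' to a zero-based class index.

    Hypothetical helper: assumes the answer is one letter, optionally
    surrounded by whitespace and parentheses.
    """
    cleaned = letter.strip().strip("()").upper()
    if len(cleaned) != 1 or cleaned not in string.ascii_uppercase:
        raise ValueError(f"not a single answer letter: {letter!r}")
    return string.ascii_uppercase.index(cleaned)


def class_to_letter(index: int) -> str:
    """Inverse mapping: turn a class index back into its answer letter."""
    if not 0 <= index < len(string.ascii_uppercase):
        raise ValueError(f"class index out of range: {index}")
    return string.ascii_uppercase[index]


if __name__ == "__main__":
    for answer in ["A", "(B)", " c "]:
        print(answer, "->", letter_to_class(answer))
```

Keeping the inverse mapping next to the forward one makes it straightforward to turn a predicted class index back into the answer letter when reading model outputs.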