license: apache-2.0 language: - en metrics: - rouge base_model: - microsoft/phi-2 pipeline_tag: question-answering
This repo containes the last checkpoint of my fine tuned model. Click this link to go the final model https://huggingface.co/JamieAi33/Phi-2_PEFT
Model Card for PEFT-Fine-Tuned Model
This model card documents a PEFT-fine-tuned version of microsoft/phi-2
for question-answering tasks. The PEFT fine-tuning improved the model's performance, as detailed in the evaluation section.
Model Details
Model Description
- Developed by: JamieAi33
- Finetuned from model:
microsoft/phi-2
- Model type: PEFT fine-tuned transformer
- Language(s) (NLP): English
- License: Apache 2.0
The base model microsoft/phi-2
was adapted using Parameter-Efficient Fine-Tuning (PEFT) for question-answering tasks. The training process focused on improving performance metrics while keeping computational costs low.
Model Sources
- Repository: https://huggingface.co/JamieAi33/Phi-2-QLora
- Paper: [Optional: Add a reference to PEFT or any relevant paper]
- Demo: [Optional: Link to your Hugging Face Space or demo]
Uses
Direct Use
This model can be used out-of-the-box for question-answering tasks.
Downstream Use
The model can be fine-tuned further on domain-specific datasets for improved performance.
Out-of-Scope Use
Avoid using this model for tasks outside question-answering or where fairness, bias, and ethical considerations are critical without further validation.
Bias, Risks, and Limitations
Users should be aware that:
- The model is trained on publicly available data and may inherit biases present in the training data.
- It is optimized for English and may perform poorly in other languages.
How to Get Started with the Model
Here’s an example of loading the model:
from transformers import AutoModel
from peft import PeftModel
base_model = AutoModel.from_pretrained("microsoft/phi-2")
adapter_model = PeftModel.from_pretrained(base_model, "JamieAi33/Phi-2-QLora")
# Model Name: PEFT Fine-Tuned `microsoft/phi-2`
This repository contains a PEFT fine-tuned version of the `microsoft/phi-2` model for question-answering tasks. The fine-tuning process leveraged Parameter-Efficient Fine-Tuning (PEFT) techniques to achieve improved performance.
---
## Metrics
The model's performance was evaluated using the ROUGE metric. Below are the results:
| **Metric** | **Original Model** | **PEFT Model** | **Absolute Improvement** |
|-----------------|--------------------|----------------|---------------------------|
| **ROUGE-1** | 29.76% | 44.51% | +14.75% |
| **ROUGE-2** | 10.76% | 15.68% | +4.92% |
| **ROUGE-L** | 21.69% | 30.95% | +9.25% |
| **ROUGE-Lsum** | 22.75% | 31.49% | +8.74% |
---
## Training Configuration
| Hyperparameter | Value |
|-----------------------|-------------------------|
| **Batch Size** | 1 |
| **Learning Rate** | 2e-4 |
| **Max Steps** | 1000 |
| **Optimizer** | Paged AdamW (8-bit) |
| **Logging Steps** | 25 |
| **Evaluation Steps** | 25 |
| **Gradient Checkpointing** | Enabled |
- Downloads last month
- 10
Model tree for JamieAi33/Phi-2-QLora
Base model
microsoft/phi-2