Qwen2.5-7B-CLIPPER / README.md
nielsr's picture
nielsr HF staff
Add library_name and pipeline_tag to metadata
0b7ab54 verified
|
raw
history blame
2.32 kB
metadata
base_model:
  - Qwen/Qwen2.5-7B-Instruct
datasets:
  - chtmp223/CLIPPER
language:
  - en
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation

Qwen2.5-7B-CLIPPER

Qwen2.5-7B-CLIPPER is a fine-tuned version of https://huggingface.co/Qwen/Qwen2.5-7B-Instruct using supervised finetuning over chtmp223/CLIPPER dataset. Please check our paper for more details on the method.

πŸ“’ Model Details

Model Description

Model Sources

πŸ’» Training Details

Training Data

chtmp223/CLIPPER

Training Procedure

Configurations Values
Hardware (Training and Inference) 8xA100s
Tracking wandb
batch size 16
gradient_checkpointing True
learning_rate 1.0e-6
lr_scheduler_type cosine
max_length 131072
num_train_epochs 1