machine-teaching-group's picture
Update README.md
9b40a44 verified
metadata
license: mit
base_model:
  - course-genai-w24/week4-phi-1.5-sft-shakespeare

Model Card for Model ID

Summary

This is a preference tuned model for text completion based on Phi 1.5. It has been tuned on a filtered version of the The Complete Works of William Shakespeare, which can be found and downloaded from here: https://www.gutenberg.org/ebooks/100.

Model Description

  • Developed by: Course Organizers
  • Finetuned from model: microsoft/phi-1_5

Training Details

This model has been trained using the TRL library and OPROTrainer class from Huggingface.

Training Data

The Complete Works of William Shakespeare, which can be found and downloaded from here: https://www.gutenberg.org/ebooks/100

Training Hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 1e-06
  • per_device_train_batch_size: 1
  • lr_scheduler_type: cosine
  • weight_decay: 0.01
  • num_epochs: 1

Framework Versions

  • accelerate==0.26.1
  • datasets==2.16.1
  • transformers==4.45.2
  • trl==0.11.2

Compute Infrastructure and Hardware

Slurm cluster with 8 x H100 Nvidia GPUs.