Phi2-QLoRa-OASST / README.md
nikhiljais's picture
Upload folder using huggingface_hub
e21be27 verified

A newer version of the Gradio SDK is available: 5.23.3

Upgrade
metadata
title: Phi-2 Fine-tuned Chat Assistant
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
license: mit

Phi-2 Fine-tuned Chat Assistant

This Space hosts a fine-tuned version of Microsoft's Phi-2 model using QLoRA (Quantized Low-Rank Adaptation). The model has been trained on the OpenAssistant dataset to improve its conversational abilities.

Model Details

  • Base Model: Microsoft Phi-2
  • Training Method: QLoRA (4-bit quantization)
  • Dataset: OpenAssistant Conversations Dataset
  • Fine-tuning Parameters:
    • LoRA rank: 16
    • LoRA alpha: 32
    • Dropout: 0.1
    • Target modules: q_proj, v_proj

Usage

Simply type your message in the input box and press Enter. The model will generate a response based on your input. You can also try the example prompts provided below the chat interface.

Features

  • Interactive chat interface
  • Real-time response generation
  • Example prompts for quick testing
  • Configurable generation parameters (temperature, top-p)

Limitations

  • The model may occasionally generate incorrect or inconsistent responses
  • Response generation time may vary depending on the input length and server load
  • The model's knowledge is limited to its training data

License

This Space uses the Microsoft Phi-2 model which is subject to its original license. The fine-tuning additions are provided under the MIT License.