File size: 1,468 Bytes
2093c82
 
 
 
 
 
e21be27
2093c82
 
 
 
 
ea6cbcb
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2093c82
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---

title: Phi-2 Fine-tuned Chat Assistant
emoji: 🤖
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: false
license: mit
---


# Phi-2 Fine-tuned Chat Assistant

This Space hosts a fine-tuned version of Microsoft's Phi-2 model using QLoRA (Quantized Low-Rank Adaptation). The model has been trained on the OpenAssistant dataset to improve its conversational abilities.

## Model Details

- Base Model: Microsoft Phi-2
- Training Method: QLoRA (4-bit quantization)
- Dataset: OpenAssistant Conversations Dataset
- Fine-tuning Parameters:
  - LoRA rank: 16
  - LoRA alpha: 32
  - Dropout: 0.1
  - Target modules: q_proj, v_proj

## Usage

Simply type your message in the input box and press Enter. The model will generate a response based on your input. You can also try the example prompts provided below the chat interface.

## Features

- Interactive chat interface
- Real-time response generation
- Example prompts for quick testing
- Configurable generation parameters (temperature, top-p)

## Limitations

- The model may occasionally generate incorrect or inconsistent responses
- Response generation time may vary depending on the input length and server load
- The model's knowledge is limited to its training data

## License

This Space uses the Microsoft Phi-2 model which is subject to its original license. The fine-tuning additions are provided under the MIT License.