pradeep6kumar2024 commited on
Commit
1a8f82f
·
1 Parent(s): 1494734

updated readme

Browse files
Files changed (1) hide show
  1. README.md +52 -30
README.md CHANGED
@@ -1,24 +1,12 @@
1
  ---
2
- language: en
3
- tags:
4
- - phi-2
5
- - qlora
6
- - fine-tuning
7
- - assistant
8
- - coding
9
- - writing
10
- license: mit
11
- datasets:
12
- - custom
13
- model-index:
14
- - name: phi2-qlora-assistant
15
- results:
16
- - task: text-generation
17
- type: text-generation
18
- metrics:
19
- - name: accuracy
20
- type: accuracy
21
- value: N/A
22
  ---
23
 
24
  # Phi-2 QLoRA Fine-tuned Assistant
@@ -32,6 +20,50 @@ This is a fine-tuned version of Microsoft's Phi-2 model using QLoRA (Quantized L
32
  - **Training Data**: Custom dataset focused on coding, technical explanations, and professional communication
33
  - **Primary Use Cases**: Code generation, technical writing, and professional communication
34
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
  ## Example Usage
36
 
37
  ```python
@@ -67,12 +99,6 @@ response = tokenizer.decode(outputs[0], skip_special_tokens=True)
67
  "Dear Team,
68
  I hope this email finds you well. I would like to schedule a team meeting next week to discuss our project progress..."
69
 
70
- ## Parameters
71
-
72
- - **Temperature**: Controls creativity (0.3-0.5 for code, 0.7-0.9 for writing)
73
- - **Max Length**: Adjustable based on desired response length (64-1024)
74
- - **Top P**: Controls response diversity (recommended: 0.9)
75
-
76
  ## Limitations
77
 
78
  - The model works best with clear, well-structured prompts
@@ -83,10 +109,6 @@ response = tokenizer.decode(outputs[0], skip_special_tokens=True)
83
 
84
  You can try this model directly in your browser using our Gradio Space: [Phi2-QLoRA-Assistant Demo](https://huggingface.co/spaces/pradeep6kumar2024/phi2-qlora-assistant-demo)
85
 
86
- ## License
87
-
88
- This model is released under the MIT License.
89
-
90
  ## Acknowledgments
91
 
92
  - Microsoft for the Phi-2 base model
 
1
  ---
2
+ title: Phi-2 QLoRA Assistant Demo
3
+ emoji: 🤖
4
+ colorFrom: blue
5
+ colorTo: purple
6
+ sdk: gradio
7
+ sdk_version: 4.19.2
8
+ app_file: app.py
9
+ pinned: false
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  # Phi-2 QLoRA Fine-tuned Assistant
 
20
  - **Training Data**: Custom dataset focused on coding, technical explanations, and professional communication
21
  - **Primary Use Cases**: Code generation, technical writing, and professional communication
22
 
23
+ ## Usage Tips
24
+
25
+ ### For Code Generation (Temperature: 0.3-0.5)
26
+ ```python
27
+ # Example prompt:
28
+ "Write a Python function to calculate the factorial of a number and provide additional recursive function examples"
29
+ ```
30
+
31
+ ### For Technical Explanations (Temperature: 0.7)
32
+ ```text
33
+ # Example prompt:
34
+ "Explain what machine learning is in simple terms and provide some real-world applications"
35
+ ```
36
+
37
+ ### For Professional Writing (Temperature: 0.7-0.9)
38
+ ```text
39
+ # Example prompt:
40
+ "Write a professional email to schedule a team meeting for next week to discuss project progress"
41
+ ```
42
+
43
+ ## Parameters Guide
44
+
45
+ - **Maximum Length**: 64-1024 (default: 512)
46
+ - Increase for longer responses
47
+ - Decrease for quicker, more concise responses
48
+
49
+ - **Temperature**: 0.1-1.0 (default: 0.7)
50
+ - 0.3-0.5: Best for code generation
51
+ - 0.7-0.9: Best for creative writing
52
+ - 1.0: Maximum creativity
53
+
54
+ - **Top P**: 0.1-1.0 (default: 0.9)
55
+ - Controls diversity of word choices
56
+ - Higher values = more diverse vocabulary
57
+
58
+ ## Model Links
59
+
60
+ - **Model Card**: [pradeep6kumar2024/phi2-qlora-assistant](https://huggingface.co/pradeep6kumar2024/phi2-qlora-assistant)
61
+ - **Base Model**: [microsoft/phi-2](https://huggingface.co/microsoft/phi-2)
62
+
63
+ ## License
64
+
65
+ This demo is released under the MIT License.
66
+
67
  ## Example Usage
68
 
69
  ```python
 
99
  "Dear Team,
100
  I hope this email finds you well. I would like to schedule a team meeting next week to discuss our project progress..."
101
 
 
 
 
 
 
 
102
  ## Limitations
103
 
104
  - The model works best with clear, well-structured prompts
 
109
 
110
  You can try this model directly in your browser using our Gradio Space: [Phi2-QLoRA-Assistant Demo](https://huggingface.co/spaces/pradeep6kumar2024/phi2-qlora-assistant-demo)
111
 
 
 
 
 
112
  ## Acknowledgments
113
 
114
  - Microsoft for the Phi-2 base model