|
--- |
|
license: mit |
|
license_link: https://huggingface.co/microsoft/phi-2/resolve/main/LICENSE |
|
language: |
|
- en |
|
widget: |
|
- text: Hello who are you? |
|
example_title: Identity |
|
- text: What can you do? |
|
example_title: Capabilities |
|
- text: Create a fastapi endpoint to retrieve the weather given a zip code. |
|
example_title: Coding |
|
tags: |
|
- convAI |
|
- conversational |
|
pipeline_tag: text-generation |
|
inference: false |
|
--- |
|
|
|
<!-- description start --> |
|
# Phi-2 Super (SFT + cDPO) |
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/62ceeb27e7f6014c0e9d9268/5-LQCMrXi8FN_ewcWL47v.png) |
|
- **Model creator:** [Anton Bacaj](https://huggingface.co/abacaj) |
|
- **Original model:** [Phi-2 Super](https://huggingface.co/abacaj/phi-2-super) |
|
|
|
## Description |
|
This repo contains 4-bit Marlin format model files for [abacaj's Phi-2 Super](https://huggingface.co/abacaj/phi-2-super) |
|
|
|
### Phi-2-super (SFT + cDPO) |
|
|
|
Base Model: [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) |
|
|
|
### Chat template |
|
|
|
The model uses the same chat template as found in Mistral instruct models: |
|
|
|
```python |
|
text = "<|endoftext|>[INST] What is your favourite condiment? [/INST]" |
|
"Well, I'm quite partial to a good squeeze of fresh lemon juice. It adds just the right amount of zesty flavour to whatever I'm cooking up in the kitchen!<|endoftext|> " |
|
"[INST] Do you have mayonnaise recipes? [/INST]" |
|
``` |
|
|
|
### MT-bench / heval |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/62ceeb27e7f6014c0e9d9268/lnFu3x1ufdpQVysIrX4-G.png) |
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/62ceeb27e7f6014c0e9d9268/mJfBpH8dIW7Ii2KAGI_A7.png) |
|
<!-- description end --> |