Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ Hi, I am Magpie 🐦, your efficient and high-quality synthetic data generation
|
|
25 |
|
26 |
## [🧭 Click here for full dataset navigation (SFT and DPO)](https://github.com/magpie-align/magpie/blob/main/navigation.md)
|
27 |
|
28 |
-
## Raw Datasets
|
29 |
|Model Name | Dataset | Type | Description |
|
30 |
|-------------|:-------|:-------|:-------|
|
31 |
| [Qwen2.5 72B Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) | [Magpie-Qwen2.5-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2.5-Pro-1M-v0.1) | SFT | 1M Raw conversations built with Qwen2.5 72B Instruct.
|
@@ -37,7 +37,7 @@ Hi, I am Magpie 🐦, your efficient and high-quality synthetic data generation
|
|
37 |
| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) | [Magpie-Phi3-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-1M-v0.1) | SFT | 1M Raw conversations built with Phi-3 Medium Instruct.
|
38 |
| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) | [Magpie-Gemma2-Pro-534K](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-534K-v0.1) | SFT | 534K conversations built with Gemma-2-27b-it.
|
39 |
| [Llama 3.1 405B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct) | [Magpie-Ultra-v0.1](https://huggingface.co/datasets/argilla/magpie-ultra-v0.1) | SFT | [Argilla] 50K Raw conversations built with Meta Llama 3.1 405B.
|
40 |
-
|
41 |
### Recommended Filtered Datasets
|
42 |
|
43 |
Here are some filtered datasets made by the authors, which are utilized in our [Magpie-Align models](https://huggingface.co/collections/Magpie-Align/magpie-models-668c4a8eea81ccc0db130bdf). We also encourage you to [create and apply your own filters to customize datasets](https://github.com/magpie-align/magpie?tab=readme-ov-file#4-design-and-apply-your-filter).
|
|
|
25 |
|
26 |
## [🧭 Click here for full dataset navigation (SFT and DPO)](https://github.com/magpie-align/magpie/blob/main/navigation.md)
|
27 |
|
28 |
+
<!-- ## Raw Datasets
|
29 |
|Model Name | Dataset | Type | Description |
|
30 |
|-------------|:-------|:-------|:-------|
|
31 |
| [Qwen2.5 72B Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct) | [Magpie-Qwen2.5-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2.5-Pro-1M-v0.1) | SFT | 1M Raw conversations built with Qwen2.5 72B Instruct.
|
|
|
37 |
| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) | [Magpie-Phi3-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-1M-v0.1) | SFT | 1M Raw conversations built with Phi-3 Medium Instruct.
|
38 |
| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) | [Magpie-Gemma2-Pro-534K](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-534K-v0.1) | SFT | 534K conversations built with Gemma-2-27b-it.
|
39 |
| [Llama 3.1 405B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-405B-Instruct) | [Magpie-Ultra-v0.1](https://huggingface.co/datasets/argilla/magpie-ultra-v0.1) | SFT | [Argilla] 50K Raw conversations built with Meta Llama 3.1 405B.
|
40 |
+
-->
|
41 |
### Recommended Filtered Datasets
|
42 |
|
43 |
Here are some filtered datasets made by the authors, which are utilized in our [Magpie-Align models](https://huggingface.co/collections/Magpie-Align/magpie-models-668c4a8eea81ccc0db130bdf). We also encourage you to [create and apply your own filters to customize datasets](https://github.com/magpie-align/magpie?tab=readme-ov-file#4-design-and-apply-your-filter).
|