Spaces:

crumbly
/

README

Running

crumb commited on Oct 27, 2023

Commit

f6e848e

1 Parent(s): 6cd43ca

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -17,10 +17,7 @@ Gale comprises three decoder-only transformer models derived from [Mistral](http
 | [Gale-Medium](https://hf.co/crumbly/Gale-medium) | 3B | 13/32 |
 | [Gale-Small](https://hf.co/crumbly/Gale-small) | 1B | 4/32 |
-## Horizon Dataset
-The dataset used to train the Gale models consists of updated English text and code to fine-tune models like Dante which need to "set" their architectural changes in place. It's an efficient approach to leverage prior model knowledge instead of starting from scratch.
 | Subset | Token % |
 | --- | --- |

 | [Gale-Medium](https://hf.co/crumbly/Gale-medium) | 3B | 13/32 |
 | [Gale-Small](https://hf.co/crumbly/Gale-small) | 1B | 4/32 |
+The Crumbly 'Horizon' dataset used to train the Gale models consists of updated English text and code to fine-tune models like Gale which need to "set" their architectural changes in place. It's an efficient approach to leverage prior model knowledge instead of starting from scratch.
 | Subset | Token % |
 | --- | --- |