eliebak HF Staff commited on
Commit
6e67a6f
·
verified ·
1 Parent(s): 79e48c3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -2
README.md CHANGED
@@ -22,7 +22,7 @@ language:
22
 
23
  ## Model Summary
24
 
25
- SmolLM is a series of state-of-the-art small language models available in three sizes: 135M, 360M, and 1.7B parameters. These models are built on Cosmo-Corpus, a meticulously curated high-quality training dataset. Cosmo-Corpus includes Cosmopedia v2 (28B tokens of synthetic textbooks and stories generated by Mixtral), Python-Edu (4B tokens of educational Python samples from The Stack), and FineWeb-Edu (220B tokens of deduplicated educational web samples from FineWeb). SmolLM models have shown promising results when compared to other models in their size categories across various benchmarks testing common sense reasoning and world knowledge. For detailed information on training, benchmarks and performance, please refer to our full blog post ADD LINK WHEN PUBLISH.
26
 
27
 
28
  ### Generation
@@ -118,7 +118,6 @@ This repository contains a converted version of our latest trained model. We've
118
  [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
119
 
120
  # Citation
121
- TO MODIFY
122
  ```bash
123
  @misc{allal2024SmolLM,
124
  title={SmolLM - blazingly fast and remarkably powerful},
 
22
 
23
  ## Model Summary
24
 
25
+ SmolLM is a series of state-of-the-art small language models available in three sizes: 135M, 360M, and 1.7B parameters. These models are built on Cosmo-Corpus, a meticulously curated high-quality training dataset. Cosmo-Corpus includes Cosmopedia v2 (28B tokens of synthetic textbooks and stories generated by Mixtral), Python-Edu (4B tokens of educational Python samples from The Stack), and FineWeb-Edu (220B tokens of deduplicated educational web samples from FineWeb). SmolLM models have shown promising results when compared to other models in their size categories across various benchmarks testing common sense reasoning and world knowledge. For detailed information on training, benchmarks and performance, please refer to our full blog post.
26
 
27
 
28
  ### Generation
 
118
  [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
119
 
120
  # Citation
 
121
  ```bash
122
  @misc{allal2024SmolLM,
123
  title={SmolLM - blazingly fast and remarkably powerful},