loubnabnl HF staff commited on
Commit
aa053f8
1 Parent(s): 3ab57b2

add info about datasets and models (#1)

Browse files

- add info about datasets and models (cf70bbdf57bc34cf7b430ca4e4d0c0f0d8ccad40)

Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -14,4 +14,8 @@ pinned: false
14
  Big Code is an open scientific collaboration working on responsible training of large language models for coding applications.
15
 
16
  You can find more information on the main website at <a href="https://www.bigcode-project.org/" class="underline">https://www.bigcode-project.org</a>. You can also follow Big Code on Twitter at <a href="https://twitter.com/BigCodeProject" class="underline">https://twitter.com/BigCodeProject</a>.
 
 
 
 
17
  </p>
 
14
  Big Code is an open scientific collaboration working on responsible training of large language models for coding applications.
15
 
16
  You can find more information on the main website at <a href="https://www.bigcode-project.org/" class="underline">https://www.bigcode-project.org</a>. You can also follow Big Code on Twitter at <a href="https://twitter.com/BigCodeProject" class="underline">https://twitter.com/BigCodeProject</a>.
17
+
18
+ In this organization, you can find <a href="https://huggingface.co/datasets/bigcode/the-stack" class="underline">The Stack</a>, a 3.1TB of source code in 30 programming languages, its near deduplicated version and a small subset.
19
+
20
+ If you want to access the models trained on these datasets, please send a request to [email protected].
21
  </p>