Spaces:
Running
Running
add info about datasets and models (#1)
Browse files- add info about datasets and models (cf70bbdf57bc34cf7b430ca4e4d0c0f0d8ccad40)
README.md
CHANGED
@@ -14,4 +14,8 @@ pinned: false
|
|
14 |
Big Code is an open scientific collaboration working on responsible training of large language models for coding applications.
|
15 |
|
16 |
You can find more information on the main website at <a href="https://www.bigcode-project.org/" class="underline">https://www.bigcode-project.org</a>. You can also follow Big Code on Twitter at <a href="https://twitter.com/BigCodeProject" class="underline">https://twitter.com/BigCodeProject</a>.
|
|
|
|
|
|
|
|
|
17 |
</p>
|
|
|
14 |
Big Code is an open scientific collaboration working on responsible training of large language models for coding applications.
|
15 |
|
16 |
You can find more information on the main website at <a href="https://www.bigcode-project.org/" class="underline">https://www.bigcode-project.org</a>. You can also follow Big Code on Twitter at <a href="https://twitter.com/BigCodeProject" class="underline">https://twitter.com/BigCodeProject</a>.
|
17 |
+
|
18 |
+
In this organization, you can find <a href="https://huggingface.co/datasets/bigcode/the-stack" class="underline">The Stack</a>, a 3.1TB of source code in 30 programming languages, its near deduplicated version and a small subset.
|
19 |
+
|
20 |
+
If you want to access the models trained on these datasets, please send a request to [email protected].
|
21 |
</p>
|