BigCode Data

non-profit
BigCodeProject
bigcode-project

Recent Activity

thomwolf authored a paper 10 days ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
lvwerra authored a paper 10 days ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
loubnabnl authored a paper about 1 month ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Team members

Thomas Wolf, Leandro von Werra, Yacine Jernite, Lewis Tunstall, Nouamane Tazi, Anton Lozhkov, Loubna Ben Allal, Edward Beeching, Nazneen Rajani, Harm de Vries, Raymond Li, Denis Kocetkov, Sean Hughes, Max Tian, Juan A. Rodriguez, Nathan Habib

bigcode-data's datasets (1)

bigcode-data/license_list

Updated Oct 18, 2023 • 824 • 26