Data Is Better Together

community

AI & ML interests

Building better datasets together

Recent Activity

davanstrien  updated a dataset about 1 hour ago
data-is-better-together/fineweb-c-progress
guipenedo  authored a paper 28 days ago
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
guipenedo  authored a paper about 2 months ago
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Team members: Daniel van Strien, Daniel Vila, Alvaro Bartolome, Francisco Aranda, Guilherme Penedo, ben burtenshaw, Librarian Bot (Bot), Ame Vi

data-is-better-together's Spaces (6)

pinned
Running
12

FineWeb 2 - Community Leaderboard

🌐

View and contribute to the FineWeb2-C Leaderboard

Jan 20
pinned
Running on CPU Upgrade
38

FineWeb-c - Annotation

🌐

Launch Argilla for data labeling and annotation

Dec 11, 2024
pinned
Running
38

Image Preferences - Argilla annotation space

🖼

A community project to create an image preferences dataset.

Nov 25, 2024
Running
4

FineWeb C Contributors

🏆

Browse and recognize contributors to the FineWeb-C dataset

17 days ago
Running
6

FineWeb 2 Communications Pack

🌐

Share annotation sprint communications

Dec 20, 2024
Runtime error
61

Prompt Collective

🗣

Jun 7, 2024