ssmits's Collections
Datasets
Updated Nov 7, 2024
Tokenized Dutch datasets based on BramVanroy/occiglot-fineweb-v0.5-nl
ssmits/tokenized-falcon2-dutch-4096 • Updated Nov 3, 2024 • 1.05M • 203
ssmits/tokenized-falcon2-dutch-2048 • Updated Nov 3, 2024 • 1.91M • 374
ssmits/tokenized-llama3-dutch-2048 • Updated Nov 3, 2024 • 1.27M • 512