PSanni's Collections

Indic Datasets

updated May 10, 2024

A list of text and voice datasets for training and fine-tuning Indic LLMs.

  • ai4bharat/sangraha

    Viewer • Updated Mar 5 • 268M • 3.37k • 48

  • uonlp/CulturaX

    Viewer • Updated Dec 16, 2024 • 7.18B • 15.5k • 527

  • pary/hind_encorp

    Updated Jan 18, 2024 • 155 • 2

  • PleIAs/YouTube-Commons

    Updated Jun 26, 2024 • 1.94k • 356

  • rohansolo/BB_HindiHinglishV2

    Viewer • Updated Dec 31, 2023 • 249k • 24 • 2

  • smangrul/hinglish_self_instruct_v0

    Viewer • Updated Dec 24, 2023 • 1.02k • 39 • 9

  • pfin123/hindi-aggregated

    Viewer • Updated Jul 5, 2022 • 745k • 232 • 2

  • aneesh-b/SQuAD_Hindi

    Viewer • Updated Oct 16, 2022 • 4.73k • 6