Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

FUXI
/
yuyan-10b

PyTorch
Chinese
bert
Model card Files Files and versions Community
yuyan-10b / tools /openwebtext
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
Shawn001's picture
Shawn001
Upload 21 files
1101a21 almost 2 years ago
  • README.md
    3.43 kB
    Upload 21 files almost 2 years ago
  • add_id.py
    2.2 kB
    Upload 21 files almost 2 years ago
  • blacklist_urls.py
    7.34 kB
    Upload 21 files almost 2 years ago
  • cleanup_dataset.py
    4.24 kB
    Upload 21 files almost 2 years ago
  • cleanup_fix_dataset.py
    7.23 kB
    Upload 21 files almost 2 years ago
  • filter_ngrams.py
    18.9 kB
    Upload 21 files almost 2 years ago
  • find_duplicates.py
    12 kB
    Upload 21 files almost 2 years ago
  • group_duplicate_url.py
    3.22 kB
    Upload 21 files almost 2 years ago
  • merge_jsons.py
    1.58 kB
    Upload 21 files almost 2 years ago
  • remove_group_duplicates.py
    2.54 kB
    Upload 21 files almost 2 years ago