nyuuzyou
PRO
nyuuzyou
·
AI & ML interests
None yet
Recent Activity
posted
an
update
4 days ago
🌐 Public MediaWiki Collection Dataset - https://huggingface.co/datasets/nyuuzyou/wikis
Collection of 1.66M+ articles from 930 public MediaWiki instances featuring:
- Full article content from diverse public wikis across the internet
- Complete metadata including templates, categories, and section structure
- Rich structural information preserving wiki organization and links
- Multilingual content across 35+ languages including English, Chinese, Spanish, and more
- Regional language variants including US/UK English, Brazilian Portuguese, and Traditional/Simplified Chinese
Key contents:
- 1,662,448 wiki articles with full text
- Extensive metadata including templates, categories, sections
- Internal wikilinks and external reference information
- Cross-domain knowledge spanning multiple topics and fields
View all activity
Organizations
nyuuzyou's activity
-
-
-
-
-
-
-
-
-
-
-
view article
From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub
view article
Open-R1: a fully open reproduction of DeepSeek-R1
upvoted
a
paper
8 months ago