Zheng Zian(Andy)
OrionZheng
AI & ML interests
LLM, Mixture-of-Experts, Data-Centric AI
Recent Activity
liked a model 7 days ago: Qwen/QwQ-32B
liked a model 12 days ago: MatrixTeam/TheMatrix
new activity about 1 month ago: OrionZheng/openmoe-8b: Model source code
Organizations
None yet
OrionZheng's activity
Model source code
2
#2 opened about 1 month ago by not-found

model_type "llama"
1
#1 opened 7 months ago by Phando

Update config.json
#1 opened 8 months ago by OrionZheng

Update ada_vocab_factory.py
#1 opened 8 months ago by OrionZheng

convert t5x into pytorch model
1
#1 opened about 1 year ago by Siddharth63

Fixed some data in bad format
1
#6 opened over 1 year ago by OrionZheng

Is there any overlap between peS2o dataset and the arxiv subset from Redpajama?
#2 opened over 1 year ago by OrionZheng

Add snippet to locate errors in the data files
1
#3 opened over 1 year ago by OrionZheng

How to obtain the original git-commits dataset?
1
#5 opened over 1 year ago by OrionZheng

Confusion and Discrepancy Regarding Deduplication Versions and Dataset Sizes
1
#26 opened over 1 year ago by OrionZheng

Cannot run the inference on the playground
2
#1 opened almost 2 years ago by OrionZheng

Error while loading "alt-parallel" config: TypeError: Couldn't cast array
2
#3 opened almost 2 years ago by albertvillanova

🚩 Report
2
#2 opened almost 2 years ago by OrionZheng