liyang31163150
liyang31163150
AI & ML interests
None yet
Recent Activity
liked
a model
11 days ago
deepseek-ai/DeepSeek-V3
new activity
14 days ago
deepseek-ai/DeepSeek-V3:无辅助损失的专家路由
Organizations
None yet
liyang31163150's activity
无辅助损失专家偏置代码实现的小问题 A Small Issue in the Code Implementation of Auxiliary-Loss-Free Load Balancing Expert Bias
#89 opened 14 days ago
by
liyang31163150
无辅助损失的专家路由
2
#56 opened 2 months ago
by
qing9
moss-moon-003-sft-plugin-int4运行提示not a folder containing a `.index.json` file
13
#1 opened almost 2 years ago
by
citibank
Can you provide the merged version(with llama version)
1
#2 opened almost 2 years ago
by
liyang31163150
Maybe some tokenizer files are missing?
10
#1 opened almost 2 years ago
by
lpy86786
AutoModelForCausalLM.from_pretrained() error
1
#2 opened almost 2 years ago
by
liyang31163150