Chang S
kkokkie2360
·
AI & ML interests
None yet
Recent Activity
updated
a model
27 days ago
deepseek-ai/DeepSeek-R1
new activity
27 days ago
deepseek-ai/DeepSeek-R1:Update model_max_length in tokenizer_config.json
new activity
7 months ago
meta-llama/Llama-3.1-405B-Instruct-FP8:8-kv-heads
Organizations
kkokkie2360's activity
Update model_max_length in tokenizer_config.json
#139 opened 27 days ago
by
kkokkie2360
8-kv-heads
8
#14 opened 7 months ago
by
ArthurZ

8 kv heads
2
#13 opened 7 months ago
by
kkokkie2360
8 kv heads
2
#13 opened 7 months ago
by
kkokkie2360