Flavio Catalani

fakezeta

AI & ML interests

None yet

Recent Activity

liked a model 24 days ago
hexgrad/Kokoro-82M
updated a model about 1 month ago
fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8
published a model about 1 month ago
fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8

Organizations

LocalAI Community

fakezeta's activity

reacted to csabakecskemeti's post with 👀 about 1 month ago
Post
2325
I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to meta-llama/Llama-3.1-8B-Instruct. At first glance, R1 does not beat Llama overall.

If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake, or is R1 (at least this distilled version) no better than the competition?

I'll run the same on the Qwen 7B distilled version too.
upvoted an article 3 months ago
Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By wolfram • 76
reacted to lunarflu's post with 🔥 3 months ago