Flavio Catalani

fakezeta

AI & ML interests

None yet

fakezeta's activity

liked a model 24 days ago

hexgrad/Kokoro-82M

Text-to-Speech • Updated 9 days ago • 1.58M • 3.65k

updated a model about 1 month ago

fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8

Text Generation • Updated Jan 31 • 48

published a model about 1 month ago

fakezeta/DeepSeek-R1-Distill-Llama-8B-ov-int8

Text Generation • Updated Jan 31 • 48

liked 2 Spaces about 1 month ago

What could possibly go wrong?

Think in Sync

An addictive AI-powered word puzzle.

reacted to csabakecskemeti's post with 👀 about 1 month ago

Post

2325

I've run the Open LLM Leaderboard evaluations plus HellaSwag on deepseek-ai/DeepSeek-R1-Distill-Llama-8B and compared the results to meta-llama/Llama-3.1-8B-Instruct. At first glance, R1 does not beat Llama overall.

If anyone wants to double-check, the results are posted here:
https://github.com/csabakecskemeti/lm_eval_results

Did I make a mistake, or is at least this distilled version no better than the competition?

I'll run the same on the Qwen 7B distilled version too.

7 replies

upvoted a collection about 2 months ago

Visual Language Models

Collection of OpenVINO optimized models for visual-language assistance • 9 items • Updated Jan 27 • 3

liked a Space 3 months ago

Hacker News Listener

Navigate and analyze Hacker News posts and comments.

liked a model 3 months ago

Nexusflow/Athene-V2-Chat

Text Generation • Updated Nov 26, 2024 • 7.64k • 288

upvoted an article 3 months ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By

•

Dec 4, 2024

• 76

reacted to lunarflu's post with 🔥 3 months ago

Post

1830

Great blog post! 🔥 @wolfram
https://huggingface.co/blog/wolfram/llm-comparison-test-2024-12-04

liked 2 models 4 months ago

Xkev/Llama-3.2V-11B-cot

Image-Text-to-Text • Updated Dec 16, 2024 • 5.18k • 147

kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-4bit

Text Generation • Updated Nov 26, 2024 • 70 • 6

liked a Space 5 months ago

FacePoke

Import a portrait, click to move the head!

New activity in mistralai/Mistral-Small-Instruct-2409 6 months ago

Please make it CLEAR, this is NOT an OPEN SOURCE MODEL license

#15 opened 6 months ago by

updated 3 models 6 months ago

fakezeta/gemma-2-9b-it-SimPO-ov-int4

Updated Sep 16, 2024 • 11

fakezeta/gemma-2-9b-it-SimPO-ov-int8

Updated Sep 16, 2024 • 12

fakezeta/gemma-2-9b-it-ov-int4

Text Generation • Updated Sep 15, 2024 • 8

updated a collection 6 months ago

Gemma 2

4 items • Updated Sep 15, 2024