I run Qwen3-Coder 480B locally on my Z8, with a 1-million token context window. It’s the equivalent of parallel-parking a Nimitz-class carrier in a kiddie pool. Thanks to whatever dark pact the llama.cpp, CUDA, and kernel folks signed, hybrid inference and VRAM↔RAM offload let me stream the model’s synapses across the Xeon, RAM, and four lonely A6000s without summoning either the OOM killer or a small house fire.
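For the curious, the offload recipe looks roughly like this through the llama-cpp-python bindings. This is a sketch, not my exact launch config: the GGUF filename, layer count, split ratios, and thread count are all placeholders.

```python
from llama_cpp import Llama

# Rough sketch of hybrid CPU/GPU inference with llama.cpp's Python bindings.
# Filenames and numbers below are illustrative, not my actual settings.
llm = Llama(
    model_path="qwen3-coder-480b-q4_k_m.gguf",  # placeholder quantized GGUF
    n_ctx=1_000_000,        # the headline context window; the KV cache at this size is enormous
    n_gpu_layers=40,        # only this many layers live in VRAM...
    tensor_split=[0.25, 0.25, 0.25, 0.25],  # ...spread evenly across the four A6000s
    offload_kqv=True,       # keep the KV cache on GPU where it fits
    n_threads=32,           # the Xeon picks up the CPU-resident layers
)

out = llm("// write a quicksort in C\n", max_tokens=128)
print(out["choices"][0]["text"])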
Qwen2.5-Omni is soooo good that people build multimodal reasoning models off of it 🥹
> KE-Team/Ke-Omni-R-3B is an open-source audio reasoning model, SOTA on average across benchmarks, based on Qwen/Qwen2.5-Omni-3B 🗣️
> Haoz0206/Omni-R1 is a video reasoning model with pixel-level grounding, and it's super competitive ⏯️ based on Qwen/Qwen2.5-Omni-7B
I've got my hands on an AMD Instinct MI100. It's about the same price used as a V100, but on paper it has more raw compute (V100 ~14 TFLOPS vs MI100 ~23 TFLOPS FP32), and its HBM2 is clocked higher, giving 1.2 TB/s of memory bandwidth. For quantized inference it's a beast (the MI50 was also surprisingly fast).
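One pleasant surprise on the software side: a ROCm build of PyTorch exposes the card through the familiar torch.cuda API, so the usual sanity check works unchanged (a minimal sketch, assuming a ROCm PyTorch install):

```python
import torch

# On ROCm builds of PyTorch the HIP backend is aliased to the CUDA API,
# so the MI100 (gfx908) shows up through the familiar torch.cuda calls.
assert torch.cuda.is_available(), "ROCm device not visible"
print(torch.cuda.get_device_name(0))  # e.g. "AMD Instinct MI100"
props = torch.cuda.get_device_properties(0)
print(f"{props.total_memory / 2**30:.0f} GiB HBM2")  # 32 GiB on the MI100
```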
For LoRA training in this quick test I couldn't get the bnb (bitsandbytes) config to work, so I'm running the fine-tune on the full-size model.
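For reference, here's roughly the bitsandbytes setup I was going for (a sketch of a stock 4-bit QLoRA config; the model id and hyperparameters are placeholders). My guess is that bnb's still-maturing ROCm support is what tripped things up on the MI100:

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Stock 4-bit QLoRA recipe; model id and hyperparameters are placeholders.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # NF4 quantization for the base weights
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,         # quantize the quantization constants too
)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen2.5-7B",                      # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)
lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()
```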
Will share all the install steps, setup, and settings I've learned in a blog post, together with the 3D design for the cooling shroud.