10 24 236

Matricardi Fabio

FM-1976

https://medium.com/@fabio.matricardi

AI & ML interests

control system engineering, AI, LLM with python. ThePoorGPUguy on substack

Recent Activity

liked a model about 2 hours ago

alpha-ai/TB-Vibe-3B

updated a model 3 days ago

FM-1976/Lite-Oute-1-300M-Instruct-openvino

published a model 3 days ago

FM-1976/Lite-Oute-1-300M-Instruct-openvino

View all activity

Organizations

None yet

FM-1976's activity

upvoted 2 papers 6 days ago

How far can we go with ImageNet for Text-to-Image generation?

Paper • 2502.21318 • Published 9 days ago • 25

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published 12 days ago • 44

upvoted 3 papers 20 days ago

Large Language Diffusion Models

Paper • 2502.09992 • Published 24 days ago • 99

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published 26 days ago • 54

FoNE: Precise Single-Token Number Embeddings via Fourier Features

Paper • 2502.09741 • Published 24 days ago • 11

upvoted 3 collections about 2 months ago

upvoted 7 papers 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 134

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 48

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 53

FluidML: Fast and Memory Efficient Inference Optimization

Paper • 2411.09242 • Published Nov 14, 2024 • 1

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 29

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 59

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 22

upvoted a collection 4 months ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 18 days ago • 245

upvoted a paper 4 months ago

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4, 2024 • 11

upvoted a collection 5 months ago

LLM

Collection

Collection of OpenVINO optimized LLMs • 135 items • Updated Dec 23, 2024 • 23

upvoted an article 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 225

upvoted a collection 6 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 574