LUCA

Gargaz

AI & ML interests

None yet

Recent Activity

liked a model 5 minutes ago

Gargaz/llama-2-7b

updated a model about 6 hours ago

EryonAI/Eryon-1B

new activity about 11 hours ago

Gargaz/llama-eryon: Adding `safetensors` variant of this model

Gargaz's activity

upvoted an article about 19 hours ago

Article

Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code

Oct 2, 2024 • 42

upvoted a collection 2 days ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Nov 14, 2024 • 538

upvoted a paper 2 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 18 days ago • 132

upvoted 5 collections 2 days ago

Qwen2.5

10 items • Updated Nov 25, 2024 • 2

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 13 days ago • 74

Granite 3.1 Language Models

A series of language models with 128K context length, trained by IBM and licensed under the Apache 2.0 license. • 8 items • Updated 13 days ago • 40

Bamba

Collection of Bamba models, based on a hybrid Mamba2 architecture and trained on open data • 8 items • Updated 13 days ago • 17

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 12 days ago • 107

upvoted a paper 2 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 12 days ago • 333

upvoted 5 collections 2 days ago

whisper-guaraní

5 items • Updated Nov 25, 2024 • 2

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated 8 days ago • 10

The Big Benchmarks Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 181

Centaurus

Series of uncensored models based on Llama-3. • 5 items • Updated May 27, 2024 • 4

Recommended small models

Every recent model smaller than ~25B parameters that is high quality and reputable • 19 items • Updated Nov 30, 2024 • 31

upvoted a collection 10 months ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with the best evaluations on the LLM leaderboard • 64 items • Updated about 1 hour ago • 491