PZ's picture

PZ PRO

philipp-zettl

·

philipp-zettl

AI & ML interests

NLP/CV/Multimodal learning

Recent Activity

liked a model 12 days ago

nashikone/iroiroLoRA

liked a model 13 days ago

deepseek-ai/DeepSeek-V3

upvoted a paper 13 days ago

1.58-bit FLUX

View all activity

Organizations

philipp-zettl's activity

upvoted a paper 13 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 19 days ago • 69

upvoted a paper 23 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 25 days ago • 121

upvoted 2 collections about 1 month ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 106

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28

upvoted an article about 1 month ago

Article

They Said It Couldn’t Be Done

By

•

Dec 5, 2024

• 76

upvoted 2 collections about 1 month ago

LLäMmlein Chat Preview 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 8 items • Updated Nov 22, 2024 • 10

INTELLECT-1

13 items • Updated Dec 5, 2024 • 7

upvoted a paper about 2 months ago

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 15

upvoted a collection 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 21 days ago • 198

upvoted a collection 3 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 225

upvoted an article 3 months ago

Article

HTRflow - A tool for HTR and OCR

By

•

Oct 1, 2024

• 15

upvoted a collection 3 months ago

Realistic Vision (SD1.5)

8 items • Updated Dec 4, 2023 • 33

upvoted an article 4 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 180

upvoted a paper 4 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 87

upvoted a collection 5 months ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated 6 days ago • 120

upvoted an article 5 months ago

Article

Introducing TextImage Augmentation for Document Images

Aug 6, 2024

• 32

upvoted an article 6 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

• 32

upvoted an article 7 months ago

Article

Thoughts on LoRA Training #1

By

•

Jun 18, 2024

• 33

upvoted a collection 7 months ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 354

upvoted an article 7 months ago

Article

quanto: a pytorch quantization toolkit

Mar 18, 2024

• 33