Patryk Binkowski's picture

1 9 90

Patryk Binkowski

ismu

·

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Snowflake/snowflake-arctic-embed-m-v2.0

liked a model 3 days ago

apple/aimv2-large-patch14-224

liked a model 4 days ago

HuggingFaceTB/SmolVLM-256M-Base

View all activity

Organizations

None yet

ismu's activity

upvoted a collection 19 days ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 21 days ago • 50

upvoted a collection about 1 month ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated about 14 hours ago • 202

upvoted 2 collections 3 months ago

GTE models

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21 • 23

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated Jan 17 • 27

upvoted a paper 3 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 114

upvoted a collection 4 months ago

LayerSkip

Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 47

upvoted 2 collections 7 months ago

BigVGAN

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated Jan 17 • 11

LLaVa-Interleave

LLaVa models that extends the model capabilities to Multi-image, Multi-frame (videos), Multi-patch (single-image) scenarios. • 3 items • Updated Jul 10, 2024 • 14

upvoted a paper over 1 year ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 80