2 82 241

kelechic

tensorkelechi

https://kelechi-c.github.io/

AI & ML interests

vision

Recent Activity

updated a collection about 2 months ago

SAE

updated a collection 2 months ago

SAE

updated a collection 2 months ago

SAE

View all activity

Organizations

upvoted a paper 4 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 197

upvoted 2 papers 5 months ago

Neural Vocoder is All You Need for Speech Super-resolution

Paper • 2203.14941 • Published Mar 28, 2022 • 1

MusicInfuser: Making Video Diffusion Listen and Dance

Paper • 2503.14505 • Published Mar 18 • 11

upvoted 2 articles 5 months ago

Article

Open-Source Handwritten Signature Detection Model

•

Mar 14

• 117

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

and 3 others •

Mar 12

• 452

upvoted a paper 5 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6 • 25

upvoted an article 6 months ago

Article

Using LoRA for Efficient Stable Diffusion Fine-Tuning

and 1 other •

Jan 26, 2023

• 68

upvoted a collection 6 months ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated 27 days ago • 630

upvoted a paper 6 months ago

SoundStorm: Efficient Parallel Audio Generation

Paper • 2305.09636 • Published May 16, 2023 • 13

upvoted a collection 6 months ago

CLAP: Contrastive Language-Audio Pretraining

Collection

CLAP is to audio what CLIP is to image. • 5 items • Updated Oct 31, 2023 • 12

upvoted an article 6 months ago

Article

Design choices for Vision Language Models in 2024

•

Apr 16, 2024

• 30

upvoted a paper 6 months ago

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Paper • 2402.01831 • Published Feb 2, 2024 • 15

upvoted 2 articles 6 months ago

Article

SmolVLM - small yet mighty Vision Language Model

and 4 others •

Nov 26, 2024

• 346

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

and 2 others •

Jan 23

• 182

upvoted a paper 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 241

upvoted 2 articles 7 months ago

Article

State of open video generation models in Diffusers

and 2 others •

Jan 27

• 59

Article

Upgrading Kokoro: natural TTS for short bursts

•

Nov 22, 2024

• 31

upvoted a paper 8 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 122

upvoted a collection 8 months ago

Cosmos-Tokenizer

Collection

A suite of image and video tokenizers • 13 items • Updated 3 days ago • 41

upvoted a paper 8 months ago

MobileVLM V2: Faster and Stronger Baseline for Vision Language Model

Paper • 2402.03766 • Published Feb 6, 2024 • 15

kelechic

AI & ML interests

Recent Activity

Organizations

tensorkelechi's activity

Open-Source Handwritten Signature Detection Model

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Design choices for Vision Language Models in 2024

SmolVLM - small yet mighty Vision Language Model

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

State of open video generation models in Diffusers

Upgrading Kokoro: natural TTS for short bursts