NB PRO

Skier8402

https://nyab.notion.site

Shuyib

AI & ML interests

Practicing Computer Vision, Optimization, NLP and multimodal system implementation.

Skier8402's activity

updated a collection 4 minutes ago

Speech apps

Various applications to help deal with speech better. • 18 items • Updated 4 minutes ago

updated a collection 10 minutes ago

multimodal

7 items • Updated 10 minutes ago

updated a collection 11 minutes ago

multimodal

7 items • Updated 10 minutes ago

liked a model 19 minutes ago

sesame/csm-1b

Text-to-Speech • Updated about 17 hours ago • 440

upvoted an article 1 day ago

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

7 days ago • 32

updated a dataset 2 days ago

Skier8402/prompt-garden

Viewer • Updated 2 days ago • 12 • 170 • 1

liked a Space 3 days ago

Phi 4 Multimodal

Interact with AI using text, images, or audio

liked 2 Spaces 4 days ago

Magma UI

Magma-8B model for UI Agents

OmniParser V2

OmniParser, turn your LLM into GUI agent

New activity in Alfaxad/gemma2-27b-swahili-it 5 days ago

More datasets

#1 opened 18 days ago by

liked a Space 8 days ago

Agent Dino

@image @rAgent @web @text @tts1 @tts2 @3d

liked a dataset 10 days ago

allenai/olmOCR-mix-0225

Viewer • Updated 17 days ago • 259k • 4.51k • 90

liked a model 13 days ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 1 day ago • 472k • 1.13k

upvoted 2 collections 14 days ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated about 19 hours ago • 67

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated about 19 hours ago • 92

liked a model 14 days ago

allenai/olmOCR-7B-0225-preview-GGUF

Updated 16 days ago • 6.84k • 21

liked a model 15 days ago

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 27 days ago • 1.02M • 263

liked a Space 15 days ago

Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

updated a collection 16 days ago

Realtime-apps

7 items • Updated 16 days ago

liked a Space 16 days ago

Talk to OpenAI (Gradio UI)
