52 278 279

Yassine Ennaour

Lyte

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Scalable-Softmax Is Superior for Attention

upvoted an article 2 days ago

Open-R1: Update #1

liked a model 3 days ago

medmac01/darija_xtt_2.0

View all activity

Organizations

Lyte's activity

upvoted a paper 1 day ago

Scalable-Softmax Is Superior for Attention

Paper • 2501.19399 • Published 4 days ago • 17

upvoted an article 2 days ago

Article

Open-R1: Update #1

•

3 days ago

• 215

upvoted an article 7 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

8 days ago

• 621

upvoted a collection 7 days ago

YuE

Collection

YuE: Open Full-song Generation Foundation Model • 9 items • Updated 8 days ago • 15

upvoted an article 8 days ago

Article

Welcome to Inference Providers on the Hub 🔥

8 days ago

• 238

upvoted a collection 10 days ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 3 items • Updated 9 days ago • 316

upvoted a paper 13 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 14 days ago • 292

upvoted an article 24 days ago

Article

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

•

26 days ago

• 26

upvoted a paper 27 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 28 days ago • 253

upvoted a paper 28 days ago

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 29 days ago • 48

upvoted 3 papers about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 89

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

upvoted a paper 2 months ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 57

upvoted 2 papers 3 months ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

upvoted 3 papers 4 months ago

upvoted a collection 4 months ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 566