Chmielewski

Eryk-Chmielewski

Eryk-Chmielewski's activity

liked a model 1 day ago

BlinkDL/temp-latest-training-models

Updated 1 day ago • 52

liked 2 models 4 days ago

unsloth/Mistral-Small-24B-Instruct-2501-unsloth-bnb-4bit

Text Generation • Updated 9 days ago • 6.91k • 9

unsloth/DeepSeek-R1-Distill-Qwen-32B-unsloth-bnb-4bit

Text Generation • Updated 9 days ago • 1.35k • 5

upvoted 13 papers 8 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 21 days ago • 49

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 18 days ago • 22

ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Paper • 2501.15570 • Published 16 days ago • 23

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 14 days ago • 101

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 12 days ago • 22

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 12 days ago • 25

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 12 days ago • 51

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 11 days ago • 34

s1: Simple test-time scaling

Paper • 2501.19393 • Published 11 days ago • 97

liked 4 models 8 days ago

arcee-ai/Virtuoso-Medium-v2

Text Generation • Updated 13 days ago • 783 • 41

bytedance-research/UI-TARS-7B-DPO

Image-Text-to-Text • Updated 17 days ago • 31.5k • 128

HKUSTAudio/Llasa-3B

Text-to-Speech • Updated 4 days ago • 7.75k • 425

Almawave/Velvet-14B

Text Generation • Updated 4 days ago • 3.34k • 114