Forgetting Transformer: Softmax Attention with a Forget Gate — Paper • 2503.02130 • Published 10 days ago
The Ultra-Scale Playbook 🌌 — Space • The ultimate guide to training LLMs on large GPU clusters
nyu-dice-lab/allenai_WildChat-1M-Full-Qwen_Qwen2.5-72B-Instruct-lc — Dataset • Viewer • Updated Jan 2 • 806k rows
WildChat-50m — Collection • All model responses associated with the WildChat-50m paper • 55 items • Updated Jan 29