Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 13 days ago • 75
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 28 days ago • 289
Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13 • 98
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 28 days ago • 444
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Paper • 2406.07522 • Published Jun 11 • 37
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22 • 253