BuiDoan

AI & ML interests

None yet

Recent Activity

updated a collection about 23 hours ago

Great paper

upvoted a paper about 23 hours ago

Hermes 3 Technical Report

liked a dataset 1 day ago

saiyan-world/Goku-MovieGenBench

BuiDoan's activity

upvoted a paper about 23 hours ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 46

upvoted 2 articles 3 days ago

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

Article • 4 days ago • 42

From Files to Chunks: Improving Hugging Face Storage Efficiency

Article • Nov 20, 2024 • 51

upvoted a collection 6 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 10 days ago • 234

upvoted 2 papers 10 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 13 days ago • 112

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 13 days ago • 174

upvoted 2 papers 13 days ago

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 21 days ago • 56

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 18 days ago • 105

upvoted a paper 29 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 273

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 94

upvoted a collection about 1 month ago

AI Paper of the Day

Collection

A collection of papers that I think are interesting, one added each day • 296 items • Updated about 9 hours ago • 36

upvoted a paper about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

upvoted 4 papers 2 months ago

upvoted a collection 2 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 227

upvoted 3 papers 2 months ago

Small Language Models: Survey, Measurements, and Insights

Paper • 2409.15790 • Published Sep 24, 2024 • 1

Fully Open Source Moxin-7B Technical Report

Paper • 2412.06845 • Published Dec 8, 2024 • 11

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126