2 42 15

Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI

Recent Activity

liked a Space 30 minutes ago

Eliahu/Model-Atlas

upvoted a paper 33 minutes ago

Charting and Navigating Hugging Face's Model Atlas

upvoted a paper about 10 hours ago

Motion Anything: Any to Motion Generation

View all activity

Organizations

None yet

coderchen01's activity

upvoted a paper 33 minutes ago

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published about 18 hours ago • 22

upvoted a paper about 10 hours ago

Motion Anything: Any to Motion Generation

Paper • 2503.06955 • Published 4 days ago • 15

upvoted a paper about 24 hours ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 1 day ago • 41

upvoted an article 12 days ago

Article

The Annotated Diffusion Model

Jun 7, 2022

• 159

upvoted a paper 12 days ago

Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think

Paper • 2502.20172 • Published 15 days ago • 27

upvoted 2 papers 13 days ago

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Paper • 2502.20126 • Published 15 days ago • 20

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published 15 days ago • 29

upvoted an article 15 days ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 453

upvoted an article 18 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 197

upvoted 2 papers 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 135

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 129

upvoted a paper 4 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 53

upvoted an article 4 months ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

•

May 21, 2024

• 35

upvoted 3 papers 4 months ago

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published Nov 15, 2024 • 12

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published Nov 12, 2024 • 20

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 49

upvoted an article 5 months ago

Article

🕳️ Attention Sinks in LLMs for endless fluency

•

Oct 9, 2023

• 8

upvoted a paper 5 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 146

upvoted 2 articles 5 months ago

Article

Scaling AI-based Data Processing with Hugging Face + Dask

Oct 9, 2024

• 30

Article

How 🤗 Accelerate runs very large models thanks to PyTorch

Sep 27, 2022

• 10