Julius Duin's picture

Julius Duin

duinamit

·

duinamit

AI & ML interests

None yet

Recent Activity

commented on a paper 22 days ago

Continuous Diffusion Model for Language Modeling

upvoted a paper 23 days ago

Idiosyncrasies in Large Language Models

upvoted a paper 24 days ago

Intuitive physics understanding emerges from self-supervised pretraining on natural videos

View all activity

Organizations

duinamit's activity

upvoted a paper 23 days ago

Idiosyncrasies in Large Language Models

Paper • 2502.12150 • Published 24 days ago • 1

upvoted 2 papers 24 days ago

Intuitive physics understanding emerges from self-supervised pretraining on natural videos

Paper • 2502.11831 • Published 24 days ago • 18

Distillation Scaling Laws

Paper • 2502.08606 • Published 29 days ago • 46

upvoted 4 papers about 1 month ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published Feb 6 • 35

BTS: Harmonizing Specialized Experts into a Generalist LLM

Paper • 2502.00075 • Published Jan 31 • 1

Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Paper • 2501.18837 • Published Jan 31 • 10

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 35

upvoted 11 papers about 2 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 42

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 70

Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Paper • 2410.06940 • Published Oct 9, 2024 • 8

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123

Medical SAM 2: Segment medical images as video via Segment Anything Model 2

Paper • 2408.00874 • Published Aug 1, 2024 • 50

Kolmogorov-Arnold Transformer

Paper • 2409.10594 • Published Sep 16, 2024 • 43

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 108

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 610

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 80

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 84

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 79

upvoted 2 papers 2 months ago

Decentralized Diffusion Models

Paper • 2501.05450 • Published Jan 9 • 1

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263