Vaibhav Singh

veb-101

AI & ML interests

None yet

Organizations

None yet

veb-101's activity

upvoted a paper about 1 month ago

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published Nov 12 • 13

upvoted a collection about 2 months ago

Cosmos Tokenizer

Collection

A suite of image and video tokenizers • 12 items • Updated 9 days ago • 27

liked a model about 2 months ago

lion-ai/MedImageInsights

Updated Nov 4 • 39

upvoted a paper 3 months ago

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Paper • 2410.02073 • Published Oct 2 • 41

liked a model 3 months ago

pyannote/speaker-diarization

Automatic Speech Recognition • Updated May 10 • 7.03M • 899

liked 3 Spaces 4 months ago

🏆 The timm Leaderboard • Running

📚 timm CAM • Runtime error

👁 timm Attention Visualization • Running

upvoted an article 4 months ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • Aug 12 • 108

upvoted a collection 6 months ago

MobileNetV4 pretrained weights

Collection

Weights for MobileNet-V4 pretrained in timm • 17 items • Updated Sep 22 • 18

upvoted 2 papers 7 months ago

DiTFastAttn: Attention Compression for Diffusion Transformer Models

Paper • 2406.08552 • Published Jun 12 • 23

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28 • 12

upvoted an article 7 months ago

MobileNet-V4 (now in timm)

Article • Jun 17 • 39

updated a model 8 months ago

veb-101/MobileViT-v1-Keras-3

Updated May 15 • 1

liked a model 8 months ago

bigcode/starcoder2-15b-instruct-v0.1

Text Generation • Updated Nov 3 • 924 • 100

upvoted 3 papers 9 months ago

upvoted 2 papers 10 months ago

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect

Paper • 2403.03853 • Published Mar 6 • 61

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96