SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs Paper • 2412.08347 • Published 14 days ago • 4
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards Paper • 2402.01781 • Published Feb 1 • 1
Fineweb-Edu-Ar: Machine-translated Corpus to Support Arabic Small Language Models Paper • 2411.06402 • Published Nov 10 • 2
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos Paper • 2011.13367 • Published Nov 26, 2020
SCTN: Sparse Convolution-Transformer Network for Scene Flow Estimation Paper • 2105.04447 • Published May 10, 2021 • 1
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Paper • 2112.00431 • Published Dec 1, 2021
SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos Paper • 2204.06918 • Published Apr 14, 2022
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual Queries Paper • 2212.06969 • Published Dec 14, 2022 • 1
SoccerNet Game State Reconstruction: End-to-End Athlete Tracking and Identification on a Minimap Paper • 2404.11335 • Published Apr 17
Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders Paper • 2403.17823 • Published Mar 26
TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild Paper • 1803.10794 • Published Mar 28, 2018
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22 • 17
Vivid-ZOO: Multi-View Video Generation with Diffusion Model Paper • 2406.08659 • Published Jun 12 • 8
3DShape2VecSet: A 3D Shape Representation for Neural Fields and Generative Diffusion Models Paper • 2301.11445 • Published Jan 26, 2023