jonatasgrosman/wav2vec2-large-xlsr-53-hungarian • Automatic Speech Recognition • Updated Dec 14, 2022 • 115k • 9
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels • Paper • 2406.09415 • Published Jun 13, 2024 • 51
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling • Paper • 2405.21048 • Published May 31, 2024 • 16