VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis Paper • 2403.08764 • Published Mar 13 • 36