I'd like to know if there is a process for training this model with multi-modal data, is this possible? If so, can you link a notebook?
· Sign up or log in to comment