Split part of vision encoder to CPU and optimize Transpose ops. (Reupload to correct path) 385f65d verified happyme531 commited on Sep 24, 2024