Split part of vision encoder to CPU and optimize Transpose ops. 043b275 verified happyme531 committed on Sep 24, 2024