How do you encode an image in only 81 tokens?

#2
by ChristineLai - opened

How do you encode an image in only 81 tokens?

The trick is in the projector: we use a reshape step to convert the 729 image tokens into 81 tokens, so each output token carries the features of 9 input tokens.
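
To make the reshape idea concrete, here is a minimal sketch of one way such a step could work. It rests on assumptions not confirmed in this thread: that the 729 tokens form a 27x27 patch grid in row-major order, that each 3x3 neighbourhood is concatenated along the feature dimension to give a 9x9 grid (81 tokens), and that the feature width is 16 (a placeholder). The function name `merge_tokens` is hypothetical and the real projector may differ.

```python
import numpy as np

def merge_tokens(tokens: np.ndarray, block: int = 3) -> np.ndarray:
    """Fold each block x block patch neighbourhood into one wider token.

    Assumes `tokens` has shape (n, dim), where n is a perfect square and
    the tokens are laid out row by row on a square grid.
    """
    n, dim = tokens.shape
    side = int(round(n ** 0.5))
    if side * side != n or side % block != 0:
        raise ValueError("token count must form a square grid divisible by block")
    out = side // block
    # Split rows and columns into (coarse index, offset inside the block).
    grid = tokens.reshape(out, block, out, block, dim)
    # Move both in-block offsets next to the feature axis.
    grid = grid.transpose(0, 2, 1, 3, 4)
    # Concatenate the block's features into a single token.
    return grid.reshape(out * out, block * block * dim)

# Placeholder feature width of 16; the real hidden size is not given here.
image_tokens = np.zeros((729, 16), dtype=np.float32)
merged = merge_tokens(image_tokens)
print(merged.shape)  # (81, 144)
```

No information is discarded in this sketch: the token count drops by 9x while the per-token width grows by 9x, and a projector would then typically map the wider tokens to the language model's embedding size.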
