Purpose of special tokens
#3, opened by tdeboissiere
Hello!
Thanks for the detailed blog post, very helpful.
I was curious about the special tokens (e.g. ['<od>', '</od>', '<ocr>', '</ocr>']) in the Florence2Processor:
- These tokens don't seem to be used anywhere, so what is their purpose?
- Related: how was Florence-2 initially trained for, say, object detection? Were the inputs to the model the image + a text prompt such as "Locate the objects with category name in the image." + the category + the actual location of the objects in the image?