Using BEiT-3 to get text and image embeddings

For text embedding

Create a file named test_model.py inside the itr folder.
Use the following code:

from beit3_model import Beit3Model

if __name__ == '__main__':
    # Load the model on the CPU.
    vlm = Beit3Model(device='cpu')

    # Embed a sentence and print the shape of the resulting embedding.
    print(vlm.get_embedding('A man who loves a girl.').shape)

For image embedding

Create a file named test_model.py inside the itr folder.
Use the following code:

from beit3_model import Beit3Model
from torchvision.datasets.folder import default_loader

if __name__ == '__main__':
    # With the default PIL backend, default_loader returns an RGB PIL image.
    image = default_loader('./path/to/your/image.jpg')

    vlm = Beit3Model(device='cpu')
    # The same get_embedding method is used for images.
    print(vlm.get_embedding(image).shape)
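
Comparing text and image embeddings

Once both kinds of embedding are available, a common next step is to score how well a sentence matches each image. The sketch below ranks images by cosine similarity to a text query. It is only an illustration: it assumes the text and image embeddings returned by get_embedding have the same length and live in a shared space, which this README does not state, and it uses small stand-in lists instead of real model output. The helper names cosine_similarity and rank_images are hypothetical and are not part of beit3_model.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def rank_images(text_vec: Sequence[float],
                image_vecs: Sequence[Sequence[float]]) -> List[int]:
    """Return image indices sorted from most to least similar to the text."""
    scores = [cosine_similarity(text_vec, v) for v in image_vecs]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


if __name__ == '__main__':
    # Stand-in vectors (assumption). In practice these would be the outputs
    # of vlm.get_embedding(...), flattened to plain lists of floats.
    text_vec = [1.0, 0.0, 1.0]
    image_vecs = [[0.0, 1.0, 0.0], [2.0, 0.0, 2.0], [1.0, 1.0, 0.0]]
    print(rank_images(text_vec, image_vecs))
```

If get_embedding returns a tensor or array (the examples above only show that the result has a .shape), it would first need to be flattened into a plain list of floats before being passed to these helpers.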