AskYoutube commited on
Commit
4644150
·
1 Parent(s): 6714874

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -2,10 +2,10 @@
2
  license: mit
3
  ---
4
 
5
- # AskVideos-VideoCLIP-7B-v0.1
6
  Like it's image-only counterpart, CLIP, VideoCLIP enables you to compute a single embedding for videos that can be used to compute similarity with text.
7
 
8
- VideoCLIP uses a Video Q-Former to aggregate frame-level embeddings temporally into a single embedding, maintaining relevance of the underlying content. The resulting embedding is then trained with contrastive learning to match it's corresponding text.
9
 
10
  # Usage
11
 
 
2
  license: mit
3
  ---
4
 
5
+ # AskVideos-VideoCLIP
6
  Like it's image-only counterpart, CLIP, VideoCLIP enables you to compute a single embedding for videos that can be used to compute similarity with text.
7
 
8
+ VideoCLIP uses a Video Q-Former to aggregate frame-level embeddings temporally into a single embedding, maintaining relevance of the underlying content. The resulting embedding is then trained with contrastive loss + captioning loss to match it's corresponding text.
9
 
10
  # Usage
11