Update app.py
Browse files
app.py
CHANGED
@@ -27,9 +27,10 @@ For example, if visual elements is like this:
|
|
27 |
An older man wearing a brown hat and glasses, with a beard and a beard on his chin, is looking at the camera."
|
28 |
It does not mean there are 3 older men, but this is the same man. Because we have extracted vere close frame from the video sequence.
|
29 |
|
30 |
-
In the meantme, Audio events are actually the description
|
31 |
|
32 |
-
Your job is to use these
|
|
|
33 |
"""
|
34 |
|
35 |
def extract_frames(video_in, interval=24, output_format='.jpg'):
|
|
|
27 |
An older man wearing a brown hat and glasses, with a beard and a beard on his chin, is looking at the camera."
|
28 |
It does not mean there are 3 older men, but this is the same man. Because we have extracted vere close frame from the video sequence.
|
29 |
|
30 |
+
In the meantme, Audio events are actually the scene description based on the audio of the video.
|
31 |
|
32 |
+
Your job is to use these informatios to smartly deduce and provide a very short resume about what is happening in the video.
|
33 |
+
Keep it short.
|
34 |
"""
|
35 |
|
36 |
def extract_frames(video_in, interval=24, output_format='.jpg'):
|