Dataset usage
Fascinating idea! Which dataset did you use for fine-tuning?
I used a video dataset that I collected myself. I'm trying to do a continuous sign language recognition task, but I got very bad accuracy. My dataset consists of skeleton videos extracted from the original videos. Any suggestions?
Is it words or sentences? I reckon the temporal dimension makes it tricky and requires a massive number of samples. How do you extract the skeletons? Do you model time simultaneously or separately?
It is sentences; each sentence has 3 to 5 words. I extract the skeletons with MediaPipe, redraw only the skeleton keypoints, and use that as the input video. I have 15 sentences, with between 24 and 38 videos per sentence. I followed the example code in "Video classification using Transformers" provided on the Hugging Face site.