Generate audio labels from speech
Generate speech timestamp labels from audio
Extract vocals from audio files
Validate timestamp
Generate audio for video segments
Translate and rephrase English text
Clip YouTube or uploaded video