nvidia/parakeet-tdt-0.6b-v2 Automatic Speech Recognition β’ 0.6B β’ Updated Jun 26 β’ 708k β’ 1.25k
view reply It's not prompted. The source Audio had that emotional context and the model simply copied it.