fixie-ai/ultravox-v0_4-llama-3_1-70b

farzadab committed on Sep 27, 2024

Commit 207c10d · verified · 1 Parent(s): 553b094

Update README.md

Files changed (1)

README.md +1 -1

@@ -99,7 +99,7 @@ Supervised speech to audio finetuning. For more info, see [training code in Ultr
 #### Speeds, Sizes, Times
-The current version of Ultravox, when invoked with audio content, has a time-to-first-token (TTFT) of approximately 400ms, and a tokens-per-second rate of ~50-100 when using 4xA100-40GB GPU, all using a Llama 3.1 70B backbone.
+The current version of Ultravox, when invoked with audio content, has a time-to-first-token (TTFT) of approximately 400ms, and a tokens-per-second rate of ~50-100 when using 4xH100 SXM GPU, all using a Llama 3.1 70B backbone.
 Check out the audio tab on [TheFastest.ai](https://thefastest.ai/?m=audio) for daily benchmarks and a comparison with other existing models.
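
To put the quoted figures in context, here is a minimal sketch of how a TTFT of about 400ms and a rate of 50-100 tokens per second might translate into end-to-end response time. The function name `estimate_response_time` and the simple model (first token at TTFT, remaining tokens at a constant decode rate) are assumptions for illustration, not part of Ultravox or its README.

```python
def estimate_response_time(n_tokens: int,
                           ttft_s: float = 0.4,
                           tokens_per_s: float = 50.0) -> float:
    """Rough end-to-end latency in seconds for a response of n_tokens.

    Assumed model (hypothetical, for illustration only): the first token
    arrives after ttft_s, and every later token streams at a constant
    tokens_per_s rate.
    """
    if n_tokens < 1:
        raise ValueError("n_tokens must be at least 1")
    if tokens_per_s <= 0:
        raise ValueError("tokens_per_s must be positive")
    # The first token is covered by TTFT; the rest are decoded at the steady rate.
    return ttft_s + (n_tokens - 1) / tokens_per_s


if __name__ == "__main__":
    # Compare the low and high ends of the quoted 50-100 tokens/s range.
    for rate in (50.0, 100.0):
        seconds = estimate_response_time(101, ttft_s=0.4, tokens_per_s=rate)
        print(f"{rate:.0f} tok/s: {seconds:.2f} s for 101 tokens")
```

Under these assumptions, the decode rate dominates for long responses, while TTFT dominates for short ones.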