Update README.md
Browse files
README.md
CHANGED
@@ -99,7 +99,7 @@ Supervised speech to audio finetuning. For more info, see [training code in Ultr
|
|
99 |
|
100 |
#### Speeds, Sizes, Times
|
101 |
|
102 |
-
The current version of Ultravox, when invoked with audio content, has a time-to-first-token (TTFT) of approximately 400ms, and a tokens-per-second rate of ~50-100 when using
|
103 |
|
104 |
Check out the audio tab on [TheFastest.ai](https://thefastest.ai/?m=audio) for daily benchmarks and a comparison with other existing models.
|
105 |
|
|
|
99 |
|
100 |
#### Speeds, Sizes, Times
|
101 |
|
102 |
+
The current version of Ultravox, when invoked with audio content, has a time-to-first-token (TTFT) of approximately 400ms, and a tokens-per-second rate of ~50-100 when using 4xH100 SXM GPU, all using a Llama 3.1 70B backbone.
|
103 |
|
104 |
Check out the audio tab on [TheFastest.ai](https://thefastest.ai/?m=audio) for daily benchmarks and a comparison with other existing models.
|
105 |
|