OuteTTS-0.2-500M-GGUF

Original Model

Run with LlamaEdge

LlamaEdge version: v0.14.9

Run as LlamaEdge service

wasmedge --dir .:. \
  --nn-preload tts:GGML:AUTO:OuteTTS-0.2-500M-Q5_K_M.gguf \
  llama-api-server.wasm config \
  --file llama_server_config.toml \
  --tts

llama_server_config.toml can be derived from the template config file llama_server_config.toml.bkp. The recommended [tts] config is shown as below

[tts]
model_name   = "tts"            # Name of the TTS model.
model_alias  = "tts"            # Alias of the TTS model.
codec_model  = ""               # Required. Path to the codec model file.
speaker_file  = ""              # Path to an alternative speaker file.
ctx_size     = 8192             # Context size. Default is 8192.
batch_size   = 8192             # Batch size. Default is 8192.
ubatch_size  = 8192             # Physical maximum batch size. Default is 8192.
n_predict    = 4096             # Number of tokens to predict. Default is 4096.
n_gpu_layers = 100              # Number of layers to run on GPU. Default is 100.
temp         = 0.8              # Temperature. Default is 0.8.

Quantized GGUF Models

Name	Quant method	Bits	Size	Use case
OuteTTS-0.2-500M-Q2_K.gguf	Q2_K	2	344 MB	smallest, significant quality loss - not recommended for most purposes
OuteTTS-0.2-500M-Q3_K_L.gguf	Q3_K_L	3	375 MB	small, substantial quality loss
OuteTTS-0.2-500M-Q3_K_M.gguf	Q3_K_M	3	361 MB	very small, high quality loss
OuteTTS-0.2-500M-Q3_K_S.gguf	Q3_K_S	3	344 MB	very small, high quality loss
OuteTTS-0.2-500M-Q4_0.gguf	Q4_0	4	358 MB	legacy; small, very high quality loss - prefer using Q3_K_M
OuteTTS-0.2-500M-Q4_K_M.gguf	Q4_K_M	4	403 MB	medium, balanced quality - recommended
OuteTTS-0.2-500M-Q4_K_S.gguf	Q4_K_S	4	391 MB	small, greater quality loss
OuteTTS-0.2-500M-Q5_0.gguf	Q5_0	5	402 MB	legacy; medium, balanced quality - prefer using Q4_K_M
OuteTTS-0.2-500M-Q5_K_M.gguf	Q5_K_M	5	426 MB	large, very low quality loss - recommended
OuteTTS-0.2-500M-Q5_K_S.gguf	Q5_K_S	5	418 MB	large, low quality loss - recommended
OuteTTS-0.2-500M-Q6_K.gguf	Q6_K	6	511 MB	very large, extremely low quality loss
OuteTTS-0.2-500M-Q8_0.gguf	Q8_0	8	537 MB	very large, extremely low quality loss - not recommended
OuteTTS-0.2-500M-f16.gguf	f16	16	1.00 GB
wavtokenizer-large-75-ggml-f16.gguf	f16	16	1.00 GB

Quantized with llama.cpp b4381

second-state
/

OuteTTS-0.2-500M-GGUF

OuteTTS-0.2-500M-GGUF

Original Model

Run with LlamaEdge

Quantized GGUF Models

Model tree for second-state/OuteTTS-0.2-500M-GGUF