Hi, welcome to use,
- The audio tokenizer gguf file in iq4xs, even i say so, only little smaller and faster than q8_0;
- This repo's qwen3-tts-0.6b-q8_0.gguf is smaller. iq4xs quant hurts the voice clone ablity, but maybe we don't really care that little lose.
- Updated new quanted iq4_xs tokenizer, which downcasted some fp16 tensors to q8_0.(211M)
- Updated iq3s tokenizer, yes, it's working well and faster
Now you only need 708 MB RAM in total to run this model!
- Downloads last month
- 1,130
Hardware compatibility
Log In to add your hardware
Model tree for Jahaz/Qwen3-tts-0.6b-gguf-for-koboldcpp
Base model
Qwen/Qwen3-TTS-12Hz-0.6B-Base