下に引いて戻る
Qwen3.6 does not like Turboquant

Qwen3.6 does not like Turboquant

Qwen3.6 does not like Turboquant

https://preview.redd.it/67aud1op3nwg1.png?width=1678&format=png&auto=webp&s=9e584afb7c5aae71c2daed934823c85087dd7009 I've tried a prompt with llamma.cpp, ik_llama.cpp and TheTom/turboquant - I have 2 GPU (3080, 3060 12GB each) - Same settings save params except for -ctk -ctv / turbo3 vs q8_0 - using https://github.com/TheTom/llama-cpp-turboquant submitted by /u/Zarzou [link] [comments]