Switch the coder model from Q6_K to the UD-Q8_K_XL quant for better output quality, and raise spec-draft-n-max from 4 to 5 to allow longer speculative draft sequences. |
||
|---|---|---|
| .. | ||
| amd | ||
| attic | ||
| halo | ||
| mx | ||
| nixtee1 | ||
| sgx | ||
| t15 | ||
| x1 | ||
Switch the coder model from Q6_K to the UD-Q8_K_XL quant for better output quality, and raise spec-draft-n-max from 4 to 5 to allow longer speculative draft sequences. |
||
|---|---|---|
| .. | ||
| amd | ||
| attic | ||
| halo | ||
| mx | ||
| nixtee1 | ||
| sgx | ||
| t15 | ||
| x1 | ||