Add a 0.74 confidence threshold so speculative drafting stops early once the draft model's predicted token probability drops below it, favoring shorter, higher-acceptance draft sequences. |
||
|---|---|---|
| .. | ||
| aarch64-darwin | ||
| aarch64-linux | ||
| x86_64-darwin/mpro | ||
| x86_64-linux | ||
| nixbuild.nix | ||