nixcfg/systems/x86_64-linux/halo
Harald Hoyer ac70c57c15 chore(halo): preload both llama models and tune preset
Preload Qwen3.6-27B and Qwen3.6-35B-A3B at startup (load-on-startup)
so both are warm immediately under --models-max 2, set parallel = 1
as the [*] fallback for any other model, and adjust per-model context
size and draft depth.
2026-05-20 07:14:26 +02:00
..
default.nix feat(halo): serve multiple llama models via models.ini preset 2026-05-20 00:23:50 +02:00
hardware-configuration.nix feat(halo): verbose boot 2026-02-17 09:17:24 +01:00
llama-server.nix feat(halo): serve multiple llama models via models.ini preset 2026-05-20 00:23:50 +02:00
models.ini chore(halo): preload both llama models and tune preset 2026-05-20 07:14:26 +02:00
sound.nix chore: nix fmt 2026-05-03 14:57:49 +02:00
wyoming.nix nix fmt 2026-02-24 13:25:42 +01:00
xremap.nix chore: nix fmt 2026-05-03 14:57:49 +02:00