Replace the per-model llama-server units with a single service that uses llama-server's --models-preset (models.ini) and --models-max 2, so the 35B-A3B and 27B models are loaded on demand from one config. Drop the now-redundant 27B / 27B-MTP / coder-next variant files and the unused CacheDirectory + slot-save-path KV-slot handling.

| File |
|---|
| default.nix |
| hardware-configuration.nix |
| llama-server.nix |
| models.ini |
| sound.nix |
| wyoming.nix |
| xremap.nix |
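
For orientation, the single-service layout described in the commit message might look roughly like the sketch below. This is not the contents of the actual llama-server.nix, which is not shown here. It assumes the `llama-cpp` package from nixpkgs provides the `llama-server` binary and that models.ini sits next to the module. The unit name, description, and restart policy are illustrative choices. Only the `--models-preset` and `--models-max 2` flags come from the commit message.

```nix
{ lib, pkgs, ... }:
{
  # Hypothetical unit name; the real module may use a different one.
  systemd.services.llama-server = {
    description = "llama-server with on-demand model loading";
    wantedBy = [ "multi-user.target" ];
    after = [ "network.target" ];

    serviceConfig = {
      # One process serves every model listed in models.ini.
      # --models-max 2 is the value named in the commit message.
      ExecStart = lib.concatStringsSep " " [
        "${pkgs.llama-cpp}/bin/llama-server" # assumed package attribute
        "--models-preset"
        "${./models.ini}" # copied into the Nix store at build time
        "--models-max"
        "2"
      ];
      Restart = "on-failure"; # illustrative, not taken from the source
    };
  };
}
```

Referencing `./models.ini` as a path means the preset file is copied into the Nix store, so an edit to it changes the unit and triggers a service restart on the next rebuild.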