nixcfg

Harald Hoyer 5ee2f65337 chore(halo): tune llama models.ini and drop 35B-A3B model	2026-05-20 14:23:42 +02:00

Serve only Qwen3.6-27B; remove the unused 35B-A3B preset.

Tuning:
- Move model-specific keys (spec-type, sampling temp/top-p/top-k/min-p) out of the [] defaults into [Qwen3.6-27B] so they no longer leak onto other models; draft-mtp in particular only works on MTP-weighted models.
- Drop the duplicate parallel key from [].
- Bump ubatch-size 256 -> 512 for faster iGPU prefill on Strix Halo.
- Add threads-batch = 16 to use all cores for prefill while keeping generation at threads = 8 under full GPU offload.

amd	feat(amd): enable Wake-on-LAN on enp7s0	2026-05-16 13:40:25 +02:00
attic	feat(headscale): add ACL policy, isolate mx, make mx an exit node	2026-05-13 09:06:40 +02:00
halo	chore(halo): tune llama models.ini and drop 35B-A3B model	2026-05-20 14:23:42 +02:00
mx	feat(halo): add `song <URL>` command to convert via song.link	2026-05-20 09:42:11 +02:00
nixtee1	refactor(nix): extract common system configs into reusable modules	2026-01-30 10:42:09 +01:00
sgx	refactor(opencode): extract serve service into shared NixOS module	2026-05-05 13:43:27 +02:00
t15	refactor(nix): extract common system configs into reusable modules	2026-01-30 10:42:09 +01:00
x1	chore(x1,amd): disable cratedocs-mcp service	2026-05-13 11:35:59 +02:00