nixcfg

harald/nixcfg

Fork 0

Commit graph

Author	SHA1	Message	Date
Harald Hoyer	807a3d0d8e	fix(halo): context	2026-05-20 01:21:10 +02:00
Harald Hoyer	0edf975c30	feat(halo): serve multiple llama models via models.ini preset Replace the per-model llama-server units with a single service that uses llama-server's --models-preset (models.ini) and --models-max 2, so the 35B-A3B and 27B models are loaded on demand from one config. Drop the now-redundant 27B / 27B-MTP / coder-next variant files and the unused CacheDirectory + slot-save-path KV-slot handling.	2026-05-20 00:23:50 +02:00

Author

SHA1

Message

Date

Harald Hoyer

807a3d0d8e

fix(halo): context

2026-05-20 01:21:10 +02:00

Harald Hoyer

0edf975c30

feat(halo): serve multiple llama models via models.ini preset

Replace the per-model llama-server units with a single service that
uses llama-server's --models-preset (models.ini) and --models-max 2,
so the 35B-A3B and 27B models are loaded on demand from one config.

Drop the now-redundant 27B / 27B-MTP / coder-next variant files and
the unused CacheDirectory + slot-save-path KV-slot handling.

2026-05-20 00:23:50 +02:00

2 commits