nixcfg/systems
Harald Hoyer ab729a0720 feat(halo): serve bge-m3 embeddings alongside coder
Add a multilingual bge-m3 embedding model to the llama-server preset and
raise --models-max to 2 so it stays co-resident with the coder model.
This gives the RAG stack a local embeddings endpoint without a second
service, keeping all inference on halo. Embedding-specific overrides
(ubatch-size, context, pooling) are pinned since the global defaults
would truncate or misconfigure embedding requests.
2026-05-22 00:35:54 +02:00
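The overrides described in the commit message can be sketched as a plain Nix attribute set. This is only an illustration under stated assumptions: the names `routerArgs`, `embeddingArgs`, and `models` are hypothetical and do not reflect the repository's actual module or preset format, and the numeric values and pooling type are assumed rather than taken from the commit. Only `--models-max 2` and the three overridden settings (ubatch-size, context, pooling) come from the message itself.

```nix
# Hypothetical sketch: attribute names and values are illustrative
# assumptions, not the repository's actual module or preset format.
let
  # From the commit message: allow two models to stay loaded at once.
  routerArgs = [ "--models-max" "2" ];

  # Embedding-specific overrides for the bge-m3 entry (values are assumed).
  embeddingArgs = [
    "--embedding"            # restrict this model to the embedding use case
    "--pooling" "cls"        # assumed pooling type for bge-m3
    "--ctx-size" "8192"      # assumed context length
    "--ubatch-size" "8192"   # assumed: sized so a full-context input fits one physical batch
  ];
in
{
  inherit routerArgs;
  models."bge-m3" = embeddingArgs;
}
```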
aarch64-darwin      feat(rialo): add pi                                  2026-05-19 14:27:50 +02:00
aarch64-linux       nix fmt                                              2026-02-24 13:25:42 +01:00
x86_64-darwin/mpro  nix fmt                                              2024-11-19 10:31:29 +01:00
x86_64-linux        feat(halo): serve bge-m3 embeddings alongside coder  2026-05-22 00:35:54 +02:00
nixbuild.nix        chore: nix fmt                                       2026-05-03 14:57:49 +02:00