nixcfg/systems/x86_64-linux
Harald Hoyer ab729a0720 feat(halo): serve bge-m3 embeddings alongside coder
Add a multilingual bge-m3 embedding model to the llama-server preset and
raise --models-max to 2 so it stays co-resident with the coder model.
This gives the RAG stack a local embeddings endpoint without a second
service, keeping all inference on halo. Embedding-specific overrides
(ubatch-size, context, pooling) are pinned since the global defaults
would truncate or misconfigure embedding requests.
2026-05-22 00:35:54 +02:00
..
amd chore(halo): llama.cpp update 2026-05-21 20:46:06 +02:00
attic feat(headscale): add ACL policy, isolate mx, make mx an exit node 2026-05-13 09:06:40 +02:00
halo feat(halo): serve bge-m3 embeddings alongside coder 2026-05-22 00:35:54 +02:00
mx feat(halo): add song <URL> command to convert via song.link 2026-05-20 09:42:11 +02:00
nixtee1 refactor(nix): extract common system configs into reusable modules 2026-01-30 10:42:09 +01:00
sgx refactor(openwebui): drop stale backend env vars now managed via UI 2026-05-21 23:15:47 +02:00
t15 refactor(nix): extract common system configs into reusable modules 2026-01-30 10:42:09 +01:00
x1 chore(x1,amd): disable cratedocs-mcp service 2026-05-13 11:35:59 +02:00