feat(sgx): add CLI RAG stack (Qdrant + embeddings gateway + rag tool)

Stand up document retrieval as shared, client-agnostic primitives rather
than locking it inside Open WebUI:

- Qdrant as the LAN-reachable vector store
- LiteLLM gains a bge-m3 route so sgx:4000 also serves /v1/embeddings
- a thin `rag` CLI (ingest/query, optional coder synthesis) usable from
  any machine and from scripts

Embeddings and synthesis run on halo via the gateway; the CLI is
configured entirely through RAG_* env vars.
This commit is contained in:
Harald Hoyer 2026-05-22 00:35:35 +02:00
parent ab729a0720
commit 95668b71a7
4 changed files with 180 additions and 0 deletions

View file

@ -12,6 +12,7 @@
./wyoming.nix
./searx.nix
./litellm.nix
./qdrant.nix
./uptime-kuma.nix
./firefly.nix
./opencode.nix
@ -25,6 +26,7 @@
environment.systemPackages = with pkgs; [
claude-code
opencode
metacfg.rag
];
services.tailscale.enable = true;