feat: full-stack search engine — FTS5, vector search, hybrid merge, embedding cache, chunker · 0e7f501fd6 - harald/zeroclaw

feat: full-stack search engine — FTS5, vector search, hybrid merge, embedding cache, chunker

The Full Stack (All Custom):
- Vector DB: embeddings stored as BLOB, cosine similarity in pure Rust
- Keyword Search: FTS5 virtual tables with BM25 scoring + auto-sync triggers
- Hybrid Merge: weighted fusion of vector + keyword results (configurable weights)
- Embeddings: provider abstraction (OpenAI, custom URL, noop fallback)
- Chunking: line-based markdown chunker with heading preservation
- Caching: embedding_cache table with LRU eviction
- Safe Reindex: rebuild FTS5 + re-embed missing vectors

New modules:
- src/memory/embeddings.rs — EmbeddingProvider trait + OpenAI + Noop + factory
- src/memory/vector.rs — cosine similarity, vec↔bytes, ScoredResult, hybrid_merge
- src/memory/chunker.rs — markdown-aware document splitting

Upgraded:
- src/memory/sqlite.rs — FTS5 schema, embedding column, hybrid recall, cache, reindex
- src/config/schema.rs — MemoryConfig expanded with embedding/search settings
- All callers updated to pass api_key for embedding provider

739 tests passing, 0 clippy warnings (Rust 1.93.1), cargo-deny clean

This commit is contained in:

argenis de la rosa

2026-02-14 00:00:23 -05:00

parent 4fceba0740

commit 0e7f501fd6

10 changed files with 1423 additions and 96 deletions

									
										1

src/channels/mod.rs
									
										View file
										
				@ -227,6 +227,7 @@ pub async fn start_channels(config: Config) -> Result<()> {

				    let mem: Arc<dyn Memory> = Arc::from(memory::create_memory(

				        &config.memory,

				        &config.workspace_dir,

				        config.api_key.as_deref(),

				    )?);

				    // Build system prompt from workspace identity files + skills

Rows
Columns

feat: full-stack search engine — FTS5, vector search, hybrid merge, embedding cache, chunker

1 src/channels/mod.rs Unescape Escape View file

1

src/channels/mod.rs

View file