zeroclaw

Author	SHA1	Message	Date
Chummy	57be369771	chore(docker): keep install list indentation unchanged	2026-02-18 15:14:05 +08:00
Cemal Y. Dalar	7f15627f8c	fix(docker): restore benches/ copy after stub removal in builder stage The dep-caching layer creates stub files (src/main.rs and benches/agent_benchmarks.rs) to warm the cargo registry cache, then removes them with `rm -rf src benches`. The subsequent real source copy only restored `src/` — leaving `benches/` absent. Cargo's manifest parser then failed to locate `benches/agent_benchmarks.rs` referenced in Cargo.toml, aborting the release build with: error: failed to parse manifest at `/app/Cargo.toml` Caused by: can't find `agent_benchmarks` bench at `benches/agent_benchmarks.rs` Fix: add `COPY benches/ benches/` alongside the `COPY src/ src/` step so the real bench source is present for the incremental release build.	2026-02-18 15:14:05 +08:00
Mike Boensel	0166f2d4de	fix(token): update token generation to use rand::rng() to resolve deprecation warnings	2026-02-18 02:11:51 -05:00
Chummy	a3eedfdc78	docs(zai): align setup guide with runtime defaults - remove trailing whitespace in .env.example Z.AI block - align documented model defaults/options with current onboard/provider behavior - keep this PR docs-focused by reverting incidental workflow edits	2026-02-18 15:10:55 +08:00
adisusilayasa	e3d6058424	fix(ci): include workflow fix for CI to pass The first-interaction action requires snake_case input names. This fix is needed for CI to pass on this PR.	2026-02-18 15:10:55 +08:00
adisusilayasa	402d8f0a32	docs: add Z.AI GLM coding plan setup guide - Add comprehensive documentation for Z.AI GLM models - Include curl examples for testing Z.AI API - Document available models and troubleshooting - Update .env.example with Z.AI configuration Z.AI provides GLM models (glm-4.5, glm-4.6, glm-4.7, glm-5) through the OpenAI-compatible endpoint at api.z.ai/api/coding/paas/v4. Existing tests verify: - zai_base_url() returns correct URLs for global/CN variants - create_provider('zai', key) successfully creates provider - Regional alias predicates cover all variants	2026-02-18 15:10:55 +08:00
Chummy	42bf05df47	docs: clarify custom provider env vars and URL scheme	2026-02-18 15:04:11 +08:00
ZeroClaw Bot	f13553014b	docs: add custom provider endpoint configuration guide Add comprehensive documentation for custom API endpoint configuration to address missing documentation reported in issue #567. Changes: - Create docs/custom-providers.md with detailed guide for custom: and anthropic-custom: formats - Add custom endpoint examples to README.md configuration section - Add note about daemon requirement for channels in Quick Start - Add reference link to custom providers guide Addresses: #567 Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-18 15:04:11 +08:00
Chummy	dd454178ed	perf(memory): fold recall/vector/list optimizations into spawn_blocking refactor	2026-02-18 14:46:51 +08:00
Alex Gorevski	4e528dde7d	perf(memory): wrap blocking SQLite calls in tokio::task::spawn_blocking Problem: Every async fn in SqliteMemory acquired self.conn.lock() and ran synchronous rusqlite queries directly on the Tokio runtime thread. This blocks the async executor, preventing other tasks from making progress — especially harmful under concurrent recall/store load. Fix: - Change conn from Mutex<Connection> to Arc<Mutex<Connection>> so the connection handle can be cloned into spawn_blocking closures. - Wrap all synchronous database operations (store, recall, get, list, forget, count, health_check) in tokio::task::spawn_blocking. - Split get_or_compute_embedding into three phases: cache check (blocking), embedding computation (async I/O), cache store (blocking) — ensuring no lock is held across await points. - Apply the same pattern to the reindex method. The async I/O (embedding computation) remains on the Tokio runtime while all SQLite access runs on the blocking thread pool, preventing executor starvation. Ref: zeroclaw-labs/zeroclaw#710 (Item 4) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-02-18 14:46:51 +08:00
Chummy	83b098d7ac	fix(imessage): preserve sqlite conn across polling safely	2026-02-18 14:45:05 +08:00
Alex Gorevski	1ddcb0a573	perf(imessage): reuse persistent SQLite connection across poll cycles Problem: The iMessage listener opened a new SQLite connection to the Messages database on every ~3-second poll cycle via get_max_rowid() and fetch_new_messages(), creating ~40 connection open/close cycles per minute. Each cycle incurs filesystem syscalls, WAL header reads, and potential page cache cold starts. Fix: Open a single read-only connection before the poll loop and reuse it across iterations using the 'shuttle' pattern: the connection is moved into each spawn_blocking closure and returned alongside the results, then reassigned for the next iteration. This eliminates per-poll connection overhead while preserving the spawn_blocking pattern that keeps SQLite I/O off the Tokio runtime thread. The standalone get_max_rowid() and fetch_new_messages() helper functions are retained for use by tests and other callers. Ref: zeroclaw-labs/zeroclaw#710 (Item 9) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-02-18 14:45:05 +08:00
Chummy	14066d094f	test(runtime): stabilize docker root mount assertion	2026-02-18 14:42:39 +08:00
Alex Gorevski	9a6fa76825	readd tests, remove markdown files	2026-02-18 14:42:39 +08:00
Chummy	e2634c72c2	test(config): include query_classification in config fixtures	2026-02-18 14:41:58 +08:00
Edvard	6e53341bb1	feat(agent): add rule-based query classification for automatic model routing Classify incoming user messages by keyword/pattern and route to the appropriate model hint automatically, feeding into the existing RouterProvider. Disabled by default; opt-in via [query_classification] config section. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 14:41:58 +08:00
Edvard	1336c2f03e	feat(providers): add warmup() for OpenAI, Anthropic, Gemini, Compatible, GLM All five providers have HTTP clients but did not implement warmup(), relying on the trait default no-op. This adds lightweight warmup calls to establish TLS + HTTP/2 connection pools on startup, reducing first-request latency. Each warmup is skipped when credentials are absent, matching the OpenRouter pattern. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 14:35:03 +08:00
Chummy	a85a4a8194	fix(config): resolve ZEROCLAW_WORKSPACE root/workspace paths safely	2026-02-18 14:30:53 +08:00
bhagwan	b2976eb474	fix(config): support both legacy and new ZEROCLAW_WORKSPACE structure ZEROCLAW_WORKSPACE can now be either: - Legacy path: /path/to/workspace (config at /path/to/.zeroclaw/config.toml) - Parent path: /path/to (config at /path/to/config.toml, workspace at /path/to/workspace) This maintains backward compatibility with Docker's legacy folder structure while also supporting the new parent-dir layout.	2026-02-18 14:30:53 +08:00
Chummy	da7c21f469	style(anthropic): format cache conversation test block	2026-02-18 14:29:50 +08:00
tercerapersona	455eb3b847	feat: add prompt caching support to Anthropic provider Implements Anthropic's prompt caching API to enable significant cost reduction (up to 90%) and latency improvements (up to 85%) for requests with repeated content. Key features: - Auto-caching heuristics: large system prompts (>3KB), tool definitions, and long conversations (>4 messages) - Full backward compatibility: cache_control fields are optional - Supports both string and block-array system prompt formats - Cache control on all content types (text, tool_use, tool_result) Implementation details: - Added CacheControl, SystemPrompt, and SystemBlock structures - Updated NativeContentOut and NativeToolSpec with cache_control - Strategic cache breakpoint placement (last tool, last message) - Comprehensive test coverage for serialization and heuristics Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> (cherry picked from commit fff04f4edb5e4cb7e581b1b16035da8cc2e55cef)	2026-02-18 14:29:50 +08:00
Maya Walcher	63bc4721e3	feat(onboard): add signup URL, model catalog, and live fetch for Astrai Add three onboarding improvements for the Astrai provider: - Signup URL: users now see "Get your API key at: https://as-trai.com" during onboarding instead of a blank prompt - Curated model list: auto (best execution), GPT-4o, Claude Sonnet 4.5, DeepSeek V3, Llama 3.3 70B - Live model fetch: Astrai's OpenAI-compatible /v1/models endpoint is now queried when an API key is present, matching other providers Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 14:19:21 +08:00
Chummy	d70324f4f7	fix(robot-kit): format crate and harden cross-platform feature gating	2026-02-18 14:19:17 +08:00
Lumi-node	0dfc707c49	feat: add zeroclaw-robot-kit crate for AI-powered robotics Standalone robot toolkit providing AI agents with physical world interaction. Features: - 6 tools: drive, look, listen, speak, sense, emote - Multiple backends: ROS2, serial, GPIO, mock - Independent SafetyMonitor with E-stop, collision avoidance - Designed for Raspberry Pi 5 + Ollama offline operation - 55 unit/integration tests - Complete Pi 5 hardware setup guide	2026-02-18 14:19:17 +08:00
Chummy	431287184b	style(tests): apply rustfmt to brittle-test hardening changes	2026-02-18 14:17:58 +08:00
Alex Gorevski	45cdd25b3d	fix(tests): harden brittle tests for cross-platform stability and refactoring resilience ## Problem The test suite contained several categories of latent brittleness identified in docs/testing-brittle-tests.md that would surface during refactoring or cross-platform (Windows) CI execution: 1. Hardcoded Unix paths: `Path::new("/tmp")` and `PathBuf::from("/tmp")` used as workspace directories in agent tests, which fail on Windows where /tmp does not exist. 2. Exact string match assertions: ~20 `assert_eq!(response, "exact text")` assertions in agent unit and e2e tests that break on any mock wording change, even when the underlying orchestration behavior is correct. 3. Fragile error message string matching: `.contains("specific message")` assertions coupled to internal error wording rather than testing the error category or behavioral outcome. ## What Changed ### Hardcoded paths → platform-agnostic temp dirs (4 files, 7 locations) - `src/agent/tests.rs`: Replaced all 4 instances of `Path::new("/tmp")` and `PathBuf::from("/tmp")` with `std::env::temp_dir()` in `make_memory()`, `build_agent_with()`, `build_agent_with_memory()`, and `build_agent_with_config()` helpers. - `tests/agent_e2e.rs`: Replaced all 3 instances in `make_memory()`, `build_agent()`, and `build_agent_xml()` helpers. ### Exact string assertions → behavioral checks (2 files, ~20 locations) - `src/agent/tests.rs`: Converted 10 `assert_eq!(response, "...")` to `assert!(!response.is_empty(), "descriptive message")` across tests for text pass-through, tool execution, tool failure recovery, XML dispatch, mixed text+tool responses, multi-tool batch, and run_single delegation. - `tests/agent_e2e.rs`: Converted 9 exact-match assertions to behavioral checks. Multi-turn test now uses `assert_ne!(r1, r2)` to verify sequential responses are distinct without coupling to exact wording. - Provider error propagation test simplified to `assert!(result.is_err())` without asserting on the error message string. ### Fragile error message assertions → structural checks (2 files) - `src/tools/git_operations.rs`: Replaced fragile OR-branch string match (`contains("git repository") || contains("Git command failed")`) with structural assertions: checks `!result.success`, error is non-empty, and error does NOT mention autonomy/read-only (verifying the failure is git-related, not permission-related). - `src/cron/scheduler.rs`: Replaced `contains("agent job failed:")` with `!success` and `!output.is_empty()` checks that verify failure behavior without coupling to exact log format. ## What Was NOT Changed (and why) - `src/agent/loop_.rs` parser tests: Exact string assertions are the contract for XML tool call parsing — the exact output IS the spec. - `src/providers/reliable.rs`: Error message assertions test the error format contract (provider/model attribution in failure messages). - `src/service/mod.rs`: Already platform-gated with `#[cfg]`; XML escape test is a formatting contract where exact match is appropriate. - `src/config/schema.rs`: TOML test strings use /tmp as data values for deserialization tests, not filesystem access; HOME tests already use `std::env::temp_dir()`. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-02-18 14:17:58 +08:00
Chummy	decea532ed	refactor(memory): keep default hybrid weights while adding relevance threshold	2026-02-18 14:14:33 +08:00
Edvard	8a1e7cc7ef	fix(agent): use config max_tool_iterations, add memory relevance filtering, rebalance search weights Three fixes for conversation quality issues: 1. loop_.rs and channels now read max_tool_iterations from AgentConfig instead of using a hardcoded constant of 10, making it configurable. 2. Memory recall now filters entries below a configurable min_relevance_score threshold (default 0.4), preventing unrelated memories from bleeding into conversation context. 3. Default hybrid search weights rebalanced from 70/30 vector/keyword to 40/60, reducing cross-topic semantic bleed. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 14:14:33 +08:00
Alex Gorevski	21c5f58363	perf(cron): wrap record_run INSERT+DELETE in explicit transaction Problem: In record_run(), an INSERT into cron_runs followed by a pruning DELETE ran as separate implicit transactions. If the INSERT succeeded but the DELETE failed (e.g., due to disk pressure or lock contention), the run table would grow unboundedly since the pruning step was lost while the new row persisted. Fix: Wrap both statements in an explicit transaction using conn.unchecked_transaction(). If either statement fails, the entire transaction is rolled back, maintaining the invariant that the run history stays bounded by max_run_history. Ref: zeroclaw-labs/zeroclaw#710 (Item 5) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-02-18 14:07:31 +08:00
Alex Gorevski	9967eeb954	perf(cron): add composite index on cron_runs(job_id, started_at) Problem: The pruning query in record_run uses WHERE job_id = ?1 with ORDER BY started_at DESC, but only single-column indexes exist for job_id and started_at separately. SQLite must scan one index and then sort or scan the other, which is suboptimal for the combined filter + sort pattern used during pruning. Fix: Add a composite index CREATE INDEX IF NOT EXISTS idx_cron_runs_job_started ON cron_runs(job_id, started_at). This lets SQLite satisfy the WHERE job_id = ?1 ORDER BY started_at DESC subquery in a single index scan without a separate sort step. The existing single-column indexes are retained for other queries that filter on only one column. Ref: zeroclaw-labs/zeroclaw#710 (Item 7) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>	2026-02-18 14:06:59 +08:00
Chummy	151bc6a600	fix(scripts): align installer filename and docs placement	2026-02-18 13:56:35 +08:00
reidliu41	fdef03e455	feat(scripts): add one-line install script - Add `scripts/install.sh` — a single `curl | bash` installer that handles system deps, Rust, clone, build, and install automatically. - Update README Linux/macOS section with a "One-Line Installer (Recommended)" block above the existing manual steps. 1. Detects OS (Linux apt/dnf, macOS Xcode CLT) 2. Installs build deps + git via system package manager (sudo only here) 3. Installs Rust via rustup (skipped if already present) 4. Shallow-clones the repo to `/tmp/zeroclaw-install` 5. `cargo build --release --locked` + `cargo install --path . --force --locked` 6. Cleans up temp dir and prints next steps (`source ~/.cargo/env`, `zeroclaw onboard`)	2026-02-18 13:56:35 +08:00
ikunali	61eb72f6eb	docs(readme): dark mode support for Star History chart	2026-02-18 13:50:42 +08:00
Chummy	d42cb1e906	fix(auth): rebase PR #200 onto main and restore auth CLI flow	2026-02-18 12:57:44 +08:00
Codex	96109d46d1	Fix pending OAuth verifier storage and account id fallback	2026-02-18 12:57:44 +08:00
Codex	e8aa63822a	fix PR #200 review issues	2026-02-18 12:57:44 +08:00
Codex	39087a446d	Fix OpenAI Codex contract, SSE parsing, and default xhigh reasoning	2026-02-18 12:57:44 +08:00
Codex	007368d586	feat(auth): add subscription auth profiles and codex/claude flows	2026-02-18 12:57:44 +08:00
Edvard	6d8725c9e6	fix(agent): log warning when native tool call arguments fail JSON parsing The NativeToolDispatcher silently defaults to an empty object when tool call arguments from the LLM fail to parse as JSON. The XML dispatcher already logs a warning for the same case (line 68). Add a matching tracing::warn with tool name and parse error for observability parity. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:56:56 +08:00
fettpl	7de052c7d2	fix(cron): add timeout and bounded execution for due jobs	2026-02-18 12:55:21 +08:00
Alex Gorevski	5f5cb27690	fix(cron): handle ALTER TABLE race condition in schema migration Problem: add_column_if_missing() checks PRAGMA table_info for column existence, then issues ALTER TABLE ADD COLUMN if not found. When two concurrent processes both pass the check before either executes the ALTER, the second process fails with a 'duplicate column name' error. Fix: Catch the 'duplicate column name' SQLite error after the ALTER TABLE and treat it as a benign no-op. Also explicitly drop statement/rows handles before ALTER to release locks. Ref: #710 (Item 8)	2026-02-17 23:50:08 -05:00
Edvard	63602a262f	fix(agent): use config-driven limits in run_tool_call_loop and trim_history run_tool_call_loop used a hardcoded MAX_TOOL_ITERATIONS (10) and trim_history/auto_compact_history used a hardcoded MAX_HISTORY_MESSAGES (50), ignoring the user-configurable agent.max_tool_iterations and agent.max_history_messages values in config.toml. Meanwhile, agent.rs correctly reads from config — creating an inconsistency where CLI single-shot mode respected config but the channel runtime and interactive CLI loop silently ignored it. Changes: - Rename constants to DEFAULT_* to clarify they are fallback defaults - Add max_tool_iterations parameter to run_tool_call_loop - Add max_history parameter to trim_history and auto_compact_history - Thread config.agent.max_tool_iterations through ChannelRuntimeContext - Both CLI code paths now pass config values to run_tool_call_loop Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:49:28 +08:00
h1n054ur	1c074d5204	fix(discord): use channel name for reply routing instead of discord channel ID The Discord channel was setting msg.channel to the numeric Discord channel ID instead of the literal string 'discord'. This caused process_channel_message() to fail the channels_by_name lookup since the map is keyed by channel name (e.g. 'discord', 'telegram', 'slack'). The result: the bot receives messages and generates LLM responses but never sends them back -- target_channel resolves to None so the send call is silently skipped. Every other channel (telegram, slack, whatsapp, matrix, signal, irc, imessage, lark, dingtalk, qq, email, mattermost) correctly sets this field to its channel name string. Discord was the only one using the platform-specific ID.	2026-02-18 12:49:06 +08:00
fettpl	4f9c87ff74	fix(policy): standardize side-effect tool autonomy gates	2026-02-18 12:42:56 +08:00
Edvard	89d0fb9a1e	feat(providers): implement chat_with_history for GLM provider The GLM provider previously relied on the trait default for chat_with_history, which only forwarded the last user message. This adds a proper multi-turn implementation that sends the full conversation history to the GLM API, matching the pattern used by OpenRouter, Ollama, and other providers. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:33:51 +08:00
Ademílson Tonato	73e675d298	feat(memory): optional SQLite connection open timeout - Add memory.sqlite_open_timeout_secs config (None = wait indefinitely). - When set, open the DB in a thread with recv_timeout; cap at 300s. - Default remains None for backward compatibility. - Document in README; add tests for timeout path and default.	2026-02-18 12:18:05 +08:00
Edvard	b3b1679218	feat(channels): implement typing indicator for Telegram channel Add start_typing/stop_typing overrides to TelegramChannel following the same pattern as DiscordChannel: spawn a tokio task that sends sendChatAction every 4 seconds (Telegram typing expires after 5s), and abort it on stop_typing. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:06:58 +08:00
Chummy	2560399423	feat(observability): focus PR 596 on Prometheus backend	2026-02-18 12:06:05 +08:00
argenis de la rosa	eba544dbd4	feat(observability): implement Prometheus metrics backend with /metrics endpoint - Adds PrometheusObserver backend with counters, histograms, and gauges - Tracks agent starts/duration, tool calls, channel messages, heartbeat ticks, errors, request latency, tokens, sessions, queue depth - Adds GET /metrics endpoint to gateway for Prometheus scraping - Adds provider/model labels to AgentStart and AgentEnd events for better observability - Adds as_any() method to Observer trait for backend-specific downcast Metrics exposed: - zeroclaw_agent_starts_total (Counter) with provider/model labels - zeroclaw_agent_duration_seconds (Histogram) with provider/model labels - zeroclaw_tool_calls_total (Counter) with tool/success labels - zeroclaw_tool_duration_seconds (Histogram) with tool label - zeroclaw_channel_messages_total (Counter) with channel/direction labels - zeroclaw_heartbeat_ticks_total (Counter) - zeroclaw_errors_total (Counter) with component label - zeroclaw_request_latency_seconds (Histogram) - zeroclaw_tokens_used_last (Gauge) - zeroclaw_active_sessions (Gauge) - zeroclaw_queue_depth (Gauge) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:06:05 +08:00
Edvard	c04f2855e4	feat(tools): expose custom memory categories in memory_store tool The MemoryCategory::Custom variant already exists in the memory backend but the memory_store tool only accepted core/daily/conversation. Now any string is accepted as a category, passing through to Custom(name) for non-builtin values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-18 12:05:37 +08:00
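
Commit 8a1e7cc7ef above describes memory recall dropping entries whose score falls below a configurable `min_relevance_score` (default 0.4), with the score coming from a hybrid of vector and keyword search. The following is a minimal sketch of that idea, not the project's implementation: the `Scored` struct, the `hybrid_score` and `filter_relevant` functions, and the weighted-sum merge are all hypothetical, and the weights are passed as parameters because the log mentions both a 70/30 and a 40/60 vector/keyword split.

```rust
/// Hypothetical recall candidate; both raw scores are assumed to lie in [0.0, 1.0].
#[derive(Debug, Clone, PartialEq)]
struct Scored {
    key: String,
    vector_score: f32,
    keyword_score: f32,
}

/// Assumed merge rule: a plain weighted sum of the two scores.
fn hybrid_score(entry: &Scored, vector_weight: f32, keyword_weight: f32) -> f32 {
    vector_weight * entry.vector_score + keyword_weight * entry.keyword_score
}

/// Keep only entries whose hybrid score reaches `min_relevance_score`,
/// returned as (key, score) pairs sorted from highest to lowest score.
fn filter_relevant(
    entries: &[Scored],
    vector_weight: f32,
    keyword_weight: f32,
    min_relevance_score: f32,
) -> Vec<(String, f32)> {
    let mut kept: Vec<(String, f32)> = entries
        .iter()
        .map(|e| (e.key.clone(), hybrid_score(e, vector_weight, keyword_weight)))
        .filter(|(_, score)| *score >= min_relevance_score)
        .collect();
    kept.sort_by(|a, b| b.1.total_cmp(&a.1));
    kept
}

fn main() {
    let entries = vec![
        Scored { key: "rust-async".into(), vector_score: 0.9, keyword_score: 0.8 },
        // Semantically "nearby" but with no keyword overlap at all.
        Scored { key: "grocery-list".into(), vector_score: 0.5, keyword_score: 0.0 },
    ];
    // 0.7 / 0.3 weights and the 0.4 threshold named in the commit message.
    for (key, score) in filter_relevant(&entries, 0.7, 0.3, 0.4) {
        println!("{key}: {score:.2}");
    }
}
```

In this sketch the second entry scores 0.35 under 70/30 weights and 0.20 under 40/60, so it is filtered out either way, which is the kind of cross-topic bleed the commit message says the threshold is meant to prevent.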

729 commits