botserver

Author	SHA1	Message	Date
Rodrigo Rodriguez (Pragmatismo)	7b4753af0d	fix: init_redis tries both no-password and password URLs for Valkey All checks were successful BotServer CI/CD / build (push) Successful in 27s Details - Root cause: Valkey in prod runs without password but Vault stores one - Previous code only tried password URL, got AUTH failed - Fix: try no-password URL first, then password URL as fallback - Also removed unused cache_url variable and cleaned up retry logic	2026-04-02 07:36:16 -03:00
Rodrigo Rodriguez (Pragmatismo)	dae0feb6a5	fix: SecretPaths match Vault seeding paths (gbo/cache not gbo/system/cache) All checks were successful BotServer CI/CD / build (push) Successful in 3m49s Details - Root cause: Vault seeding writes to secret/gbo/cache but code reads gbo/system/cache - kv2::read prepends secret/ so it looks for secret/gbo/system/cache (wrong) - Fix: update SecretPaths to match seeding paths (gbo/cache, gbo/drive, etc.) - Testing: compiles clean, paths now match vault kv list output	2026-04-02 07:16:32 -03:00
Rodrigo Rodriguez (Pragmatismo)	f118c74cf1	fix: init_redis uses async Vault call instead of sync block_on (fixes panic) All checks were successful BotServer CI/CD / build (push) Successful in 5m40s Details - Root cause: get_cache_config() uses runtime.block_on() which panics when called from within an async runtime - Fix: call SecretsManager::get_secret() directly with .await - Testing: compiles clean, no runtime nesting issues	2026-04-02 06:59:21 -03:00
Rodrigo Rodriguez (Pragmatismo)	b3edf21d21	fix: init_redis fetches cache password from Vault (fixes connection timeout) All checks were successful BotServer CI/CD / build (push) Successful in 4m59s Details - Root cause: init_redis() used redis://localhost:6379 without password - Valkey requires authentication, causing connection timeouts - Fix: use get_cache_config() from SecretsManager to build URL with password - Falls back to env vars (CACHE_URL/REDIS_URL/VALKEY_URL) if set	2026-04-01 20:17:37 -03:00
Rodrigo Rodriguez (Pragmatismo)	3c9e4ba6e7	fix: cache_health_check uses ss instead of nc (nc missing in prod container) All checks were successful BotServer CI/CD / build (push) Successful in 4m42s Details - Root cause: prod container lacks nc (netcat), causing fallback to valkey-cli ping - valkey-cli ping hangs indefinitely when Valkey requires password auth - Fix: use ss -tlnp as primary check (always available), nc as fallback - Testing: verified ss is available in prod, nc is not	2026-04-01 20:06:13 -03:00
Rodrigo Rodriguez (Pragmatismo)	d098961142	fix: Bootstrap checks stack/.env path in addition to ./.env All checks were successful BotServer CI/CD / build (push) Successful in 4m39s Details - Production has .env in botserver-stack/.env not ./.env - Checks both locations to detect completed bootstrap - Fixes E0716: use let bindings for Path borrows	2026-04-01 19:30:08 -03:00
Rodrigo Rodriguez (Pragmatismo)	8fd3254334	fix: Bootstrap checks stack/.env path in addition to ./.env Some checks failed BotServer CI/CD / build (push) Failing after 1m25s Details - Production has .env in botserver-stack/.env not ./.env - Checks both locations to detect completed bootstrap - Prevents full re-bootstrap on restart in production	2026-04-01 19:26:32 -03:00
Rodrigo Rodriguez (Pragmatismo)	318367d439	fix: Valkey health check uses nc first (avoids password hang) All checks were successful BotServer CI/CD / build (push) Successful in 3m58s Details - nc -z checks port connectivity instantly (no auth needed) - valkey-cli ping as fallback (hangs when password required) - Fixes bootstrap hang on production where Valkey has Vault password	2026-04-01 18:52:04 -03:00
Rodrigo Rodriguez (Pragmatismo)	c26e483cc9	fix: All services check health before starting (idempotent bootstrap) All checks were successful BotServer CI/CD / build (push) Successful in 4m9s Details - Tables (PostgreSQL): pg_isready health check before start - Drive (MinIO): /minio/health/live check before start - ALM (Forgejo): HTTP health check before start - ALM CI (Forgejo Runner): pgrep check before start - Valkey: health check uses absolute path to valkey-cli - Vault, Qdrant, Zitadel: already had health checks - Result: no duplicate starts, no hangs on restart	2026-04-01 18:28:54 -03:00
Rodrigo Rodriguez (Pragmatismo)	ba7f1ba5eb	fix: Valkey health check uses absolute path to valkey-cli Some checks failed BotServer CI/CD / build (push) Has been cancelled Details - Use BOTSERVER_STACK_PATH/bin/cache/bin/valkey-cli instead of relying on PATH - Remove bash /dev/tcp fallback (unreliable in restricted environments) - Falls back to redis-cli and nc if valkey-cli unavailable	2026-04-01 18:11:26 -03:00
Rodrigo Rodriguez (Pragmatismo)	68ef554132	fix: Vault as single source of truth - credentials + location for all services All checks were successful BotServer CI/CD / build (push) Successful in 4m53s Details - Qdrant health check: recognize 'healthz check passed' response (fixes 45s timeout) - seed_vault_defaults: add host/port/url/grpc_port for ALL 10 services - fetch_vault_credentials: fetch ALL services via generic loop (drive, cache, tables, vectordb, directory, llm, meet, alm, encryption) - vectordb URL: fix https://localhost:6334 -> http://localhost:6333 in all config getters - get_from_env: add host/port/grpc_port for vectordb fallback - Tested: .reset (fresh install) + .restart (idempotent) - zero errors	2026-04-01 16:46:16 -03:00
Rodrigo Rodriguez (Pragmatismo)	fb2e5242da	fix: Vault seeding, service health checks, and restart idempotency All checks were successful BotServer CI/CD / build (push) Successful in 55m52s Details - Replace hardcoded passwords with generate_random_string() for all Vault-seeded services - Add valkey-cli, nc to SafeCommand allowlist; fix PATH in all 4 execution methods - Fix empty Vault KV values ('none' placeholder) preventing 'Failed to parse K=V' errors - Fix special chars in generated passwords triggering shell injection false positives - Add ALM app.ini creation with absolute paths for Forgejo CLI - Increase Qdrant timeout 15s→45s, ALM wait 5s→20s - Persist file_states and kb_states to disk for .bas/KB idempotency across restarts - Add duplicate check to use_website registration (debug log for existing) - Remove dead code (SERVER_START_EPOCH, server_epoch) - Add generate_random_string() to shared mod.rs, remove duplicates	2026-04-01 12:22:57 -03:00
Rodrigo Rodriguez (Pragmatismo)	3e46a16469	fix: Seed default credentials into Vault after initialization Some checks failed BotServer CI/CD / build (push) Failing after 3h13m28s Details - Add seed_vault_defaults() to write default creds for all components (drive, cache, tables, directory, email, llm, encryption, meet, vectordb, alm) - Call seed_vault_defaults() after KV2 enable in initialize_vault_local() - Call seed_vault_defaults() in recover_existing_vault() for recovery path - Rewrite fetch_vault_credentials() to use SafeCommand directly instead of safe_sh_command, avoiding '//' shell injection false positive on URLs - Components like Drive now get credentials from Vault instead of 403 errors	2026-03-31 22:19:09 -03:00
Rodrigo Rodriguez (Pragmatismo)	9919a8321c	fix: Use SafeCommand directly for vault health check to avoid shell injection false positive All checks were successful BotServer CI/CD / build (push) Successful in 6m46s Details - Replace safe_sh_command with SafeCommand::new("curl").args() in vault_health_check() - The URL contains https:// which triggered '//' pattern detection in shell command - Direct SafeCommand bypasses shell parsing, URL passed as single argument - Add vault data directory existence check before recovery attempt - Prevents 'Dangerous pattern // detected' errors during bootstrap	2026-03-31 21:34:04 -03:00
Rodrigo Rodriguez (Pragmatismo)	07a6c1edb3	Merge commit '582ea634' All checks were successful BotServer CI/CD / build (push) Successful in 7m38s Details	2026-03-31 21:10:25 -03:00
Rodrigo Rodriguez (Pragmatismo)	582ea634e7	fix: Vault bootstrap recovery for sealed but initialized instances - Fix vault_health_check() stub that always returned false - Add recover_existing_vault() to handle Vault with existing data but no init.json - Add unseal_vault() helper to unseal with existing vault-unseal-keys - Detect initialized Vault via health endpoint or data directory presence - Prevents bootstrap failure when reset.sh deletes init.json but Vault data persists Root cause: vault_health_check() was a stub returning false, causing bootstrap to always try vault operator init on already-initialized (but sealed) Vault, which failed with connection refused. This cascaded to all services failing to fetch credentials from Vault.	2026-03-31 20:49:29 -03:00
Rodrigo Rodriguez (Pragmatismo)	4ae16017ff	Merge commit '644dfe2d' Some checks failed BotServer CI/CD / build (push) Has been cancelled Details	2026-03-31 19:57:57 -03:00
Rodrigo Rodriguez (Pragmatismo)	644dfe2d19	fix: Improve .gbdialog file detection for nested paths	2026-03-31 19:57:33 -03:00
Rodrigo Rodriguez (Pragmatismo)	2fa59057fa	fix: Resolve migration error, Vault 403, cache timeout, and shell injection false positives Some checks failed BotServer CI/CD / build (push) Has been cancelled Details - Fix migration 6.2.5: Create lost_reason column before VIEW that references it - Fix Vault 403: Enable KV2 secrets engine after initialization - Fix cache timeout: Increase Valkey readiness wait from 12s to 30s - Fix command_guard: Remove () from forbidden chars (safe in std::process::Command)	2026-03-31 19:55:16 -03:00
Rodrigo Rodriguez (Pragmatismo)	b83b4ffc4d	fix: Remove server_epoch() from start_bas_executed Redis key The epoch caused a new key to be created every second, bypassing the 'already executed' check and running start.bas multiple times, resulting in triplicated suggestions.	2026-03-21 20:40:25 -03:00
Rodrigo Rodriguez (Pragmatismo)	1132983064	feat(kb): add with_bot_config to load embedding from bot config - Adds KnowledgeBaseManager::with_default_config() as alias to new() - Adds KnowledgeBaseManager::with_bot_config() to load embedding_url, embedding_model, and qdrant config from bot's config.csv - Updates bootstrap to use with_bot_config with default_bot_id - Enables per-bot embedding configuration instead of global env vars	2026-03-21 18:55:36 -03:00
Rodrigo Rodriguez (Pragmatismo)	622f1222dc	fix(websocket): force start.bas execution on connection to restore chat on page reload while preventing duplicate execution	2026-03-21 16:38:03 -03:00
Rodrigo Rodriguez (Pragmatismo)	363c056bab	fix(bootstrap): add strict timeout to Redis connection initialization to prevent hanging on dropped tcp packets	2026-03-21 14:37:04 -03:00
Rodrigo Rodriguez (Pragmatismo)	adb26330d2	fix: Simple 50ms timeout for Redis connection	2026-03-21 10:48:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	9d6c2686f1	fix: Remove connection caching (no Clone)	2026-03-21 10:37:49 -03:00
Rodrigo Rodriguez (Pragmatismo)	b3ce293487	fix: Clean up duplicate Redis code and fix WebSocket log level	2026-03-21 10:30:19 -03:00
Rodrigo Rodriguez (Pragmatismo)	cfe6453d1e	perf: Add shared Redis connection pool with 50ms timeout	2026-03-21 10:14:10 -03:00
Rodrigo Rodriguez (Pragmatismo)	43fd40aed9	fix: Add timeout to Redis get_connection to prevent blocking - Added get_redis_connection() helper with 2s timeout - All cache operations now fail fast if Valkey is not ready - Prevents start.bas from blocking for minutes waiting for cache - Changes: add_suggestion.rs	2026-03-21 09:34:41 -03:00
Rodrigo Rodriguez (Pragmatismo)	e5f3380469	perf: Fix USE TOOL thread contention by removing runtime creation - Replace thread spawn + tokio runtime creation with block_in_place - Eliminates 10+ runtime creations per start.bas execution - Reduces USE TOOL execution from ~2min to milliseconds - Fixes suggestions not appearing due to start.bas timeout	2026-03-20 22:54:19 -03:00
Rodrigo Rodriguez (Pragmatismo)	705d925947	fix: Allow anonymous access to /api/suggestions for bot chat	2026-03-20 18:44:08 -03:00
Rodrigo Rodriguez (Pragmatismo)	d19984fa07	feat: Improve KB keywords and package manager installer	2026-03-20 17:38:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	57a8b7f8f0	Fix: use pgrep to check valkey/qdrant running state - valkey check_cmd: replaced valkey-cli ping (network) with pgrep -x valkey-server - qdrant check_cmd: replaced curl https check (TLS error 35) with pgrep -x qdrant - Prevents duplicate instances on each botserver restart	2026-03-20 15:40:22 -03:00
Rodrigo Rodriguez (Pragmatismo)	3bb115266b	feat: Add GUID prefix to Qdrant collection names for KB security isolation	2026-03-19 19:51:28 -03:00
Rodrigo Rodriguez (Pragmatismo)	d6ebd0cf6e	fix: send suggestions separately from TALK, clear Redis keys for refresh - Remove suggestions fetching from TALK function - WebSocket handler now fetches and sends suggestions after start.bas executes - Clear suggestions and start_bas_executed keys to allow re-run on refresh - Decouple TALK from suggestions handling	2026-03-19 09:53:39 -03:00
Rodrigo Rodriguez (Pragmatismo)	2fcfb05fd6	fix: USE_WEBSITE non-blocking - timeout 3s, never blocks start.bas	2026-03-18 19:41:23 -03:00
Rodrigo Rodriguez (Pragmatismo)	6e594d68dd	Fix: Wait for send_task to be ready before executing start.bas	2026-03-18 14:38:46 -03:00
Rodrigo Rodriguez (Pragmatismo)	8f073a15fd	Fix: Wait for send_task to be ready before executing start.bas so TALK messages work	2026-03-18 14:18:05 -03:00
Rodrigo Rodriguez (Pragmatismo)	1a9208b88e	Fix: Use bot_id instead of user_id in TALK suggestions Redis key	2026-03-18 11:05:56 -03:00
Rodrigo Rodriguez (Pragmatismo)	ec4fcc094a	Fix: Use bot_id instead of user_id in suggestion Redis keys - Changed all suggestion key formats from suggestions:user_id:session_id to suggestions:bot_id:session_id - Fixes bug where suggestions were stored under wrong key, preventing frontend from retrieving them - Affects: CLEAR SUGGESTIONS, ADD SUGGESTION, ADD SUGGESTION TEXT, ADD_SUGGESTION_TOOL - Impact: Suggestions now correctly associated with bot, not user	2026-03-18 10:39:27 -03:00
Rodrigo Rodriguez (Pragmatismo)	346c83871a	Fix Vault TLS certificate to include Subject Alternative Name for modern client compatibility	2026-03-18 09:30:27 -03:00
Rodrigo Rodriguez (Pragmatismo)	ed2a1d83f0	fix: include server epoch in start_bas_executed key to invalidate after restart	2026-03-17 15:45:02 -03:00
Rodrigo Rodriguez (Pragmatismo)	492530ee77	Fix panic: Cannot start a runtime from within a runtime in secrets module Removed tokio::runtime::Handle::block_on() calls that were causing panics when called from within async contexts. Now uses direct fallback to environment variables instead.	2026-03-17 15:04:40 -03:00
Rodrigo Rodriguez (Pragmatismo)	af7441ebcb	fix: generate mcp.json for tools without PARAM declarations Tools using only USE KB or other keywords without PARAM were not getting .mcp.json generated, causing USE TOOL to silently skip them.	2026-03-17 12:20:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	7906a9bf32	security: add CoreDNS ACL hardening and fail2ban proxy jail - dns_hardener.rs: apply ACL (anti-amplification) + errors plugin to Corefile via lxc - fail2ban.rs: add apply_proxy() for caddy-http-flood jail in pragmatismo-proxy container - security_fix.rs: integrate dns and fail2ban_proxy steps into run_security_fix/status - mod.rs: export dns_hardener module	2026-03-17 11:18:19 -03:00
Rodrigo Rodriguez (Pragmatismo)	c340f95da1	security: bind MinIO and Valkey to 127.0.0.1 only Some checks failed BotServer CI / build (push) Failing after 6m44s Details	2026-03-17 01:32:21 -03:00
Rodrigo Rodriguez (Pragmatismo)	9fc38b80d3	Fix clippy type complexity warnings	2026-03-17 01:12:05 -03:00
Rodrigo Rodriguez (Pragmatismo)	ab1f2df476	Read Drive config from Vault at runtime with fallback defaults Some checks failed BotServer CI / build (push) Failing after 7m26s Details	2026-03-17 00:00:36 -03:00
Rodrigo Rodriguez (Pragmatismo)	b57c53e2ff	Remove WORKFLOW_PLAN.md (moved to gb/prompts) Some checks failed BotServer CI / build (push) Failing after 7m22s Details	2026-03-16 23:40:56 -03:00
Rodrigo Rodriguez (Pragmatismo)	7849031ffe	Move WORKFLOW_PLAN.md to src/basic/ Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-16 23:38:35 -03:00
Rodrigo Rodriguez (Pragmatismo)	ec1e203859	HEAR: add configurable timeout (hear-timeout-secs, default 1h) Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-16 23:12:45 -03:00

1 2 3 4 5 ...

984 commits