botserver

Author	SHA1	Message	Date
Rodrigo Rodriguez (Pragmatismo)	8e539206d4	fix: KB processor works with and without llm/research features All checks were successful BotServer CI/CD / build (push) Successful in 3m55s Details - Added stub start_kb_processor() for non-llm builds - Added _pending_kb_index field for non-llm builds - Extracted KB processor logic into start_kb_processor_inner() - Removed unused is_embedding_server_ready import This ensures DriveMonitor compiles and runs correctly in production where CI builds without --features llm. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-12 21:40:06 -03:00
Rodrigo Rodriguez (Pragmatismo)	112ac51da3	fix: KB processor runs as background task, no longer blocks check_for_changes All checks were successful BotServer CI/CD / build (push) Successful in 3m50s Details - Added start_kb_processor() method: long-running background task per bot - check_gbkb_changes now queues KB folders to pending_kb_index (non-blocking) - KB processor polls pending_kb_index and processes one at a time per bot - Removed inline tokio::spawn from check_gbkb_changes that was causing 5min timeouts - Added pending_kb_index field to DriveMonitor struct This fixes salesianos DriveMonitor timeout - check_for_changes now completes in seconds instead of hanging on KB embedding/indexing. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-12 21:28:03 -03:00
Rodrigo Rodriguez (Pragmatismo)	ad998b52d4	fix: check_gbot only scans .gbot/ folder, not entire bucket All checks were successful BotServer CI/CD / build (push) Successful in 4m21s Details - Added prefix filter to list_objects_v2 call: only scans {bot}.gbot/ - Removed scanning of .gbkb and .gbdialog paths which caused 5min timeouts - This fixes salesianos DriveMonitor timeout and embed/index failure Also fixed header detection for name,value CSV format. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-12 21:02:01 -03:00
Rodrigo Rodriguez (Pragmatismo)	36fdf52780	fix: sync_gbot_config now handles CSV with or without header row All checks were successful BotServer CI/CD / build (push) Successful in 3m32s Details - Removed unconditional .skip(1) that was skipping first config line - Added header detection: skips first line only if it looks like 'key,value' header - Added validation to skip empty keys - Also fixed indentation in drive_monitor gbkb file processing This fixes the issue where config.csv changes on Drive weren't being synced to bot_configuration database table for salesianos bot. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-12 20:32:30 -03:00
Rodrigo Rodriguez (Pragmatismo)	4cd469afc3	fix: track config.csv ETag to avoid unnecessary syncs All checks were successful BotServer CI/CD / build (push) Successful in 5m2s Details - Add ETag tracking for config.csv files in DriveMonitor - Only download and sync config.csv when ETag changes - Prevents unnecessary database updates on every check - Uses __config__ prefix for config.csv state keys	2026-04-12 19:49:28 -03:00
Rodrigo Rodriguez (Pragmatismo)	af85426ed4	fix: delete orphaned .gbkb files when removed from MinIO All checks were successful BotServer CI/CD / build (push) Successful in 3m6s Details When a .gbkb file is deleted from the bucket, DriveMonitor now: - Deletes the downloaded file from work directory - When entire KB folder is empty, removes the folder too - Prevents disk accumulation of orphaned knowledge base files	2026-04-12 16:49:05 -03:00
Rodrigo Rodriguez (Pragmatismo)	135dfb06d5	fix: delete orphaned .ast files when .bas is removed from MinIO All checks were successful BotServer CI/CD / build (push) Successful in 3m4s Details When a .bas file is deleted from the bucket, DriveMonitor now: - Deletes the corresponding .ast compiled file - Deletes .bas, .mcp.json, .tool.json files from work directory - Removes the path from file_states tracking This prevents stale compiled files from accumulating in production.	2026-04-12 16:43:29 -03:00
Rodrigo Rodriguez (Pragmatismo)	9cf176008d	fix: preserve indexed status after .bas compilation All checks were successful BotServer CI/CD / build (push) Successful in 3m20s Details Fixed bug where DriveMonitor would overwrite indexed=true status after successful compilation, causing files to be recompiled on every cycle. Changes: - Track successful compilations in HashSet before acquiring write lock - Set indexed=true for successfully compiled files in merge loop - Preserve indexed status for unchanged files - Handle compilation failures with proper fail_count tracking This ensures new .bas files are compiled to .ast once and the indexed status is preserved, preventing unnecessary recompilation.	2026-04-12 16:36:03 -03:00
Rodrigo Rodriguez (Pragmatismo)	7c4ec37700	fix: properly track compilation status in DriveMonitor All checks were successful BotServer CI/CD / build (push) Successful in 3m15s Details - Do not mark .bas files as indexed unconditionally - Only set indexed=true when compile_tool() completes successfully - Reset fail_count and last_failed_at on successful compilation - Retry failed compilations automatically on next cycle - Fixes permanent compilation failure state for salesianos start.bas	2026-04-12 16:06:23 -03:00
Rodrigo Rodriguez (Pragmatismo)	73f1898b62	Add fail_count and last_failed_at to kb_documents All checks were successful BotServer CI/CD / build (push) Successful in 3m7s Details Simplified KB indexing state tracking - added columns directly to kb_documents instead of separate table. This enables per-file backoff retry logic.	2026-04-12 09:36:39 -03:00
Rodrigo Rodriguez (Pragmatismo)	256d55fc93	Add smart sleep based on fail_count to prevent excessive monitoring cycles All checks were successful BotServer CI/CD / build (push) Successful in 3m9s Details - fail_count >= 3: sleep 1 hour - fail_count >= 2: sleep 15 min - fail_count >= 1: sleep 5 min - fail_count = 0: sleep 10 sec (default)	2026-04-12 09:20:17 -03:00
Rodrigo Rodriguez (Pragmatismo)	789789e313	Fix backoff logic to be per KB folder instead of global Some checks failed BotServer CI/CD / build (push) Has been cancelled Details - Filter states by kb_folder_pattern (e.g. 'cartas/', 'proc/') - Only apply backoff based on files in that specific KB folder - Each KB folder has independent retry timing	2026-04-12 09:15:32 -03:00
Rodrigo Rodriguez (Pragmatismo)	ee273256fb	Add backoff logic to KB indexing to prevent excessive retries Some checks failed BotServer CI/CD / build (push) Has been cancelled Details - fail_count 1: wait 5 minutes before retry - fail_count 2: wait 15 minutes before retry - fail_count 3+: wait 1 hour before retry This prevents the 'already being indexed, skipping duplicate task' loop.	2026-04-12 09:13:33 -03:00
Rodrigo Rodriguez (Pragmatismo)	f48fa6d5f0	Add fail_count/last_failed_at to FileState for indexing retries All checks were successful BotServer CI/CD / build (push) Successful in 3m21s Details - Skip re-indexing files that failed 3+ times within 1 hour - Update file_states on indexing success (indexed=true, fail_count=0) - Update file_states on indexing failure (fail_count++, last_failed_at=now) - Don't skip KB indexing when embedding server not marked ready yet - Embedding server health will be detected via wait_for_server() in kb_indexer - Remove drive_monitor bypass of embedding check - let kb_indexer handle it	2026-04-12 07:47:13 -03:00
Rodrigo Rodriguez (Pragmatismo)	cdab04e999	Fix embedding health check: behavior-based instead of URL whitelist All checks were successful BotServer CI/CD / build (push) Successful in 3m32s Details - Remove hardcoded URL list for remote API detection - Try /health first, then probe with HEAD if 404/405 - Re-enable embedding server ready check in drive_monitor - No more embedding_key hack that skipped health checks entirely	2026-04-12 07:15:54 -03:00
Rodrigo Rodriguez (Pragmatismo)	2bafd57046	Temp fix: Skip embedding server ready check in DriveMonitor KB indexing All checks were successful BotServer CI/CD / build (push) Successful in 3m19s Details	2026-04-12 06:58:55 -03:00
Rodrigo Rodriguez (Pragmatismo)	7a1ec157f1	Fix KB indexing: upsert kb_collections, consistent collection names, preserve indexed flag All checks were successful BotServer CI/CD / build (push) Successful in 3m23s Details - Bug 1: check_gbkb_changes now preserves indexed=true from previous state when etag matches, preventing redundant re-indexing every cycle - Bug 2: USE KB fallback uses bot_id_short (8 chars) instead of random UUID, matching the collection name convention used by DriveMonitor - Bug 3: handle_gbkb_change now upserts into kb_collections table after successful indexing, so USE KB can find the collection at runtime - Changed ON CONFLICT DO NOTHING to DO UPDATE for kb_collections inserts - Changed process_gbkb_folder return type to Result<IndexingResult>	2026-04-11 21:26:02 -03:00
Rodrigo Rodriguez (Pragmatismo)	e81aee6221	fix: use bucket_name instead of bot_id (UUID) for file_states.json path All checks were successful BotServer CI/CD / build (push) Successful in 3m22s Details File states were stored under /opt/gbo/work/{UUID}/file_states.json but should be under /opt/gbo/work/{bucket_name}/file_states.json like other bot data (e.g. /opt/gbo/work/salesianos.gbai/) Also fixed file_states_static signature to use bucket_name consistently.	2026-04-11 20:40:23 -03:00
Rodrigo Rodriguez (Pragmatismo)	cf4a00e16e	fix: work path uses production /opt/gbo when env exists or path exists; mark .bas files indexed=true after compilation All checks were successful BotServer CI/CD / build (push) Successful in 3m20s Details - get_work_path_default/get_stack_path no longer rely on CWD-relative botserver-stack check which caused wrong output path in production when CI left that directory - DriveMonitor now marks .bas file states as indexed=true after list+compile cycle - Added compile_tool logging for work_dir path	2026-04-11 20:16:22 -03:00
Rodrigo Rodriguez (Pragmatismo)	5fdb3be5b4	fix: save file_states after prompt etag update to stop PROMPT.md download loop All checks were successful BotServer CI/CD / build (push) Successful in 3m41s Details	2026-04-11 19:21:26 -03:00
Rodrigo Rodriguez (Pragmatismo)	f4c99030aa	fix: use get_work_path() instead of get_stack_path()+data/system for work dir, add etag check for PROMPT.md downloads All checks were successful BotServer CI/CD / build (push) Successful in 3m37s Details	2026-04-11 18:42:09 -03:00
Rodrigo Rodriguez (Pragmatismo)	a131120638	Fix KB indexing: bot-specific embedding config, PROMPT.md sync, single-file streaming All checks were successful BotServer CI/CD / build (push) Successful in 4m1s Details	2026-04-11 13:27:48 -03:00
Rodrigo Rodriguez (Pragmatismo)	12988b637d	Fix KB indexing: single file streaming, dedup tracking, .ast cache All checks were successful BotServer CI/CD / build (push) Successful in 12m31s Details	2026-04-11 13:10:09 -03:00
Rodrigo Rodriguez (Pragmatismo)	821dd1d7ab	fix: Use bot-specific embedding config in DriveMonitor KB manager All checks were successful BotServer CI/CD / build (push) Successful in 3m47s Details	2026-04-11 08:55:41 -03:00
Rodrigo Rodriguez (Pragmatismo)	db2dc3fb34	Fix warnings: remove unused variables in drive_monitor All checks were successful BotServer CI/CD / build (push) Successful in 11m32s Details	2026-04-10 12:58:20 -03:00
Rodrigo Rodriguez (Pragmatismo)	26b009d4e6	Fix: Remove duplicate method definitions in DriveMonitor All checks were successful BotServer CI/CD / build (push) Successful in 4m52s Details - Removed duplicate file_state_path() and load_file_states() methods - Kept only new save_file_states_static() helper - Original methods still exist at lines 79-84 and 87-128 - Fixes compilation errors from previous commit	2026-04-10 11:31:17 -03:00
Rodrigo Rodriguez (Pragmatismo)	816d416eee	Fix DriveMonitor dispatch failure in main repo Some checks failed BotServer CI/CD / build (push) Failing after 1m31s Details - Added static save_file_states_static() helper method - Changed tokio::spawn calls to use Arc::clone instead of Arc::new(self.clone()) - This prevents double Arc wrapping which causes 'dispatch failure' errors - Fixes config.csv not syncing from bucket to database for salesianos/default bots	2026-04-10 11:24:56 -03:00
Rodrigo Rodriguez (Pragmatismo)	f526fa1daa	Fix hardcoded paths for production environment - Update get_work_path_default() to check for .env in /opt/gbo/bin/.env - Update get_stack_path() to check for .env in /opt/gbo/bin/.env - Update DriveMonitor::new() to use get_work_path() instead of hardcoded path - Update start_config_watcher() to use get_work_path() instead of hardcoded path This fixes the issue where botserver was using development paths (/home/rodriguez/src/gb/botserver-stack/data/system/work) in production instead of production paths (/opt/gbo/work).	2026-04-09 18:21:17 -03:00
Rodrigo Rodriguez (Pragmatismo)	5371047fa1	Drive monitor: download PROMPT.md from MinIO to work directory Some checks failed BotServer CI/CD / build (push) Failing after 6m18s Details - When system-prompt-file is configured in config.csv, download the file from MinIO - Save to {bot}.gbai/{bot}.gbot/ folder in work directory - Config loaded from MinIO (gbo-* buckets)	2026-04-08 20:09:39 -03:00
Rodrigo Rodriguez (Pragmatismo)	9b04af9e7b	Fix USE KB and USE WEBSITE default features compilation Some checks failed BotServer CI/CD / build (push) Failing after 10m2s Details	2026-04-07 20:14:12 -03:00
Rodrigo Rodriguez (Pragmatismo)	73002b36cc	Update botserver: various fixes and improvements All checks were successful BotServer CI/CD / build (push) Successful in 9m59s Details	2026-04-07 13:33:50 -03:00
Rodrigo Rodriguez (Pragmatismo)	e992ed3b39	Enforce Vault-only secrets: remove env var fallbacks, all secrets from Vault Some checks are pending BotServer CI/CD / build (push) Waiting to run Details - Remove all std::env::var calls except VAULT_* and PORT - get_from_env returns hardcoded defaults only (no env var reading) - Auth config, rate limits, email, analytics, calendar all use Vault - WORK_PATH replaced with get_work_path() helper reading from Vault - .env on production cleaned to only VAULT_ADDR, VAULT_TOKEN, VAULT_CACERT, PORT - All service IPs/credentials stored in Vault secret/gbo/*	2026-04-03 07:11:40 -03:00
Rodrigo Rodriguez (Pragmatismo)	3bb115266b	feat: Add GUID prefix to Qdrant collection names for KB security isolation	2026-03-19 19:51:28 -03:00
Rodrigo Rodriguez (Pragmatismo)	260a13e77d	refactor: apply various fixes across botserver Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-10 15:15:21 -03:00
Rodrigo Rodriguez (Pragmatismo)	5404e3e7ba	feat: Enhance KB context, embedding generator, and website crawler - Improved kb_context with better context management - Enhanced embedding_generator with extended functionality (+231 lines) - Updated kb_indexer with improved indexing logic - Expanded website_crawler_service capabilities (+230 lines) - Updated use_website keyword implementation - Refined bootstrap_manager and utils - Improved drive monitoring and local file monitor - Added server enhancements	2026-03-04 15:43:37 -03:00
Rodrigo Rodriguez (Pragmatismo)	2c92a81302	merge: Unify master into main - all commits unified Some checks failed BotServer CI / build (push) Failing after 6m9s Details	2026-03-01 07:43:07 -03:00
Rodrigo Rodriguez (Pragmatismo)	8f495c75ec	WIP: Local changes before merging master into main	2026-03-01 07:40:11 -03:00
Rodrigo Rodriguez (Pragmatismo)	764f058653	fix: update work directory paths to use botserver-stack/data/system/work All checks were successful BotServer CI / build (push) Successful in 7m4s Details Updated all hardcoded work/ directory references to use the correct relative path from the current working directory: - botserver-stack/data/system/work This ensures consistent file location resolution regardless of where botserver is run from (/home/rodriguez/src/gb/ or /opt/gbo/bin/). Changes: - local_file_monitor.rs: Use std::env::current_dir() for work_root - drive_monitor/mod.rs: Use work_root PathBuf for tool compilation - website_crawler_service.rs: Use std::env::current_dir() for work_path 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2026-02-22 16:20:07 -03:00
Rodrigo Rodriguez (Pragmatismo)	1856215d05	chore: update dependencies and formatting All checks were successful BotServer CI / build (push) Successful in 7m30s Details	2026-02-22 15:55:39 -03:00
Rodrigo Rodriguez	8a8008a72c	fix: Resolve compilation errors All checks were successful BotServer CI / build (push) Successful in 13m3s Details - Escape format placeholders in designer_ai.rs ({{botname}}) - Remove undefined 'prefix' filter in drive_monitor - Fix type mismatch in use_tool.rs (str vs &String) - Remove unused TextExpressionMethods import Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 22:15:35 +00:00
Rodrigo Rodriguez	e34848507d	fix: Update multiple modules for i18n and drive monitoring Some checks failed BotServer CI / build (push) Failing after 6m22s Details - Update auto_task modules (app_generator, designer_ai, intent_classifier) - Refactor use_tool.rs for better structure - Update bot core and website crawler - Improve drive_monitor and local_file_monitor - Update bootstrap module Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-14 22:06:57 +00:00
Rodrigo Rodriguez	5ea171d126	Refactor: Split large files into modular subdirectories Some checks failed BotServer CI / build (push) Failing after 1m34s Details Split 20+ files over 1000 lines into focused subdirectories for better maintainability and code organization. All changes maintain backward compatibility through re-export wrappers. Major splits: - attendance/llm_assist.rs (2074→7 modules) - basic/keywords/face_api.rs → face_api/ (7 modules) - basic/keywords/file_operations.rs → file_ops/ (8 modules) - basic/keywords/hear_talk.rs → hearing/ (6 modules) - channels/wechat.rs → wechat/ (10 modules) - channels/youtube.rs → youtube/ (5 modules) - contacts/mod.rs → contacts_api/ (6 modules) - core/bootstrap/mod.rs → bootstrap/ (5 modules) - core/shared/admin.rs → admin_*.rs (5 modules) - designer/canvas.rs → canvas_api/ (6 modules) - designer/mod.rs → designer_api/ (6 modules) - docs/handlers.rs → handlers_api/ (11 modules) - drive/mod.rs → drive_handlers.rs, drive_types.rs - learn/mod.rs → types.rs - main.rs → main_module/ (7 modules) - meet/webinar.rs → webinar_api/ (8 modules) - paper/mod.rs → (10 modules) - security/auth.rs → auth_api/ (7 modules) - security/passkey.rs → (4 modules) - sources/mod.rs → sources_api/ (5 modules) - tasks/mod.rs → task_api/ (5 modules) Stats: 38,040 deletions, 1,315 additions across 318 files Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-12 21:09:30 +00:00
Rodrigo Rodriguez	fc0926ffff	WIP: Multiple code improvements from previous session - Fix various compiler warnings - Update analytics, auto_task, and basic keywords - Improve security, channels, and core modules - Update designer, directory, and drive modules - Fix embedded UI and LLM modules Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-08 12:25:37 +00:00
Rodrigo Rodriguez (Pragmatismo)	355215c2a2	Update: refactor migrations, update source files, and add new features	2026-02-04 13:29:29 -03:00
Rodrigo Rodriguez (Pragmatismo)	0a24cd4b50	Fix build errors and unused imports in core, security and package_manager modules	2026-01-24 22:04:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	6fa52e1dd8	feat: implement feature bundling architecture and fix conditional compilation - Restructured Cargo.toml with Bundle Pattern for easy feature selection - Added feature bundles: tasks → automation + drive + monitoring - Applied conditional compilation guards throughout codebase: * AppState fields (drive, cache, task_engine, task_scheduler) * main.rs initialization (S3, Redis, Tasks) * SessionManager Redis usage * bootstrap S3/Drive operations * compiler task scheduling * shared module Task/NewTask exports - Eliminated all botserver compilation warnings - Minimal build now compiles successfully - Accepted core dependencies: automation (Rhai), drive (S3), cache (Redis) - Created DEPENDENCY_FIX_PLAN.md with complete documentation Minimal feature set: chat + automation + drive + cache Verified: cargo check -p botserver --no-default-features --features minimal ✅	2026-01-23 13:14:20 -03:00
Rodrigo Rodriguez (Pragmatismo)	66abce913f	Feature gating refactor: modular compilation with minimal feature set	2026-01-22 19:45:18 -03:00
Rodrigo Rodriguez (Pragmatismo)	033bb504b9	Various updates: dependencies, features, and bug fixes	2026-01-16 11:29:22 -03:00
Rodrigo Rodriguez (Pragmatismo)	479950945b	feat(auth): Add OTP password display on bootstrap and fix Zitadel login flow - Add generate_secure_password() for OTP generation during admin bootstrap - Display admin credentials (username/password) in console on first run - Save credentials to ~/.gb-setup-credentials file - Fix Zitadel client to support PAT token authentication - Replace OAuth2 password grant with Zitadel Session API for login - Fix get_current_user to fetch user data from Zitadel session - Return session_id as access_token for proper authentication - Set email as verified on user creation to skip verification - Add password grant type to OAuth application config - Update directory_setup to include proper redirect URIs	2026-01-06 22:56:35 -03:00
Rodrigo Rodriguez (Pragmatismo)	29b80f597c	Fix email_accounts -> user_email_accounts table name typo in list_emails_htmx	2026-01-04 08:48:27 -03:00
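
Note on the retry backoff described in commits ee273256fb and 256d55fc93: the thresholds listed there (no failures: 10 seconds, 1 failure: 5 minutes, 2 failures: 15 minutes, 3 or more: 1 hour) can be sketched in Rust roughly as follows. This is only an illustration of the schedule as the commit messages state it, not the code in the repository; the function names `retry_delay` and `should_retry` are hypothetical, and treating the 10-second monitoring sleep and the retry wait as one schedule is an assumption.

```rust
use std::time::Duration;

/// Delay to wait before the next attempt for a given number of failures.
/// Thresholds are taken from the commit messages; the function name is hypothetical.
pub fn retry_delay(fail_count: u32) -> Duration {
    match fail_count {
        0 => Duration::from_secs(10),      // default cycle: 10 seconds
        1 => Duration::from_secs(5 * 60),  // first failure: 5 minutes
        2 => Duration::from_secs(15 * 60), // second failure: 15 minutes
        _ => Duration::from_secs(60 * 60), // three or more failures: 1 hour
    }
}

/// Returns true when enough time has passed since the last failure to try again.
/// A file or folder that has never failed is always eligible (assumption).
pub fn should_retry(fail_count: u32, since_last_failure: Duration) -> bool {
    fail_count == 0 || since_last_failure >= retry_delay(fail_count)
}
```

Per commit 789789e313, the backoff is applied per KB folder, so under this sketch a caller would compute `fail_count` from the file states of one folder (for example 'cartas/') before calling `should_retry`.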


64 commits