botserver

Author	SHA1	Message	Date
Rodrigo Rodriguez (Pragmatismo)	da9facf036	fix: add 5s connect_timeout to LLM HTTP client so unreachable APIs fail fast All checks were successful BotServer CI/CD / build (push) Successful in 3m52s Details Without connect_timeout, reqwest can hang for the full 60s timeout when the remote server is unreachable (DNS, TCP connect, etc.). Now fails in 5s max for connection issues, 30s for full request. This means one user's LLM failure no longer blocks new users for a full minute — the channel closes quickly and the WebSocket is freed. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 23:54:50 -03:00
Rodrigo Rodriguez (Pragmatismo)	3ec72f6121	fix: add 60s timeout to OpenAI-compatible HTTP client preventing LLM deadlock All checks were successful BotServer CI/CD / build (push) Successful in 4m2s Details reqwest::Client::new() has no timeout — when external APIs (NVIDIA, Groq, etc.) hang or throttle, the request blocks forever, freezing the entire response pipeline for the user. Also add std::time::Duration import to llm/mod.rs. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 23:31:12 -03:00
Rodrigo Rodriguez (Pragmatismo)	25d6d2fd57	fix: eliminate LLM keyword deadlock with isolated worker thread All checks were successful BotServer CI/CD / build (push) Successful in 3m32s Details The previous fix used Handle::current().block_on() which deadlocks when the Rhai engine runs on a Tokio worker thread — it blocks the very thread the async task needs to make progress. New approach: spawn a dedicated background thread with its own single-threaded Tokio runtime, communicate via mpsc channel with a 45s timeout. This completely isolates the LLM runtime from the caller's runtime, eliminating any possibility of thread starvation or nested-runtime deadlock. Also remove unused 'trace' import from llm/mod.rs. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 23:20:10 -03:00
Rodrigo Rodriguez (Pragmatismo)	7d911194f3	fix: disable all thinking detection to prevent deadlock All checks were successful BotServer CI/CD / build (push) Successful in 3m36s Details	2026-04-13 22:47:27 -03:00
Rodrigo Rodriguez (Pragmatismo)	6acf5fb4c0	fix: add reasoning_split=false to Minimax API calls All checks were successful BotServer CI/CD / build (push) Successful in 3m8s Details	2026-04-13 21:22:02 -03:00
Rodrigo Rodriguez (Pragmatismo)	ea8857ec8a	debug: trace LLM delta content for Minimax All checks were successful BotServer CI/CD / build (push) Successful in 4m7s Details	2026-04-13 21:08:46 -03:00
Rodrigo Rodriguez (Pragmatismo)	22e94f32ed	fix: filter reasoning when content exists (Minimax/GLM/Kimi) All checks were successful BotServer CI/CD / build (push) Successful in 3m57s Details	2026-04-13 20:58:58 -03:00
Rodrigo Rodriguez (Pragmatismo)	517d5435a9	fix: add max_tokens 131072 to OpenAI provider for minimax All checks were successful BotServer CI/CD / build (push) Successful in 4m25s Details	2026-04-13 20:26:03 -03:00
Rodrigo Rodriguez (Pragmatismo)	81c60ceb25	feat: add Kimi client and GLM thinking mode support, fix tool exec direct return All checks were successful BotServer CI/CD / build (push) Successful in 6m22s Details	2026-04-13 17:36:31 -03:00
Rodrigo Rodriguez (Pragmatismo)	60fd3dbbc4	fix: handle both reasoning_content and reasoning fields for NVIDIA API All checks were successful BotServer CI/CD / build (push) Successful in 3m12s Details	2026-04-13 16:21:05 -03:00
Rodrigo Rodriguez (Pragmatismo)	8ddcde4830	fix: detect NVIDIA API as GLM provider, handle full URL path Some checks failed BotServer CI/CD / build (push) Has been cancelled Details	2026-04-13 16:18:00 -03:00
Rodrigo Rodriguez (Pragmatismo)	498c771d7b	feat: add thinking indicator for reasoning models (GLM4.7, Kimi K2.5) All checks were successful BotServer CI/CD / build (push) Successful in 3m27s Details - Show thinking indicator while LLM is in reasoning mode - Skip reasoning content (thinking text) from user response - Only show actual HTML content after thinking ends Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 15:35:22 -03:00
Rodrigo Rodriguez (Pragmatismo)	3e99235a49	fix: support reasoning models (GLM4.7, Kimi K2.5) - use reasoning_content when content is null All checks were successful BotServer CI/CD / build (push) Successful in 3m19s Details - GLM4.7 and Kimi K2.5 send response in 'reasoning_content' field, 'content' is null - Prefer 'content' for normal models, fallback to 'reasoning_content' for reasoning models - Fixes blank white screen when using z-ai/glm4.7 model Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 15:18:21 -03:00
Rodrigo Rodriguez (Pragmatismo)	c5d30adebe	revert: restore llm/mod.rs to stable April 9 version All checks were successful BotServer CI/CD / build (push) Successful in 3m26s Details Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 15:07:19 -03:00
Rodrigo Rodriguez (Pragmatismo)	f8b47d1ac2	refactor: unify BASIC compilation into BasicCompiler only, runtime uses ScriptService::run() on pre-compiled .ast Some checks failed BotServer CI/CD / build (push) Has been cancelled Details - Move all preprocessing transforms (convert_multiword_keywords, preprocess_llm_keyword, convert_while_wend_syntax, predeclare_variables) into BasicCompiler::preprocess_basic so .ast files are fully preprocessed by Drive Monitor - Replace ScriptService compile/compile_preprocessed/compile_tool_script with single run(ast_content) that does engine.compile() + eval_ast_with_scope() - Remove .bas fallback in tool_executor and start.bas paths - .ast only - Remove dead code: preprocess_basic_script, normalize_variables_to_lowercase, convert_save_for_tools, parse_save_parts, normalize_word - Fix: USE KB 'cartas' in tool .ast now correctly converted to USE_KB('cartas') during compilation, ensuring KB context injection works after tool execution - Fix: add trace import in llm/mod.rs	2026-04-13 14:05:55 -03:00
Rodrigo Rodriguez (Pragmatismo)	723407cfd6	fix: add 60s timeout to LLM stream reads and add concurrent scan guard All checks were successful BotServer CI/CD / build (push) Successful in 3m53s Details - Add tokio timeout to SSE stream reads in OpenAI client (60s) - Prevents indefinite hang when Kimi/Nvidia stops responding - Add scanning AtomicBool to prevent concurrent check_gbkb_changes calls - Skip GBKB scan entirely when all KBs already indexed in Qdrant Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-13 12:58:11 -03:00
Rodrigo Rodriguez (Pragmatismo)	dd68cdbe6c	fix: remove hardcoded salesianos, strip think tags globally, block reasoning_content leak All checks were successful BotServer CI/CD / build (push) Successful in 6m38s Details - drive_monitor: replace hardcoded salesianos.gbot with dynamic bot_name - llm/mod.rs: stop falling back to reasoning_content as content - llm/claude.rs: same fix for Claude handler - deepseek_r3: export strip_think_tags for reuse - gpt_oss_20b: use strip_think_tags so all models strip tags - gpt_oss_120b: use strip_think_tags so all models strip tags	2026-04-13 09:04:22 -03:00
Rodrigo Rodriguez (Pragmatismo)	2f3dd957e3	fix: resolve kb_collections and kb_group_associations imports for directory feature All checks were successful BotServer CI/CD / build (push) Successful in 7m50s Details - Extract kb_collections and kb_group_associations into dedicated schema/kb.rs module - Gate kb module behind rbac feature (directory depends on rbac) - Remove duplicate definitions from research.rs - Fix import paths in directory/groups/kbs.rs - Remove dead rbac_kb imports from settings/rbac.rs - Gate llm::local module behind llm feature to fix missing set_embedding_server_ready Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>	2026-04-12 12:48:42 -03:00
Rodrigo Rodriguez (Pragmatismo)	be3e4c4e54	Fix: Handle 'reasoning' field from NVIDIA kimi-k2.5 model All checks were successful BotServer CI/CD / build (push) Successful in 3m6s Details	2026-04-11 22:58:50 -03:00
Rodrigo Rodriguez (Pragmatismo)	47cb470c8e	Fix: Handle reasoning_content from NVIDIA reasoning models (gpt-oss-120b) All checks were successful BotServer CI/CD / build (push) Successful in 3m16s Details	2026-04-11 22:30:39 -03:00
Rodrigo Rodriguez (Pragmatismo)	a4a3837c4c	fix: critical bugs - LLM context truncation, bot creation, S3 endpoint, vectordb seed All checks were successful BotServer CI/CD / build (push) Successful in 3m28s Details 1. Fix model.starts_with('') always true - was limiting ALL models to 768 tokens (local llama limit), truncating system prompts and KB context. Now only applies when model=='local' or empty string, default is 32k tokens. 2. Fix create_bot_from_drive missing NOT NULL columns (llm_provider, context_provider) - bots auto-created from S3 buckets failed to persist. 3. Fix S3 endpoint URL construction missing port 9100. 4. Fix Vault seed: vectordb.url was empty string, now defaults to http://localhost:6333. 5. Fix Vault credential regeneration on recovery - added vault_seeds_exist(). 6. Fix CA cert path for Vault TLS (botserver-stack vs botserver-stack). 7. Add bot verification after insert in create_bot_from_drive.	2026-04-11 17:56:03 -03:00
Rodrigo Rodriguez (Pragmatismo)	5576378b3f	Update botserver: Multiple improvements across core modules All checks were successful BotServer CI/CD / build (push) Successful in 10m41s Details	2026-04-11 07:33:32 -03:00
Rodrigo Rodriguez (Pragmatismo)	2b2b386f5e	Fix duplicate endpoint path in LLM URL All checks were successful BotServer CI/CD / build (push) Successful in 5m58s Details	2026-04-09 22:51:32 -03:00
Rodrigo Rodriguez (Pragmatismo)	49e9f419f3	Fix: export observability module from llm/mod.rs Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-16 22:00:55 -03:00
Rodrigo Rodriguez (Pragmatismo)	e98de24fe6	chore: update submodules All checks were successful BotServer CI / build (push) Successful in 9m56s Details	2026-03-10 19:39:31 -03:00
Rodrigo Rodriguez (Pragmatismo)	1053c86a73	fix: whatsapp dynamic routing and openai tool call accumulation All checks were successful BotServer CI / build (push) Successful in 13m40s Details	2026-03-10 17:19:17 -03:00
Rodrigo Rodriguez (Pragmatismo)	786d404938	feat: handle 429 rate limit in OpenAI non-stream generate All checks were successful BotServer CI / build (push) Successful in 11m7s Details	2026-03-10 15:26:10 -03:00
Rodrigo Rodriguez (Pragmatismo)	f34d401c2e	feat: handle 429 rate limit in OpenAI client Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-10 15:21:40 -03:00
Rodrigo Rodriguez (Pragmatismo)	260a13e77d	refactor: apply various fixes across botserver Some checks failed BotServer CI / build (push) Has been cancelled Details	2026-03-10 15:15:21 -03:00
Rodrigo Rodriguez (Pragmatismo)	82bfd0a443	Fix Bedrock config for OpenAI GPT-OSS models All checks were successful BotServer CI / build (push) Successful in 12m35s Details	2026-03-10 12:36:24 -03:00
Rodrigo Rodriguez (Pragmatismo)	77c35ccde5	feat: add WhatsApp rate limiting and LLM hallucination detection All checks were successful BotServer CI / build (push) Successful in 11m51s Details	2026-03-09 17:22:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	1856215d05	chore: update dependencies and formatting All checks were successful BotServer CI / build (push) Successful in 7m30s Details	2026-02-22 15:55:39 -03:00
Rodrigo Rodriguez	b1118f977d	fix: Correct parameter names in tool .bas files to match database schema - Tool 06: Change tipoExibicao to tipoDescricao (matches pedidos_uso_imagem table) - Tool 07: Change tipoExibicao to categoriaDescricao (matches licenciamentos table) - Both tools now compile and execute successfully with database inserts Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-18 17:51:47 +00:00
Rodrigo Rodriguez	5ea171d126	Refactor: Split large files into modular subdirectories Some checks failed BotServer CI / build (push) Failing after 1m34s Details Split 20+ files over 1000 lines into focused subdirectories for better maintainability and code organization. All changes maintain backward compatibility through re-export wrappers. Major splits: - attendance/llm_assist.rs (2074→7 modules) - basic/keywords/face_api.rs → face_api/ (7 modules) - basic/keywords/file_operations.rs → file_ops/ (8 modules) - basic/keywords/hear_talk.rs → hearing/ (6 modules) - channels/wechat.rs → wechat/ (10 modules) - channels/youtube.rs → youtube/ (5 modules) - contacts/mod.rs → contacts_api/ (6 modules) - core/bootstrap/mod.rs → bootstrap/ (5 modules) - core/shared/admin.rs → admin_*.rs (5 modules) - designer/canvas.rs → canvas_api/ (6 modules) - designer/mod.rs → designer_api/ (6 modules) - docs/handlers.rs → handlers_api/ (11 modules) - drive/mod.rs → drive_handlers.rs, drive_types.rs - learn/mod.rs → types.rs - main.rs → main_module/ (7 modules) - meet/webinar.rs → webinar_api/ (8 modules) - paper/mod.rs → (10 modules) - security/auth.rs → auth_api/ (7 modules) - security/passkey.rs → (4 modules) - sources/mod.rs → sources_api/ (5 modules) - tasks/mod.rs → task_api/ (5 modules) Stats: 38,040 deletions, 1,315 additions across 318 files Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-12 21:09:30 +00:00
Rodrigo Rodriguez	76abcea5e9	Fix start.bas repeated execution and add tool calling system - Add Redis-based tracking to prevent start.bas from running repeatedly when clicking suggestion buttons. start.bas now executes only once per session with a 24-hour expiration on the tracking key. - Add generic tool executor (ToolExecutor) for parsing and executing tool calls from any LLM provider. Works with Claude, OpenAI, and other providers that use standard tool calling formats. - Update both start.bas execution paths (WebSocket handler and LLM message handler) to check Redis before executing. - Fix suggestion duplication by clearing suggestions from Redis after fetching them. - Add rate limiter for LLM API calls. - Improve error handling and logging throughout. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-10 13:49:54 +00:00
Rodrigo Rodriguez	6215908536	Add ConfigWatcher and fix model routing New features: - Add ConfigWatcher for hot-reloading config.csv from ~/data - Add LocalFileMonitor for watching ~/data/*.gbai directories - Add GLM LLM provider implementation - Add tool context for LLM tool calling Bug fixes: - Fix model routing to respect session → bot → default hierarchy - Fix ConfigWatcher to handle local embedded (llm-server=true) - Skip DriveMonitor for default bot (managed via ConfigWatcher) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-09 15:10:27 +00:00
Rodrigo Rodriguez (Pragmatismo)	5fb4c889b7	fix(llm-config): Fix ConfigManager fallback logic for LLM configuration Some checks failed GBCI / build (push) Failing after 12m26s Details - Fix ConfigManager to treat 'none', 'null', 'n/a', and empty values as placeholders and fall back to default bot's configuration instead of using these as literal values - Fix ConfigManager to detect local file paths (e.g., .gguf, .bin, ../) and fall back to default bot's model when using remote API, allowing bots to keep local model config for local LLM server while automatically using remote model for API calls - Fix get_default_bot() to return the bot actually named 'default' instead of the first active bot by ID, ensuring consistent fallback behavior - Add comprehensive debug logging to trace LLM configuration from database to API call This fixes the issue where bots with incomplete or local LLM configuration would fail with 401/400 errors when trying to use remote API, instead of automatically falling back to the default bot's configuration from config.csv. Closes: #llm-config-fallback	2026-02-02 19:20:37 -03:00
Rodrigo Rodriguez (Pragmatismo)	51c8a53a90	Enable LLM feature by default and fix compilation errors - Add llm to default features in Cargo.toml - Fix duplicate smart_router module declaration - Remove unused LLMProvider import and fix unused variable warnings - Fix move error in enhanced_llm.rs by cloning state separately for each closure - Improve code formatting and consistency	2026-01-28 16:58:14 -03:00
Rodrigo Rodriguez (Pragmatismo)	9087bb17cd	feat: complete General Bots 7.0 (v6.2.0)	2026-01-25 10:29:54 -03:00
Rodrigo Rodriguez (Pragmatismo)	fc9c07d204	Refactor schema.rs into modular files	2026-01-22 13:57:40 -03:00
Rodrigo Rodriguez (Pragmatismo)	38f9abb7db	Fix organizations foreign key reference (org_id not id)	2025-12-29 08:07:42 -03:00
Rodrigo Rodriguez (Pragmatismo)	c67aaa677a	feat(security): Complete security infrastructure implementation SECURITY MODULES ADDED: - security/auth.rs: Full RBAC with roles (Anonymous, User, Moderator, Admin, SuperAdmin, Service, Bot, BotOwner, BotOperator, BotViewer) and permissions - security/cors.rs: Hardened CORS (no wildcard in production, env-based config) - security/panic_handler.rs: Panic catching middleware with safe 500 responses - security/path_guard.rs: Path traversal protection, null byte prevention - security/request_id.rs: UUID request tracking with correlation IDs - security/error_sanitizer.rs: Sensitive data redaction from responses - security/zitadel_auth.rs: Zitadel token introspection and role mapping - security/sql_guard.rs: SQL injection prevention with table whitelist - security/command_guard.rs: Command injection prevention - security/secrets.rs: Zeroizing secret management - security/validation.rs: Input validation utilities - security/rate_limiter.rs: Rate limiting with governor crate - security/headers.rs: Security headers (CSP, HSTS, X-Frame-Options) MAIN.RS UPDATES: - Replaced tower_http::cors::Any with hardened create_cors_layer() - Added panic handler middleware - Added request ID tracking middleware - Set global panic hook SECURITY STATUS: - 0 unwrap() in production code - 0 panic! in production code - 0 unsafe blocks - cargo audit: PASS (no vulnerabilities) - Estimated completion: ~98% Remaining: Wire auth middleware to handlers, audit logs for sensitive data	2025-12-28 19:29:18 -03:00
Rodrigo Rodriguez (Pragmatismo)	14b7cf70af	feat(autotask): Implement AutoTask system with intent classification and app generation - Add IntentClassifier with 7 intent types (APP_CREATE, TODO, MONITOR, ACTION, SCHEDULE, GOAL, TOOL) - Add AppGenerator with LLM-powered app structure analysis - Add DesignerAI for modifying apps through conversation - Add app_server for serving generated apps with clean URLs - Add db_api for CRUD operations on bot database tables - Add ask_later keyword for pending info collection - Add migration 6.1.1 with tables: pending_info, auto_tasks, execution_plans, task_approvals, task_decisions, safety_audit_log, generated_apps, intent_classifications, designer_changes - Write apps to S3 drive and sync to SITE_ROOT for serving - Clean URL structure: /apps/{app_name}/ - Integrate with DriveMonitor for file sync Based on Chapter 17 - Autonomous Tasks specification	2025-12-27 21:10:09 -03:00
Rodrigo Rodriguez (Pragmatismo)	5da86bbef2	Fix clippy warnings: match arms, async/await, Debug impls, formatting - Fix match arms with identical bodies by consolidating patterns - Fix case-insensitive file extension comparisons using eq_ignore_ascii_case - Fix unnecessary Debug formatting in log/format macros - Fix clone_from usage instead of clone assignment - Fix let...else patterns where appropriate - Fix format! append to String using write! macro - Fix unwrap_or with function calls to use unwrap_or_else - Add missing fields to manual Debug implementations - Fix duplicate code in if blocks - Add type aliases for complex types - Rename struct fields to avoid common prefixes - Various other clippy warning fixes Note: Some 'unused async' warnings remain for functions that are called with .await but don't contain await internally - these are kept async for API compatibility.	2025-12-26 08:59:25 -03:00
Rodrigo Rodriguez (Pragmatismo)	6bc6a35948	fix: resolve all warnings - wire up services properly	2025-12-17 17:41:37 -03:00
Rodrigo Rodriguez (Pragmatismo)	340ac8840a	- New templates.	2025-12-03 07:15:54 -03:00
Rodrigo Rodriguez (Pragmatismo)	fd5a1ee1e1	Add implementation plan and multi-agent features This commit introduces comprehensive documentation and implementation for multi-agent orchestration capabilities: - Add IMPLEMENTATION-PLAN.md with 4-phase roadmap - Add Kubernetes deployment manifests (deployment.yaml, hpa.yaml) - Add database migrations for multi-agent tables (6.1.1, 6.1.2) - Implement A2A protocol for agent-to-agent communication - Implement user memory keywords for cross-session persistence - Implement model routing for dynamic L	2025-11-30 19:18:23 -03:00
Rodrigo Rodriguez (Pragmatismo)	1789952c5b	Remove unused sqlx dependency and related code The sqlx database library has been removed from the project along with associated database-specific code that was no longer being used. This includes removal of various sqlx-related dependencies from Cargo.lock and cleanup of database connection pool references.	2025-11-28 09:27:29 -03:00
Rodrigo Rodriguez (Pragmatismo)	82e92de011	Remove unused Python scripts and book.toml The scripts were development utilities for SVG processing and mdBook configuration that are no longer needed in the project.	2025-11-26 15:27:47 -03:00
Rodrigo Rodriguez (Pragmatismo)	9ea9360231	- Removing warnings.	2025-11-22 22:54:45 -03:00

1 2

71 commits