DriveMonitor polling may be consuming resources and interfering with
LLM response delivery. Disabling to isolate the chat pipeline.
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
Previously the recv_task awaited stream_response() directly, which
froze the entire WebSocket message receiver while the LLM ran (30s+).
This meant a second user message couldn't be processed until the
first LLM call finished: head-of-line blocking that locked the session.
Now stream_response runs in its own tokio::spawn, keeping recv_task
free to handle new messages immediately. Also fixed a borrow/lifetime
issue by cloning the response channel sender out of the lock scope.
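The borrow fix can be sketched in plain std (tokio's mpsc sender clones the same way): take the clone inside a short lock scope so the guard is never held or moved across the spawn. Names here are hypothetical stand-ins for the real session types.

```rust
use std::sync::{mpsc, Arc, Mutex};
use std::thread;

// Hypothetical stand-in for the lock-guarded session state.
struct Session {
    tx: mpsc::Sender<String>,
}

fn handle_message(session: &Arc<Mutex<Session>>, msg: String) {
    // Clone the sender inside a short lock scope; the guard is dropped
    // at the end of this statement, so it is never moved into the spawn.
    let tx = session.lock().unwrap().tx.clone();
    thread::spawn(move || {
        // Long-running work (the LLM call in the real code) no longer
        // blocks the receiver loop that called handle_message.
        let _ = tx.send(format!("reply to: {msg}"));
    });
}
```

The same shape applies with `tokio::spawn` and `tokio::sync::mpsc`: clone out of the lock, move the clone into the task.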
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
Without connect_timeout, reqwest can hang for the full 60s timeout
when the remote server is unreachable (DNS, TCP connect, etc.).
Now fails in 5s max for connection issues, 30s for full request.
This means one user's LLM failure no longer blocks new users for
a full minute — the channel closes quickly and the WebSocket is freed.
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
reqwest::Client::new() has no timeout — when external APIs (NVIDIA,
Groq, etc.) hang or throttle, the request blocks forever, freezing the
entire response pipeline for the user.
Also add std::time::Duration import to llm/mod.rs.
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
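Taken together, the two timeout commits amount to a client configured roughly like this (a configuration sketch requiring the reqwest crate; the 5s/30s values are the ones stated above, the exact call site in llm/mod.rs may differ):

```rust
use std::time::Duration;

// Sketch: connect_timeout bounds DNS/TCP connect, timeout caps the
// whole request, so neither an unreachable host nor a hung API can
// block the pipeline indefinitely.
fn build_llm_client() -> reqwest::Client {
    reqwest::Client::builder()
        .connect_timeout(Duration::from_secs(5))
        .timeout(Duration::from_secs(30))
        .build()
        .expect("failed to build HTTP client")
}
```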
The previous fix used Handle::current().block_on() which deadlocks when
the Rhai engine runs on a Tokio worker thread — it blocks the very
thread the async task needs to make progress.
New approach: spawn a dedicated background thread with its own
single-threaded Tokio runtime, communicate via mpsc channel with a
45s timeout. This completely isolates the LLM runtime from the
caller's runtime, eliminating any possibility of thread starvation
or nested-runtime deadlock.
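The channel-and-timeout mechanics can be sketched in pure std (function names hypothetical; in the real code the spawned thread builds a single-threaded Tokio runtime and block_on()s the async request instead of calling a blocking function directly):

```rust
use std::sync::mpsc;
use std::thread;
use std::time::Duration;

// Hypothetical stand-in for the LLM call.
fn call_llm_blocking(prompt: &str) -> String {
    format!("echo: {prompt}")
}

// Run the call on a dedicated thread and give the caller a hard cap,
// so a Rhai script running on a Tokio worker thread never blocks the
// runtime it depends on.
fn llm_with_timeout(prompt: String, timeout: Duration) -> Result<String, mpsc::RecvTimeoutError> {
    let (tx, rx) = mpsc::channel();
    thread::spawn(move || {
        // If the receiver timed out and dropped rx, send fails harmlessly.
        let _ = tx.send(call_llm_blocking(&prompt));
    });
    rx.recv_timeout(timeout)
}
```

`recv_timeout(Duration::from_secs(45))` gives the 45s bound described above; the worker thread simply finishes on its own if the caller has given up.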
Also remove unused 'trace' import from llm/mod.rs.
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
- Add fallback: skip files from indexed KB folders even when file_states is empty
- Add file_states_count to debug log to detect load failures
- Add indexed_kb_names set for quick KB folder lookup
- This prevents the infinite download loop when file_states.json fails to deserialize
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
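The fallback check can be sketched like this (shapes and names are hypothetical, and the sketch assumes the first path segment is the KB folder name):

```rust
use std::collections::{HashMap, HashSet};

// file_states maps a file path to its last-seen state; indexed_kb_names
// is the quick-lookup set from the commit above.
fn should_skip(
    path: &str,
    file_states: &HashMap<String, String>,
    indexed_kb_names: &HashSet<String>,
) -> bool {
    if file_states.contains_key(path) {
        return true; // already tracked
    }
    // Fallback: even if file_states failed to deserialize (empty map),
    // skip files inside KB folders already indexed in Qdrant, breaking
    // the infinite re-download loop.
    path.split('/')
        .next()
        .map_or(false, |kb| indexed_kb_names.contains(kb))
}
```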
- Don't skip entire GBKB scan when all KBs are indexed
- Instead, skip individual files that are already tracked (not new)
- This allows new PDFs added to existing KB folders to be detected and indexed
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
Implement ADD_SWITCHER keyword following the same pattern as ADD_SUGGESTION_TOOL:
- Created switcher.rs module with add_switcher_keyword() and clear_switchers_keyword()
- Added preprocessing to convert "ADD SWITCHER" to "ADD_SWITCHER"
- Added to keyword patterns and get_all_keywords()
- Stores switcher suggestions in Redis with type "switcher" and action "switch_context"
- Supports both "ADD SWITCHER" and "ADD_SWITCHER" syntax
🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude <noreply@anthropic.com>
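The preprocessing step can be sketched as a plain string rewrite (function name and the CLEAR SWITCHERS spelling are assumptions mirroring the ADD_SUGGESTION_TOOL pattern):

```rust
// Normalize the multi-word forms to the underscore keywords before the
// keyword patterns run, so both syntaxes are accepted.
fn preprocess_switcher(script: &str) -> String {
    script
        .replace("ADD SWITCHER", "ADD_SWITCHER")
        .replace("CLEAR SWITCHERS", "CLEAR_SWITCHERS")
}
```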
- Show thinking indicator while LLM is in reasoning mode
- Skip reasoning content (thinking text) from user response
- Only show actual HTML content after thinking ends
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
- GLM4.7 and Kimi K2.5 send the response in the 'reasoning_content' field; 'content' is null
- Prefer 'content' for normal models, fall back to 'reasoning_content' for reasoning models
- Fixes blank white screen when using z-ai/glm4.7 model
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
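The field preference amounts to a simple Option fallback (the struct is a hypothetical stand-in for the deserialized SSE chunk delta):

```rust
// Hypothetical delta shape read from each SSE chunk's JSON.
struct Delta {
    content: Option<String>,
    reasoning_content: Option<String>,
}

// Prefer `content`; GLM4.7 / Kimi K2.5 leave it null and put the answer
// in `reasoning_content`, so fall back to that.
fn visible_text(d: &Delta) -> Option<&str> {
    d.content.as_deref().or(d.reasoning_content.as_deref())
}
```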
SADD stores suggestions in a set (deduplicated) instead of a list (which accumulates duplicates).
get_suggestions now uses SMEMBERS instead of LRANGE. Removed the TODO about
clearing suggestions since SADD inherently prevents duplicates.
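The set semantics can be illustrated with std's HashSet, which behaves like SADD here (helper name hypothetical):

```rust
use std::collections::HashSet;

// Like Redis SADD: returns true when the member was new, false when it
// was already present, so duplicates never accumulate.
fn add_suggestion(set: &mut HashSet<String>, s: &str) -> bool {
    set.insert(s.to_string())
}
```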
- Move all preprocessing transforms (convert_multiword_keywords, preprocess_llm_keyword,
convert_while_wend_syntax, predeclare_variables) into BasicCompiler::preprocess_basic
so .ast files are fully preprocessed by Drive Monitor
- Replace ScriptService compile/compile_preprocessed/compile_tool_script with
single run(ast_content) that does engine.compile() + eval_ast_with_scope()
- Remove .bas fallback in tool_executor and start.bas paths - .ast only
- Remove dead code: preprocess_basic_script, normalize_variables_to_lowercase,
convert_save_for_tools, parse_save_parts, normalize_word
- Fix: USE KB 'cartas' in tool .ast now correctly converted to USE_KB('cartas')
during compilation, ensuring KB context injection works after tool execution
- Fix: add trace import in llm/mod.rs
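The USE KB rewrite noted above can be sketched as a per-line transform (function name hypothetical; the real pass lives in BasicCompiler::preprocess_basic):

```rust
// Convert BASIC-style `USE KB 'cartas'` into the Rhai-callable
// `USE_KB('cartas')` during compilation.
fn convert_use_kb(line: &str) -> String {
    match line.trim().strip_prefix("USE KB ") {
        Some(arg) => format!("USE_KB({arg})"),
        None => line.to_string(),
    }
}
```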
- Add tokio timeout to SSE stream reads in OpenAI client (60s)
- Prevents indefinite hang when Kimi/Nvidia stops responding
- Add scanning AtomicBool to prevent concurrent check_gbkb_changes calls
- Skip GBKB scan entirely when all KBs already indexed in Qdrant
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
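The concurrency guard can be sketched like this (a static for brevity; the real flag presumably lives on the monitor struct):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

static SCANNING: AtomicBool = AtomicBool::new(false);

// Returns true only for the caller that flipped the flag, so at most
// one check_gbkb_changes scan runs at a time.
fn try_begin_scan() -> bool {
    SCANNING
        .compare_exchange(false, true, Ordering::SeqCst, Ordering::SeqCst)
        .is_ok()
}

fn end_scan() {
    SCANNING.store(false, Ordering::SeqCst);
}
```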
- Log full LLM response preview (500 chars) with has_html detection
- Log WebSocket send with message type, completeness, and content preview
- Use clone() for chunk in BotResponse to ensure accurate logging
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
- Add BOTSERVER_BUILD_DATE env var to /api/health response
- Set build date during CI compilation via environment variable
- Enables checking deployed binary age without SSH access
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
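One way to bake the CI variable in at compile time is `option_env!` (a sketch; the actual wiring into /api/health may differ):

```rust
// Captured at compile time; None when CI did not set the variable, so
// local dev builds still compile.
const BUILD_DATE: Option<&str> = option_env!("BOTSERVER_BUILD_DATE");

fn build_date() -> &'static str {
    BUILD_DATE.unwrap_or("unknown")
}
```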
- Use last_modified timestamp instead of ETag for change detection
- Skip re-queueing KBs that are already indexed in Qdrant
- Preserve indexed status across scans when content unchanged
- Add normalize_etag helper for consistent ETag comparison
Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>
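The normalize_etag helper might look like this (a sketch: it strips the weak-validator prefix and surrounding quotes so `W/"abc"` and `"abc"` compare equal):

```rust
// Normalize an HTTP ETag for comparison: drop the `W/` weak prefix and
// the surrounding double quotes.
fn normalize_etag(etag: &str) -> &str {
    etag.trim_start_matches("W/").trim_matches('"')
}
```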
When a tool button like Cartas activates a KB via USE KB, instead of
returning just the tool result (empty/label), the handler now checks
whether the session has active KBs. If so, and the result is empty or
trivial, it falls through to the full LLM pipeline, which injects KB context.
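The decision can be sketched as a predicate (names hypothetical, and "trivial" is reduced to whitespace-only here; the real check may also treat short labels as trivial):

```rust
// Fall through to the full LLM pipeline only when the session has
// active KBs and the tool produced nothing worth returning.
fn should_fall_through_to_llm(tool_result: &str, active_kb_count: usize) -> bool {
    active_kb_count > 0 && tool_result.trim().is_empty()
}
```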