GeneralBots/botbook

Fork 0

Rodrigo Rodriguez (Pragmatismo) 0cf9736944 Update: General project updates

2025-12-06 11:09:12 -03:00

25 KiB

Raw Blame History

Configuration Parameters

Complete reference of all available parameters in config.csv.

Server Parameters

Web Server

Parameter	Description	Default	Type
`server-host`	Server bind address	`0.0.0.0`	IP address
`server-port`	Server listen port	`8080`	Number (1-65535)
`sites-root`	Generated sites directory	`/tmp`	Path

MCP Server

Parameter	Description	Default	Type
`mcp-server`	Enable MCP protocol server	`false`	Boolean

LLM Parameters

Core LLM Settings

Parameter	Description	Default	Type
`llm-key`	API key for LLM service	`none`	String
`llm-url`	LLM service endpoint	`http://localhost:8081`	URL
`llm-model`	Model path or identifier	Required	Path/String
`llm-models`	Available model aliases for routing	`default`	Semicolon-separated

LLM Cache

Parameter	Description	Default	Type
`llm-cache`	Enable response caching	`false`	Boolean
`llm-cache-ttl`	Cache time-to-live	`3600`	Seconds
`llm-cache-semantic`	Semantic similarity cache	`true`	Boolean
`llm-cache-threshold`	Similarity threshold	`0.95`	Float (0-1)

Embedded LLM Server

Parameter	Description	Default	Type
`llm-server`	Run embedded server	`false`	Boolean
`llm-server-path`	Server binary path	`botserver-stack/bin/llm/build/bin`	Path
`llm-server-host`	Server bind address	`0.0.0.0`	IP address
`llm-server-port`	Server port	`8081`	Number
`llm-server-gpu-layers`	GPU offload layers	`0`	Number
`llm-server-n-moe`	MoE experts count	`0`	Number
`llm-server-ctx-size`	Context size	`4096`	Tokens
`llm-server-n-predict`	Max predictions	`1024`	Tokens
`llm-server-parallel`	Parallel requests	`6`	Number
`llm-server-cont-batching`	Continuous batching	`true`	Boolean
`llm-server-mlock`	Lock in memory	`false`	Boolean
`llm-server-no-mmap`	Disable mmap	`false`	Boolean
`llm-server-reasoning-format`	Reasoning output format for llama.cpp	`none`	String

Hardware-Specific LLM Tuning

For RTX 3090 (24GB VRAM)

You can run impressive models with proper configuration:

DeepSeek-R3-Distill-Qwen-7B: Set llm-server-gpu-layers to 35-40
Qwen2.5-32B-Instruct (Q4_K_M): Fits with llm-server-gpu-layers to 40-45
DeepSeek-V3 (with MoE): Set llm-server-n-moe to 2-4 to run even 120B models! MoE only loads active experts
Optimization: Use llm-server-ctx-size of 8192 for longer contexts

For RTX 4070/4070Ti (12-16GB VRAM)

Mid-range cards work great with quantized models:

Qwen2.5-14B (Q4_K_M): Set llm-server-gpu-layers to 25-30
DeepSeek-R3-Distill-Llama-8B: Fully fits with layers at 32
Tips: Keep llm-server-ctx-size at 4096 to save VRAM

For CPU-Only (No GPU)

Modern CPUs can still run capable models:

DeepSeek-R3-Distill-Qwen-1.5B: Fast on CPU, great for testing
Phi-3-mini (3.8B): Excellent CPU performance
Settings: Set llm-server-mlock to true to prevent swapping
Parallel: Increase llm-server-parallel to CPU cores -2

Recommended Models (GGUF Format)

Best Overall: DeepSeek-R3-Distill series (1.5B to 70B)
Best Small: Qwen2.5-3B-Instruct-Q5_K_M
Best Medium: DeepSeek-R3-Distill-Qwen-14B-Q4_K_M
Best Large: DeepSeek-V3, Qwen2.5-32B, or GPT2-120B-GGUF (with MoE enabled)

Pro Tip: The llm-server-n-moe parameter is magic for large models - it enables Mixture of Experts, letting you run 120B+ models on consumer hardware by only loading the experts needed for each token!

Local vs Cloud: A Practical Note

General Bots excels at local deployment - you own your hardware, your data stays private, and there are no recurring costs. However, if you need cloud inference:

Groq is the speed champion - They use custom LPU (Language Processing Unit) chips instead of GPUs, delivering 10x faster inference than traditional cloud providers. Their hardware is purpose-built for transformers, avoiding the general-purpose overhead of NVIDIA GPUs.

This isn't about market competition - it's about architecture. NVIDIA GPUs are designed for many tasks, while Groq's chips do one thing incredibly well: transformer inference. If speed matters and you're using cloud, Groq is currently the fastest option available.

For local deployment, stick with General Bots and the configurations above. For cloud bursts or when you need extreme speed, consider Groq's API with these settings:

llm-url,https://api.groq.com/openai/v1
llm-key,your-groq-api-key
llm-model,mixtral-8x7b-32768

Embedding Parameters

Parameter	Description	Default	Type
`embedding-url`	Embedding service endpoint	`http://localhost:8082`	URL
`embedding-model`	Embedding model path	Required for KB	Path

Email Parameters

Parameter	Description	Default	Type
`email-from`	Sender address	Required for email	Email
`email-server`	SMTP hostname	Required for email	Hostname
`email-port`	SMTP port	`587`	Number
`email-user`	SMTP username	Required for email	String
`email-pass`	SMTP password	Required for email	String
`email-read-pixel`	Enable read tracking pixel in HTML emails	`false`	Boolean

Email Read Tracking

When email-read-pixel is enabled, a 1x1 transparent tracking pixel is automatically injected into HTML emails sent via the API. This allows you to:

Track when emails are opened
See how many times an email was opened
Get the approximate location (IP) and device (user agent) of the reader

API Endpoints for tracking:

Endpoint	Method	Description
`/api/email/tracking/pixel/{tracking_id}`	GET	Serves the tracking pixel (called by email client)
`/api/email/tracking/status/{tracking_id}`	GET	Get read status for a specific email
`/api/email/tracking/list`	GET	List all sent emails with tracking status
`/api/email/tracking/stats`	GET	Get overall tracking statistics

Example configuration:

email-read-pixel,true
server-url,https://yourdomain.com

Note: The server-url parameter is used to generate the tracking pixel URL. Make sure it's accessible from the recipient's email client.

Privacy considerations: Email tracking should be used responsibly. Consider disclosing tracking in your email footer for transparency.

Theme Parameters

Parameter	Description	Default	Type
`theme-color1`	Primary color	Not set	Hex color
`theme-color2`	Secondary color	Not set	Hex color
`theme-logo`	Logo URL	Not set	URL
`theme-title`	Bot display title	Not set	String
`bot-name`	Bot display name	Not set	String
`welcome-message`	Initial greeting message	Not set	String

Custom Database Parameters

These parameters configure external database connections for use with BASIC keywords like MariaDB/MySQL connections.

Parameter	Description	Default	Type
`custom-server`	Database server hostname	`localhost`	Hostname
`custom-port`	Database port	`5432`	Number
`custom-database`	Database name	Not set	String
`custom-username`	Database user	Not set	String
`custom-password`	Database password	Not set	String

Website Crawling Parameters

Parameter	Description	Default	Type
`website-expires`	Cache expiration for crawled content	`1d`	Duration
`website-max-depth`	Maximum crawl depth	`3`	Number
`website-max-pages`	Maximum pages to crawl	`100`	Number

Image Generator Parameters

Parameter	Description	Default	Type
`image-generator-model`	Diffusion model path	Not set	Path
`image-generator-steps`	Inference steps	`4`	Number
`image-generator-width`	Output width	`512`	Pixels
`image-generator-height`	Output height	`512`	Pixels
`image-generator-gpu-layers`	GPU offload layers	`20`	Number
`image-generator-batch-size`	Batch size	`1`	Number

Video Generator Parameters

Parameter	Description	Default	Type
`video-generator-model`	Video model path	Not set	Path
`video-generator-frames`	Frames to generate	`24`	Number
`video-generator-fps`	Frames per second	`8`	Number
`video-generator-width`	Output width	`320`	Pixels
`video-generator-height`	Output height	`576`	Pixels
`video-generator-gpu-layers`	GPU offload layers	`15`	Number
`video-generator-batch-size`	Batch size	`1`	Number

BotModels Service Parameters

Parameter	Description	Default	Type
`botmodels-enabled`	Enable BotModels service	`true`	Boolean
`botmodels-host`	BotModels bind address	`0.0.0.0`	IP address
`botmodels-port`	BotModels port	`8085`	Number

Generator Parameters

Parameter	Description	Default	Type
`default-generator`	Default content generator	`all`	String

Teams Channel Parameters

Parameter	Description	Default	Type
`teams-app-id`	Microsoft Teams App ID	Not set	String
`teams-app-password`	Microsoft Teams App Password	Not set	String
`teams-tenant-id`	Microsoft Teams Tenant ID	Not set	String
`teams-bot-id`	Microsoft Teams Bot ID	Not set	String

SMS Parameters

Parameter	Description	Default	Type
`sms-provider`	SMS provider (`twilio`, `aws`, `vonage`, `messagebird`, `custom`)	Not set	String
`sms-fallback-provider`	Fallback provider if primary fails	Not set	String

Twilio Parameters

Parameter	Description	Default	Type
`twilio-account-sid`	Twilio Account SID	Not set	String
`twilio-auth-token`	Twilio Auth Token	Not set	String
`twilio-phone-number`	Twilio phone number (E.164 format)	Not set	String
`twilio-messaging-service-sid`	Messaging Service SID for routing	Not set	String
`twilio-status-callback`	Webhook URL for delivery status	Not set	URL

Parameter	Description	Default	Type
`aws-access-key-id`	AWS Access Key ID	Not set	String
`aws-secret-access-key`	AWS Secret Access Key	Not set	String
`aws-region`	AWS Region (e.g., `us-east-1`)	Not set	String
`aws-sns-sender-id`	Sender ID (alphanumeric)	Not set	String
`aws-sns-message-type`	`Promotional` or `Transactional`	`Transactional`	String

Vonage (Nexmo) Parameters

Parameter	Description	Default	Type
`vonage-api-key`	Vonage API Key	Not set	String
`vonage-api-secret`	Vonage API Secret	Not set	String
`vonage-from`	Sender number or alphanumeric ID	Not set	String
`vonage-callback-url`	Delivery receipt webhook	Not set	URL

MessageBird Parameters

Parameter	Description	Default	Type
`messagebird-access-key`	MessageBird Access Key	Not set	String
`messagebird-originator`	Sender number or name	Not set	String
`messagebird-report-url`	Status report webhook	Not set	URL

Custom Provider Parameters

Parameter	Description	Default	Type
`sms-custom-url`	API endpoint URL	Not set	URL
`sms-custom-method`	HTTP method (`POST`, `GET`)	`POST`	String
`sms-custom-auth-header`	Authorization header value	Not set	String
`sms-custom-body-template`	JSON body with `{{to}}`, `{{message}}` placeholders	Not set	String
`sms-custom-from`	Sender number for custom provider	Not set	String

Example: Twilio Configuration

sms-provider,twilio
twilio-account-sid,ACxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
twilio-auth-token,your_auth_token
twilio-phone-number,+15551234567

sms-provider,aws
aws-access-key-id,AKIAIOSFODNN7EXAMPLE
aws-secret-access-key,wJalrXUtnFEMI/K7MDENG/bPxRfiCYEXAMPLEKEY
aws-region,us-east-1
aws-sns-message-type,Transactional

See SMS Provider Configuration for detailed setup instructions.

WhatsApp Parameters

Parameter	Description	Default	Type
`whatsapp-api-key`	Access token from Meta Business	Not set	String
`whatsapp-phone-number-id`	Phone number ID from WhatsApp Business	Not set	String
`whatsapp-verify-token`	Token for webhook verification	Not set	String
`whatsapp-business-account-id`	WhatsApp Business Account ID	Not set	String
`whatsapp-api-version`	Graph API version	`v17.0`	String

Example: WhatsApp Configuration

whatsapp-api-key,EAABs...your_access_token
whatsapp-phone-number-id,123456789012345
whatsapp-verify-token,my-secret-verify-token
whatsapp-business-account-id,987654321098765

See WhatsApp Channel Configuration for detailed setup instructions.

Multi-Agent Parameters

Agent-to-Agent (A2A) Communication

Parameter	Description	Default	Type
`a2a-enabled`	Enable agent-to-agent communication	`true`	Boolean
`a2a-timeout`	Default delegation timeout	`30`	Seconds
`a2a-max-hops`	Maximum delegation chain depth	`5`	Number
`a2a-retry-count`	Retry attempts on failure	`3`	Number
`a2a-queue-size`	Maximum pending messages	`100`	Number
`a2a-protocol-version`	A2A protocol version	`1.0`	String
`a2a-persist-messages`	Persist A2A messages to database	`false`	Boolean

Bot Reflection

Parameter	Description	Default	Type
`bot-reflection-enabled`	Enable bot self-analysis	`true`	Boolean
`bot-reflection-interval`	Messages between reflections	`10`	Number
`bot-reflection-prompt`	Custom reflection prompt	(none)	String
`bot-reflection-types`	Reflection types to perform	`ConversationQuality`	Semicolon-separated
`bot-improvement-auto-apply`	Auto-apply suggested improvements	`false`	Boolean
`bot-improvement-threshold`	Score threshold for improvements (0-10)	`6.0`	Float

Reflection Types

Available values for bot-reflection-types:

ConversationQuality - Analyze conversation quality and user satisfaction
ResponseAccuracy - Analyze response accuracy and relevance
ToolUsage - Analyze tool usage effectiveness
KnowledgeRetrieval - Analyze knowledge retrieval performance
Performance - Analyze overall bot performance

Example:

bot-reflection-enabled,true
bot-reflection-interval,10
bot-reflection-types,ConversationQuality;ResponseAccuracy;ToolUsage
bot-improvement-auto-apply,false
bot-improvement-threshold,7.0

Memory Parameters

User Memory (Cross-Bot)

Parameter	Description	Default	Type
`user-memory-enabled`	Enable user-level memory	`true`	Boolean
`user-memory-max-keys`	Maximum keys per user	`1000`	Number
`user-memory-default-ttl`	Default time-to-live (0=no expiry)	`0`	Seconds

Episodic Memory (Context Compaction)

Parameter	Description	Default	Type
`episodic-memory-enabled`	Enable episodic memory system	`true`	Boolean
`episodic-memory-threshold`	Exchanges before compaction triggers	`4`	Number
`episodic-memory-history`	Recent exchanges to keep in full	`2`	Number
`episodic-memory-model`	Model for summarization	`fast`	String
`episodic-memory-max-episodes`	Maximum episodes per user	`100`	Number
`episodic-memory-retention-days`	Days to retain episodes	`365`	Number
`episodic-memory-auto-summarize`	Enable automatic summarization	`true`	Boolean

Episodic memory automatically manages conversation context to stay within LLM token limits. When conversation exchanges exceed episodic-memory-threshold, older messages are summarized and only the last episodic-memory-history exchanges are kept in full. See Chapter 03 - Episodic Memory for details.

Model Routing Parameters

These parameters configure multi-model routing for different task types. Requires multiple llama.cpp server instances.

Parameter	Description	Default	Type
`llm-models`	Available model aliases	`default`	Semicolon-separated
`model-routing-strategy`	Routing strategy (manual/auto/load-balanced/fallback)	`auto`	String
`model-default`	Default model alias	`default`	String
`model-fast`	Model for fast/simple tasks	(configured)	Path/String
`model-quality`	Model for quality/complex tasks	(configured)	Path/String
`model-code`	Model for code generation	(configured)	Path/String
`model-fallback-enabled`	Enable automatic fallback	`true`	Boolean
`model-fallback-order`	Order to try on failure	`quality,fast,local`	Comma-separated

Multi-Model Example

llm-models,default;fast;quality;code
llm-url,http://localhost:8081
model-routing-strategy,auto
model-default,fast
model-fallback-enabled,true
model-fallback-order,quality,fast

Hybrid RAG Search Parameters

General Bots uses hybrid search combining dense (embedding) and sparse (BM25 keyword) search for optimal retrieval. The BM25 implementation is powered by Tantivy, a full-text search engine library similar to Apache Lucene.

Parameter	Description	Default	Type
`rag-hybrid-enabled`	Enable hybrid dense+sparse search	`true`	Boolean
`rag-dense-weight`	Weight for semantic results	`0.7`	Float (0-1)
`rag-sparse-weight`	Weight for keyword results	`0.3`	Float (0-1)
`rag-reranker-enabled`	Enable LLM reranking	`false`	Boolean
`rag-reranker-model`	Model for reranking	`cross-encoder/ms-marco-MiniLM-L-6-v2`	String
`rag-reranker-top-n`	Candidates for reranking	`20`	Number
`rag-max-results`	Maximum results to return	`10`	Number
`rag-min-score`	Minimum relevance score threshold	`0.0`	Float (0-1)
`rag-rrf-k`	RRF smoothing constant	`60`	Number
`rag-cache-enabled`	Enable search result caching	`true`	Boolean
`rag-cache-ttl`	Cache time-to-live	`3600`	Seconds

BM25 Sparse Search (Tantivy)

BM25 is a keyword-based ranking algorithm that excels at finding exact term matches. It's powered by Tantivy when the vectordb feature is enabled.

Parameter	Description	Default	Type
`bm25-enabled`	Enable/disable BM25 sparse search	`true`	Boolean
`bm25-k1`	Term frequency saturation (0.5-3.0 typical)	`1.2`	Float
`bm25-b`	Document length normalization (0.0-1.0)	`0.75`	Float
`bm25-stemming`	Apply word stemming (running→run)	`true`	Boolean
`bm25-stopwords`	Filter common words (the, a, is)	`true`	Boolean

Switching Search Modes

Hybrid Search (Default - Best for most use cases)

bm25-enabled,true
rag-dense-weight,0.7
rag-sparse-weight,0.3

Uses both semantic understanding AND keyword matching. Best for general queries.

Dense Only (Semantic Search)

bm25-enabled,false
rag-dense-weight,1.0
rag-sparse-weight,0.0

Uses only embedding-based search. Faster, good for conceptual/semantic queries where exact words don't matter.

Sparse Only (Keyword Search)

bm25-enabled,true
rag-dense-weight,0.0
rag-sparse-weight,1.0

Uses only BM25 keyword matching. Good for exact term searches, technical documentation, or when embeddings aren't available.

BM25 Parameter Tuning

The k1 and b parameters control BM25 behavior:

bm25-k1 (Term Saturation): Controls how much additional term occurrences contribute to the score
- Lower values (0.5-1.0): Diminishing returns for repeated terms
- Higher values (1.5-2.0): More weight to documents with many term occurrences
- Default 1.2 works well for most content
bm25-b (Length Normalization): Controls document length penalty
- 0.0: No length penalty (long documents scored equally)
- 1.0: Full length normalization (strongly penalizes long documents)
- Default 0.75 balances length fairness

Tuning for specific content:

# For short documents (tweets, titles)
bm25-b,0.3

# For long documents (articles, manuals)
bm25-b,0.9

# For code search (exact matches important)
bm25-k1,1.5
bm25-stemming,false

Code Sandbox Parameters

Parameter	Description	Default	Type
`sandbox-enabled`	Enable code sandbox	`true`	Boolean
`sandbox-runtime`	Isolation backend (lxc/docker/firecracker/process)	`lxc`	String
`sandbox-timeout`	Maximum execution time	`30`	Seconds
`sandbox-memory-mb`	Memory limit in megabytes	`256`	MB
`sandbox-cpu-percent`	CPU usage limit	`50`	Percent
`sandbox-network`	Allow network access	`false`	Boolean
`sandbox-python-packages`	Pre-installed Python packages	(none)	Comma-separated
`sandbox-allowed-paths`	Accessible filesystem paths	`/data,/tmp`	Comma-separated

Example: Python Sandbox

sandbox-enabled,true
sandbox-runtime,lxc
sandbox-timeout,60
sandbox-memory-mb,512
sandbox-cpu-percent,75
sandbox-network,false
sandbox-python-packages,numpy,pandas,requests,matplotlib
sandbox-allowed-paths,/data,/tmp,/uploads

SSE Streaming Parameters

Parameter	Description	Default	Type
`sse-enabled`	Enable Server-Sent Events	`true`	Boolean
`sse-heartbeat`	Heartbeat interval	`30`	Seconds
`sse-max-connections`	Maximum concurrent connections	`1000`	Number

Parameter Types

Boolean

Values: true or false (case-sensitive)

Number

Integer values, must be within valid ranges:

Ports: 1-65535
Tokens: Positive integers
Percentages: 0-100

Float

Decimal values:

Thresholds: 0.0 to 1.0
Weights: 0.0 to 1.0

Path

File system paths:

Relative: ../../../../data/model.gguf
Absolute: /opt/models/model.gguf

URL

Valid URLs:

HTTP: http://localhost:8081
HTTPS: https://api.example.com

String

Any text value (no quotes needed in CSV)

Email

Valid email format: user@domain.com

Hex Color

HTML color codes: #RRGGBB format

Semicolon-separated

Multiple values separated by semicolons: value1;value2;value3

Comma-separated

Multiple values separated by commas: value1,value2,value3

Required vs Optional

Always Required

None - all parameters have defaults or are optional

Required for Features

LLM: llm-model must be set
Email: email-from, email-server, email-user
Embeddings: embedding-model for knowledge base
Custom DB: custom-database if using external database

Configuration Precedence

Built-in defaults (hardcoded)
config.csv values (override defaults)
Environment variables (if implemented, override config)

Special Values

none - Explicitly no value (for llm-key)
Empty string - Unset/use default
false - Feature disabled
true - Feature enabled

Performance Tuning

For Local Models

llm-server-ctx-size,8192
llm-server-n-predict,2048
llm-server-parallel,4
llm-cache,true
llm-cache-ttl,7200

For Production

llm-server-cont-batching,true
llm-cache-semantic,true
llm-cache-threshold,0.90
llm-server-parallel,8
sse-max-connections,5000

For Low Memory

llm-server-ctx-size,2048
llm-server-n-predict,512
llm-server-mlock,false
llm-server-no-mmap,false
llm-cache,false
sandbox-memory-mb,128

For Multi-Agent Systems

a2a-enabled,true
a2a-timeout,30
a2a-max-hops,5
a2a-retry-count,3
a2a-persist-messages,true
bot-reflection-enabled,true
bot-reflection-interval,10
user-memory-enabled,true

For Hybrid RAG

rag-hybrid-enabled,true
rag-dense-weight,0.7
rag-sparse-weight,0.3
rag-reranker-enabled,true
rag-max-results,10
rag-min-score,0.3
rag-cache-enabled,true
bm25-enabled,true
bm25-k1,1.2
bm25-b,0.75

For Dense-Only Search (Faster)

bm25-enabled,false
rag-dense-weight,1.0
rag-sparse-weight,0.0
rag-max-results,10

For Code Execution

sandbox-enabled,true
sandbox-runtime,lxc
sandbox-timeout,30
sandbox-memory-mb,512
sandbox-network,false
sandbox-python-packages,numpy,pandas,requests

Validation Rules

Paths: Model files must exist
URLs: Must be valid format
Ports: Must be 1-65535
Emails: Must contain @ and domain
Colors: Must be valid hex format
Booleans: Exactly true or false
Weights: Must sum to 1.0 (e.g., rag-dense-weight + rag-sparse-weight)

25 KiB Raw Blame History