High-Level Overview
The chatbot platform follows a microservices-inspired architecture with clear separation of concerns:
Core Components
1. Web Server Auth Middleware
Ensures correct authentication. The layer distinguishes between different user levels and restricts access accordingly.
- Login & docs routes are protected by UI api-keys
- Logged-in users are authenticated via JWT tokens
- Chat endpoints require JWT tokens or customer- or persona-specific api-keys
- Chat JWT tokens can be requested from the /api/chat/get_token endpoint with an api-key
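The credential distinction above can be sketched as follows. This is a minimal illustration assuming a hypothetical `Credential` enum and header names; the actual middleware types may differ.

```rust
/// Hypothetical credential type distinguishing the two accepted schemes.
#[derive(Debug, PartialEq)]
enum Credential {
    Jwt(String),
    ApiKey(String),
}

/// Extract a credential from request headers: a `Bearer` Authorization
/// header is treated as a JWT, an `x-api-key`-style header value as an api-key.
fn extract_credential(authorization: Option<&str>, api_key: Option<&str>) -> Option<Credential> {
    if let Some(auth) = authorization {
        if let Some(token) = auth.strip_prefix("Bearer ") {
            return Some(Credential::Jwt(token.to_string()));
        }
    }
    api_key.map(|k| Credential::ApiKey(k.to_string()))
}
```

A request carrying neither header yields `None` and is rejected before reaching the chat endpoints.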
2. Web Server Layer
- Purpose: HTTP API gateway and request routing
- Responsibilities:
- REST API endpoints for chat & user/admin operations
- Configuration management endpoints
- Request/response serialization
- Chat streaming support
3. Chat Engine
- Purpose: Core conversation management and AI integration
- Responsibilities:
- Chat session lifecycle management
- AI provider abstraction and integration
- Token streaming implementation
- Context assembly and prompt construction
- Tool execution and function calling
4. Configuration System
- Purpose: Dynamic persona/RAG/MCP and system configuration
- Responsibilities:
- Chat persona definitions
- Tool configurations
- Context document associations
- Chat parameters (temperature, max tokens, etc.)
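A persona definition bundles these pieces together. The sketch below uses hypothetical field names and defaults; the actual schema lives in the database.

```rust
/// Illustrative persona configuration; field names are assumptions.
struct PersonaConfig {
    name: String,
    system_prompt: String,
    temperature: f32,
    max_tokens: u32,
}

impl Default for PersonaConfig {
    fn default() -> Self {
        PersonaConfig {
            name: String::new(),
            system_prompt: String::new(),
            temperature: 0.7, // common default; the real value is configuration-driven
            max_tokens: 1024,
        }
    }
}
```

A concrete persona then overrides only the fields it cares about, with everything else falling back to defaults.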
5. RAG System
- Purpose: Context retrieval and document management
- Responsibilities:
- Document ingestion and vectorization
- Semantic search and retrieval
- Context ranking and filtering
- Document metadata management
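The ranking step can be illustrated with cosine similarity plus top-k selection. In the real system Qdrant performs this search over stored embeddings; this sketch only shows the underlying idea.

```rust
/// Cosine similarity between two embedding vectors.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
}

/// Rank documents by similarity to the query embedding and keep the top k.
fn top_k<'a>(query: &[f32], docs: &'a [(&'a str, Vec<f32>)], k: usize) -> Vec<&'a str> {
    let mut scored: Vec<(&str, f32)> = docs
        .iter()
        .map(|(id, emb)| (*id, cosine_similarity(query, emb)))
        .collect();
    // Sort descending by score, then truncate to k hits.
    scored.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    scored.into_iter().take(k).map(|(id, _)| id).collect()
}
```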
6. Data Layer
- PostgreSQL: Configuration, chat history, user management, document storage
- Qdrant Vector Database: Document embeddings and similarity search
- File System: Static assets
Data Flow
Chat Request Flow
- Request Reception: Web server receives chat request with persona ID
- Authentication: Middleware extracts and verifies the api-key or JWT
- Chat Routing:
- Chat invocation uses an api-key or a JWT token that contains all information required to identify the persona.
- Database access is typically not required for request routing.
- Chat personas are cached or loaded on demand from PostgreSQL.
- Session context is cached.
- Request Moderation: If configured for the persona, the request is checked for inappropriate content.
- Context Assembly: Query RAG system for relevant context
- Prompt Construction: Build complete prompt with system, context, and user message
- AI Processing: Send to AI provider (streaming or batch)
- Tool Handling: MCP/function call requests from the AI are executed; follow-up LLM requests may be issued with the function results.
- Response Handling: Process and return/stream response to client
- History Storage: Save (brief) conversation history to session/database.
- Accounting: Token counts and related usage metrics are recorded.
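The prompt-construction step above (system prompt + retrieved context + user message) can be sketched as follows, assuming a simple message-list representation; the real chat engine builds provider-specific payloads.

```rust
/// Minimal chat message; role names follow the common system/user convention.
struct Message {
    role: &'static str,
    content: String,
}

/// Assemble the final prompt from the persona's system prompt,
/// the RAG context, and the user's message.
fn build_prompt(system: &str, context_docs: &[&str], user: &str) -> Vec<Message> {
    let mut sys = system.to_string();
    // Retrieved context is appended to the system message.
    if !context_docs.is_empty() {
        sys.push_str("\n\nContext:\n");
        sys.push_str(&context_docs.join("\n---\n"));
    }
    vec![
        Message { role: "system", content: sys },
        Message { role: "user", content: user.to_string() },
    ]
}
```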
Configuration Management Flow
- CRUD Operations: Create/read/update/delete persona configurations
- Validation: Ensure configuration integrity and required fields
- Hot Reload: Update active configurations without restart
- Versioning: Track configuration changes and rollback capability
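The validation step can be sketched as below, assuming hypothetical field names and limits; the actual rules are defined by the configuration system.

```rust
/// Illustrative persona configuration; field names are assumptions.
struct PersonaConfig {
    name: String,
    system_prompt: String,
    temperature: f32,
}

/// Reject configurations missing required fields or with out-of-range parameters.
fn validate(config: &PersonaConfig) -> Result<(), String> {
    if config.name.trim().is_empty() {
        return Err("persona name must not be empty".to_string());
    }
    if config.system_prompt.trim().is_empty() {
        return Err("system prompt must not be empty".to_string());
    }
    if !(0.0..=2.0).contains(&config.temperature) {
        return Err("temperature must be between 0.0 and 2.0".to_string());
    }
    Ok(())
}
```

Running validation on every create/update keeps invalid personas out of the hot-reload path.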
Key Design Principles
1. Async-First
- All I/O operations are asynchronous using tokio
- Non-blocking database operations
- Concurrent request handling
- Streaming responses where applicable
2. Configuration-Driven
- All chat behavior controlled by database configuration
- No hardcoded prompts or parameters
- Dynamic persona switching
- Runtime configuration updates
3. Provider Agnostic
- Abstract AI provider interface
- Support multiple AI backends
- Easy provider switching
- Consistent API regardless of backend
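A provider-agnostic interface can be sketched with a trait; `AiProvider` and the toy backends here are hypothetical (the real abstraction is async and streaming), but the principle is the same: callers depend only on the trait, so backends can be swapped freely.

```rust
/// Hypothetical provider interface; the real one is async and streams tokens.
trait AiProvider {
    fn name(&self) -> &str;
    fn complete(&self, prompt: &str) -> String;
}

/// Toy backend that echoes the prompt.
struct EchoProvider;
impl AiProvider for EchoProvider {
    fn name(&self) -> &str { "echo" }
    fn complete(&self, prompt: &str) -> String { format!("echo: {prompt}") }
}

/// Toy backend that upper-cases the prompt.
struct UpperProvider;
impl AiProvider for UpperProvider {
    fn name(&self) -> &str { "upper" }
    fn complete(&self, prompt: &str) -> String { prompt.to_uppercase() }
}

/// Caller code is identical regardless of which backend is plugged in.
fn ask(provider: &dyn AiProvider, prompt: &str) -> String {
    provider.complete(prompt)
}
```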
4. Scalable Architecture
- Stateless web server design
- Horizontal scaling capability
- Efficient resource utilization
- Caching strategies for performance
Security Considerations
- API authentication and authorization
- Input validation and sanitization
- SQL injection prevention
- Rate limiting and abuse prevention
- Secure configuration storage
- Audit logging for compliance
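As one concrete example of input validation, a message sanitizer might look like the sketch below; the limits and rules are assumptions, and real sanitization also relies on parameterized SQL queries and rate limiting.

```rust
/// Strip control characters and enforce basic size rules on a chat message.
fn sanitize_message(input: &str, max_len: usize) -> Result<String, String> {
    // Remove control characters that could corrupt logs or prompts,
    // while keeping ordinary whitespace.
    let cleaned: String = input
        .chars()
        .filter(|c| !c.is_control() || *c == '\n' || *c == '\t')
        .collect();
    if cleaned.trim().is_empty() {
        return Err("message is empty".to_string());
    }
    if cleaned.chars().count() > max_len {
        return Err("message too long".to_string());
    }
    Ok(cleaned)
}
```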
Performance Targets
- Latency: < 200ms for configuration retrieval
- Throughput: 1000+ concurrent chat sessions
- Streaming: < 50ms first token latency
- Database: < 10ms query response time
- Memory: Efficient resource usage with connection pooling