Install
Engines are installed and configured through the system module and provider catalog.
Engines
An engine implements configured runtime methods such as chat, generate stream, embeddings, STT, TTS, or image analysis. The application asks for a model or objective; the runtime resolves and invokes the provider.
Provider lifecycle
The system module handles install and activation. Feature code asks for a configured objective or model id.
Engines are installed and configured through the system module and provider catalog.
Models store capabilities, runtime methods, provider options, and performance defaults.
Model loading can be requested outside the UI process so first use does not freeze clients.
Invocation model
This keeps modules away from provider SDK details and lets the engine layer normalize local and remote runtimes.
Providers expose the methods they support. Test pages and UI should use those real method contracts.
OpenAI-compatible APIs, local runtimes, and specialized engines share the same high-level provider boundary.
Requests carry ids and context so failures, timings, and output can be traced.
Resource behavior
Local model runtimes can be expensive to load. The platform should keep those costs outside UI-critical paths.
Engine processes can hold model memory away from the main application/UI process.
A provider can be resolved first, then warmed up before the first user-facing invocation.
Context length, runtime options, and performance defaults belong to model/provider configuration rather than page code.
Implemented engines
These are the engine implementations exposed through the runtime/provider layer.
Remote APIs, OpenAI-compatible providers, and local LLM runtimes used for chat and streamed generation.
OpenAI
OpenAI-compatible
llama.cpp
vLLM
ONNXSpeech-to-text and text-to-speech providers used by composer voice input and audio generation flows.
Whisper
OpenAI Whisper
OpenAI TTS
Edge TTS
eSpeak
Parler TTS
Qwen TTSSpecialized engines for visual analysis and object detection workflows.
YOLONext
Use these pages to move from the concept to adjacent parts of the runtime.