This page covers information that is more relevant to self-hosting.
For a more comprehensive overview of how data is processed in Onyx,
check out Data Flows and Data Storage
All data processing occurs within your infrastructure, ensuring complete control over your data.
No sensitive data leaves your network except for the specific communications you configure.Note, anonymous telemetry is enabled by default but can be turned off at deployment time.
LLM Inference: Queries, chat history, and documents sent to the LLM for processing.
Actions: Calls to external APIs configured by the admin users in your Onyx deployment.
Web Search: Queries passed to a search provider and scraper of choice.
You can also configure external document processing services, embedding/reranking APIs, image captioning models, etc.
but these are purely optional and have built-in equivalents.