In the middle of an incident, the bottleneck is not data collection. The bottleneck is asking the right question fast, and trusting the answer. At Qonto, we built an AI-assisted observability workflow where engineers can ask natural-language questions and immediately get verifiable outputs: the system shows the exact SQL it ran on ClickHouse and returns evidence like trace IDs and scoped impact.
This quick talk shares the core architecture and the “trust patterns” that make it usable under pressure: OpenTelemetry as a common schema, ClickHouse as a single analytical backend for high-cardinality observability data, read-only query tooling, and a UI that keeps humans in the loop. You will leave with a clear blueprint to build a conversational investigation experience without turning incident response into a hallucination lottery.