📊
Metrics Quantitative system health signals
🔍
Traces End-to-end request visibility
📋
Logs Structured event records over time

8 Observability Categories

01 🤖
AI Observability Platforms

Platforms purpose-built to monitor, trace, and evaluate AI model behaviour, performance, and drift in production environments.

02 🔭
Observability Tools & Platforms

General-purpose observability platforms covering metrics, logs, and traces across distributed systems and microservices architectures.

03 🕸️
Observability for Istio Service Mesh

Specialised tools for gaining deep visibility into traffic flows, latency, and failures within Istio service mesh deployments.

04
APM & Observability Tools

Application Performance Monitoring tools that track response times, error rates, and transaction traces to ensure application health.

05 ☁️
Cloud-Native Observability Tools

Tools designed for Kubernetes, containers, and cloud-native workloads — integrating with OpenTelemetry, Prometheus, and modern stacks.

06 🗃️
Data Observability Tools

Platforms that monitor data pipeline health, data quality, freshness, and lineage — preventing silent data failures from reaching downstream consumers.

07 🧠
LLM Observability Tools

Tools for tracing LLM prompts, responses, token costs, latency, and quality — essential for debugging and optimising AI-powered applications.

08 🤝
Agentic AI Observability Tools

Observability platforms purpose-built for autonomous AI agent systems — tracking multi-step reasoning, tool calls, agent handoffs, and decision chains.

09 📈
ML Observability Platforms

The definitive guide to machine learning observability tools — detect drift, diagnose failures, measure bias, and maintain model health across the full inference lifecycle.