Observability in Catalyst
Catalyst ships with built-in observability for every workload running on the platform — no instrumentation required. From the Catalyst console, inspect workflow executions step-by-step, drill into agent runs, and browse API-level request logs with token usage for LLM calls. You also get topology and metrics across your project out of the box.
Every feature on this page — metrics, API logs, workflow replay, agent executions, and the application graph — is available in the Catalyst Web UI at catalyst.diagrid.io.
For local workflow and actor inspection during development, use the Diagrid Dev Dashboard. The observability features on this page apply to workloads hosted on Catalyst.
📈Metrics
Request counts, latencies, error rates, and resource utilization per project and App ID.
📝API Logs
Request/response inspection for every Dapr API call, including LLM token usage.
🔁Workflow Replay
Step-level drill-down, execution graph, and replay for durable workflows.
Metrics dashboard
The console's Metrics page shows request counts, latencies, error rates, and resource utilization scoped by project and App ID. Use it to answer questions like:
- How many requests per second is my App ID receiving?
- What's the 95th-percentile (p95) latency of my pub/sub deliveries?
- Are any components returning errors?
- Am I close to project-level rate limits?
Metrics are retained according to your plan — on Catalyst Cloud, retention is 7 days by default; longer retention is available on paid plans.
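To make the p95 and error-rate questions concrete, here is a minimal Python sketch of how those two figures can be derived from raw request samples. The `Sample` type and the sample data are hypothetical. The nearest-rank percentile and the "5xx counts as an error" rule are assumptions for illustration; this page does not state how the console computes either figure.

```python
import math
from dataclasses import dataclass


@dataclass
class Sample:
    """One observed request (hypothetical shape, for illustration only)."""
    latency_ms: float
    status_code: int


def percentile(values, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of samples at or below it."""
    if not values:
        raise ValueError("no samples")
    ordered = sorted(values)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]


def error_rate(samples):
    """Fraction of samples with a 5xx status code (assumed error definition)."""
    if not samples:
        return 0.0
    errors = sum(1 for s in samples if s.status_code >= 500)
    return errors / len(samples)


# 19 fast, successful requests plus one slow failure.
samples = [Sample(latency_ms=10.0 * i, status_code=200) for i in range(1, 20)]
samples.append(Sample(latency_ms=900.0, status_code=500))

p95 = percentile([s.latency_ms for s in samples], 95)
rate = error_rate(samples)
print(f"p95={p95} ms, error rate={rate:.0%}")
```

The single 900 ms outlier does not move the p95 here, which is why a percentile is usually a better latency signal than the maximum or the mean.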
API Logs
API Logs are per-call records of every Dapr API request made through Catalyst, captured on the data plane. Each record includes:
- Request and response bodies (subject to body-size limits) for replay and debugging.
- Status codes and latencies for success/failure analysis.
- LLM token usage: token_prompt, token_completion, and token_total for calls through the Conversation API.
- Originating App ID and component for attribution.
API Logs are the fastest way to understand what your application actually sent to Catalyst — useful when an SDK wraps the call in ways that obscure the wire-level request, or when an LLM call returned an unexpected result.
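As a rough illustration of how the token fields support attribution, the Python sketch below sums token usage per App ID. Only the three `token_*` field names come from this page. The record shape, the `app_id` key, and the values are hypothetical, and this page does not describe an export format for API Logs, so treat the input as data you have already collected by some other means.

```python
from collections import defaultdict

# Hypothetical records: only the token_* field names come from the docs.
# The "app_id" key and the dict shape are assumptions for illustration.
records = [
    {"app_id": "chat-api", "token_prompt": 120, "token_completion": 80, "token_total": 200},
    {"app_id": "chat-api", "token_prompt": 300, "token_completion": 50, "token_total": 350},
    {"app_id": "summarizer", "token_prompt": 900, "token_completion": 100, "token_total": 1000},
]

TOKEN_FIELDS = ("token_prompt", "token_completion", "token_total")


def tokens_by_app(rows):
    """Sum prompt, completion, and total tokens per App ID."""
    totals = defaultdict(lambda: {field: 0 for field in TOKEN_FIELDS})
    for row in rows:
        bucket = totals[row["app_id"]]
        for field in TOKEN_FIELDS:
            bucket[field] += row.get(field, 0)  # treat a missing field as zero
    return dict(totals)


usage = tokens_by_app(records)
for app_id, counts in sorted(usage.items()):
    print(app_id, counts)
```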
Application logs
Stream Dapr runtime and application logs directly from the CLI or console:
# Stream logs from an App ID's Dapr runtime
diagrid appid logs my-app --follow
# Fetch recent logs
diagrid appid logs my-app --tail 500
See diagrid appid logs for all flags.
In the console, open an App ID and select the Logs tab to view logs with filtering and search.
Log retention on Catalyst Cloud is 3 days for the free tier and configurable on paid plans.
Workflow execution insights
The Workflows page in the console lists every workflow execution across your project with status (running, completed, failed, paused, stalled), App ID, workflow name, version, and duration. Click into an execution to see:
- Execution graph — a visual DAG of activities and sub-orchestrations.
- History — step-level event log with inputs, outputs, and timestamps.
- Custom status — workflow-reported status strings for progress tracking.
- Error resolution — one-click rerun, resume, or purge of failed or stalled executions.
- Event raising — send an event to a waiting workflow from the UI.
The workflow CLI mirrors these operations so you can wire rerun, resume, and purge into your own tooling:
diagrid workflow list --app-id my-workflow-app --status running
diagrid workflow get --app-id my-workflow-app --instance-id <id>
diagrid workflow pause --app-id my-workflow-app --instance-id <id>
diagrid workflow resume --app-id my-workflow-app --instance-id <id>
diagrid workflow rerun --app-id my-workflow-app --instance-id <id>
diagrid workflow terminate --app-id my-workflow-app --instance-id <id>
diagrid workflow raise-event --app-id my-workflow-app --instance-id <id> \
  --event-name approval --event-data '{"approved": true}'
diagrid workflow purge --app-id my-workflow-app --instance-id <id>
Each subcommand is documented individually: list, get, pause, resume, rerun, terminate, raise-event, and purge.
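One way to wire these commands into your own tooling is a thin wrapper that applies the same action to several instances. The Python sketch below uses only the subcommands and the `--app-id` and `--instance-id` flags shown above. The function names and the instance IDs are hypothetical, and it defaults to a dry run that only builds the commands, since the output format of `diagrid workflow list` is not documented here and the IDs must come from you.

```python
import subprocess

# Subcommands from this page that take exactly --app-id and --instance-id.
ALLOWED_ACTIONS = {"get", "pause", "resume", "rerun", "terminate", "purge"}


def workflow_command(action, app_id, instance_id):
    """Build the argument list for one `diagrid workflow <action>` call."""
    if action not in ALLOWED_ACTIONS:
        raise ValueError(f"unsupported action: {action}")
    return ["diagrid", "workflow", action, "--app-id", app_id, "--instance-id", instance_id]


def run_for_instances(action, app_id, instance_ids, dry_run=True):
    """Build the command for each instance ID and, unless dry_run is set, execute it."""
    commands = [workflow_command(action, app_id, i) for i in instance_ids]
    if not dry_run:
        for argv in commands:
            subprocess.run(argv, check=True)  # raises CalledProcessError on a non-zero exit
    return commands


# Placeholder instance IDs; replace them with real ones from your project.
planned = run_for_instances("rerun", "my-workflow-app", ["instance-1", "instance-2"])
for argv in planned:
    print(" ".join(argv))
```

Passing `dry_run=False` executes each command in turn and stops at the first failure, which is a conservative default for destructive actions such as purge.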
Agent execution view
For AI agents, the Agents page in the console shows every running agent, its framework, and recent executions. Supported frameworks include Dapr Agents, CrewAI, LangGraph, Strands, Microsoft Agent Framework, Google ADK, OpenAI, Pydantic AI, and Deep Agents — see Develop Agents for the full list. Each execution captures:
- Input and output for each agent run.
- Tool calls and their results.
- Conversation history for durable agents with memory.
- Session inspection for multi-turn interactions.
Combine the agent view with API Logs to see the underlying LLM calls, including token usage.
# List agents in the current project
diagrid agent list
# Get agent details
diagrid agent get my-agent
See diagrid agent list and diagrid agent get.
Application graph
The Application Graph shows the topology of your project — which App IDs talk to which other App IDs, which components they use, and where pub/sub traffic flows. Use it to verify that services are wired up as expected and to spot unexpected cross-project dependencies.
Distributed tracing
Catalyst emits OpenTelemetry traces for every Dapr API call. Export them to your preferred backend (Datadog, New Relic, Honeycomb, Jaeger, or any OTLP-compatible collector) by attaching a Configuration resource to your App IDs:
apiVersion: cra.diagrid.io/v1beta1
kind: Configuration
metadata:
  name: tracing
spec:
  tracing:
    samplingRate: "1"
    otel:
      endpointAddress: "otel-collector.my-namespace.svc.cluster.local:4317"
      isSecure: false
      protocol: grpc
diagrid apply -f tracing-config.yaml
diagrid appid update my-app --app-config tracing --wait
See Declarative management for the full diagrid apply workflow and diagrid appid update for binding configurations to App IDs. For Catalyst Enterprise Self-Hosted, the observability guide covers platform-level telemetry exports.
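If you maintain tracing settings for several environments, a small generator can keep the manifests consistent. The Python sketch below renders the same Configuration shape as the example above. The function name is hypothetical. Accepting a fractional sampling rate between 0 and 1 and an `http` protocol value are assumptions carried over from the open-source Dapr Configuration spec, so verify them against the Catalyst reference before relying on them.

```python
def render_tracing_config(name, endpoint, sampling_rate=1.0, secure=False, protocol="grpc"):
    """Return a tracing Configuration manifest as YAML text."""
    if not 0.0 <= sampling_rate <= 1.0:
        raise ValueError("sampling_rate must be between 0 and 1")
    if protocol not in ("grpc", "http"):  # assumed value set, taken from the Dapr spec
        raise ValueError(f"unsupported protocol: {protocol}")
    lines = [
        "apiVersion: cra.diagrid.io/v1beta1",
        "kind: Configuration",
        "metadata:",
        f"  name: {name}",
        "spec:",
        "  tracing:",
        f'    samplingRate: "{sampling_rate:g}"',  # quoted string, as in the example
        "    otel:",
        f'      endpointAddress: "{endpoint}"',
        f"      isSecure: {'true' if secure else 'false'}",
        f"      protocol: {protocol}",
    ]
    return "\n".join(lines) + "\n"


manifest = render_tracing_config(
    "tracing", "otel-collector.my-namespace.svc.cluster.local:4317"
)
print(manifest)
```

Writing the result to tracing-config.yaml lets you apply it with the same diagrid apply and diagrid appid update commands shown above.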
Audit logs
User and application audit logs capture who did what and when: for example, which user created an App ID, who rotated an API key, or which service account deployed a new component. Audit logs are available on Catalyst Enterprise plans and support compliance review. Contact Diagrid to enable audit log export for your organization.
What's next
- Develop Workflows — build workflows whose executions appear in the console.
- Develop Agents — build agents whose runs appear on the Agents page.
- Catalyst Enterprise Self-Hosted Observability — OpenTelemetry export from the self-hosted data plane.
- Plans & Support — retention limits and SLAs per plan.