Observability in Catalyst

Catalyst ships with built-in observability for every workload running on the platform — no instrumentation required. From the Catalyst console, inspect workflow executions step-by-step, drill into agent runs, and browse API-level request logs with token usage for LLM calls. You also get topology and metrics across your project out of the box.

Open in the Catalyst console

Every feature on this page — metrics, API logs, workflow replay, agent executions, and the application graph — is available in the Catalyst Web UI at catalyst.diagrid.io.

Local development

For local workflow and actor inspection during development, use the Diagrid Dev Dashboard. The observability features on this page apply to workloads hosted on Catalyst.

📈Metrics

Request counts, latencies, error rates, and resource utilization per project and App ID.

📝API Logs

Request/response inspection for every Dapr API call, including LLM token usage.

🔁Workflow Replay

Step-level drill-down, execution graph, and replay for durable workflows.

Metrics dashboard

The console's Metrics page shows request counts, latencies, error rates, and resource utilization scoped by project and App ID. Use it to answer questions like:

How many requests per second is my App ID receiving?
What's the 95th-percentile (p95) latency of my pub/sub deliveries?
Are any components returning errors?
Am I close to project-level rate limits?

Metrics are retained according to your plan — on Catalyst Cloud, retention is 7 days by default; longer retention is available on paid plans.

API Logs

API Logs are per-call records of every Dapr API request made through Catalyst, captured on the data plane:

Request and response bodies (subject to body-size limits) for replay and debugging.
Status codes and latencies for success/failure analysis.
LLM token usage — token_prompt, token_completion, and token_total for calls through the Conversation API.
Originating App ID and component for attribution.

API Logs are the fastest way to understand what your application actually sent to Catalyst — useful when an SDK wraps the call in ways that obscure the wire-level request, or when an LLM call returned an unexpected result.

Application logs

Stream Dapr runtime and application logs directly from the CLI or console:

# Stream logs from an App ID's Dapr runtime
diagrid appid logs my-app --follow

# Fetch recent logs
diagrid appid logs my-app --tail 500

See diagrid appid logs for all flags.

In the console, open an App ID and select the Logs tab to view logs with filtering and search.

Log retention on Catalyst Cloud is 3 days for the free tier and configurable on paid plans.

Workflow execution insights

The Workflows page in the console lists every workflow execution across your project with status (running, completed, failed, paused, stalled), App ID, workflow name, version, and duration. Click into an execution to see:

Execution graph — a visual DAG of activities and sub-orchestrations.
History — step-level event log with inputs, outputs, and timestamps.
Custom status — workflow-reported status strings for progress tracking.
Error resolution — one-click rerun, resume, or purge of failed or stalled executions.
Event raising — send an event to a waiting workflow from the UI.

The workflow CLI mirrors these operations so you can wire replay, resume, and purge into your own tooling:

diagrid workflow list --app-id my-workflow-app --status running
diagrid workflow get --app-id my-workflow-app --instance-id <id>
diagrid workflow pause --app-id my-workflow-app --instance-id <id>
diagrid workflow resume --app-id my-workflow-app --instance-id <id>
diagrid workflow rerun --app-id my-workflow-app --instance-id <id>
diagrid workflow terminate --app-id my-workflow-app --instance-id <id>
diagrid workflow raise-event --app-id my-workflow-app --instance-id <id> \
  --event-name approval --event-data '{"approved": true}'
diagrid workflow purge --app-id my-workflow-app --instance-id <id>

Each subcommand is documented individually: list, get, pause, resume, rerun, terminate, raise-event, and purge.

Agent execution view

For AI agents, the Agents page in the console shows every running agent, its framework, and recent executions. Supported frameworks include Dapr Agents, CrewAI, LangGraph, Strands, Microsoft Agent Framework, Google ADK, OpenAI, Pydantic AI, and Deep Agents — see Develop Agents for the full list. Each execution captures:

Input and output for each agent run.
Tool calls and their results.
Conversation history for durable agents with memory.
Session inspection for multi-turn interactions.

Combine the agent view with API Logs to see the underlying LLM calls, including token usage.

# List agents in the current project
diagrid agent list

# Get agent details
diagrid agent get my-agent

See diagrid agent list and diagrid agent get.

Application graph

The Application Graph shows the topology of your project — which App IDs talk to which other App IDs, which components they use, and where pub/sub traffic flows. Use it to verify that services are wired up as expected and to spot unexpected cross-project dependencies.

Distributed tracing

Catalyst emits OpenTelemetry traces for every Dapr API call. Export them to your preferred backend (Datadog, New Relic, Honeycomb, Jaeger, or any OTLP-compatible collector) by attaching a Configuration resource to your App IDs:

apiVersion: cra.diagrid.io/v1beta1
kind: Configuration
metadata:
  name: tracing
spec:
  tracing:
    samplingRate: "1"
    otel:
      endpointAddress: "otel-collector.my-namespace.svc.cluster.local:4317"
      isSecure: false
      protocol: grpc

diagrid apply -f tracing-config.yaml
diagrid appid update my-app --app-config tracing --wait

See Declarative management for the full diagrid apply workflow and diagrid appid update for binding configurations to App IDs. For Catalyst Enterprise Self-Hosted, the observability guide covers platform-level telemetry exports.

Audit logs

User and application audit logs capture who did what and when — for example, which user created an App ID, who rotated an API key, or which service account deployed a new component. Audit logs are available on Catalyst Enterprise plans and support compliance review. Contact Diagrid to enable audit log export on your organization.

What's next

Develop Workflows — build workflows whose executions appear in the console.
Develop Agents — build agents whose runs appear on the Agents page.
Catalyst Enterprise Self-Hosted Observability — OpenTelemetry export from the self-hosted data plane.
Plans & Support — retention limits and SLAs per plan.

📈Metrics

📝API Logs

🔁Workflow Replay

Metrics dashboard​

API Logs​

Application logs​

Workflow execution insights​

Agent execution view​

Application graph​

Distributed tracing​

Audit logs​

What's next​