Tracing#
Tracing enhances the observability of guardrails execution. This section explains the configuration process for implementing tracing with NeMo Guardrails.
With tracing, you can:
- Track which rails are activated during conversations. 
- Monitor LLM calls and their performance. 
- Debug flow execution and identify bottlenecks. 
- Analyze conversation patterns and errors. 
Span Formats#
Starting with NeMo Guardrails v0.16.0, the tracing system has transitioned to OpenTelemetry semantic conventions for Generative AI (GenAI), moving away from the legacy span format. This change enhances observability standardization and improves monitoring capabilities for AI workloads.
Reference Documentation:
- OpenTelemetry Semantic Conventions for GenAI - Overview of GenAI semantic conventions 
- GenAI Events Specification - Details on capturing user inputs and model outputs 
- GenAI Spans Specification - Span naming and attribute conventions 
Supported Formats#
NeMo Guardrails support the following formats.
- OpenTelemetry ( - opentelemetry) - Recommended default format following OpenTelemetry semantic conventions for GenAI
- Legacy ( - legacy) - Previous format with simple metrics dictionary (deprecated)
Configuration#
You can control the span format through the tracing configuration:
tracing:
  enabled: true
  span_format: "opentelemetry"  # default
  enable_content_capture: false  # default, see privacy considerations below
Key Differences#
The following are the key differences between the supported span formats.
OpenTelemetry Format:
- Follows the standardized semantic conventions 
- Rich-structured attributes and events (e.g., - gen_ai.request.model,- gen_ai.usage.input_tokens)
- Enhanced LLM call tracking with provider and model information 
- Support for span events and error tracking 
- Compatible with OpenTelemetry ecosystem tools 
Legacy Format:
- Simple metrics dictionary 
- Minimal overhead with limited observability 
- Deprecated - maintained for backward compatibility only 
Important Considerations#
Development Status: The OpenTelemetry semantic conventions for GenAI are currently in development and may undergo changes. Consider the following risks:
- Evolving Standards: Conventions may change as they mature, potentially affecting existing implementations 
- Data Privacy: The - enable_content_captureoption captures user inputs and model outputs, which may include sensitive information (PII). Only enable when necessary and ensure compliance with data protection regulations. See GenAI Events documentation for details
- Performance Impact: Extensive telemetry collection may impact system performance, especially with large inputs/outputs 
Migration Path#
Existing configurations will continue to work. However, it is strongly recommended to migrate to the OpenTelemetry format. Migration steps are:
- Update your configuration to use - span_format: "opentelemetry"
- Review your telemetry backends for compatibility with OpenTelemetry conventions 
- Test thoroughly, as span structures and attribute names have changed 
- Consider privacy implications if enabling content capture 
Contents#
- Quick Start - Minimal setup to enable tracing using the OpenTelemetry SDK 
- Adapter Configurations - Detailed configuration for FileSystem, OpenTelemetry, and Custom adapters 
- Advanced OpenTelemetry Integration - Production-ready OpenTelemetry setup and ecosystem compatibility 
- common-integrations.md- Setup examples for Jaeger, Zipkin, and OpenTelemetry Collector
- Troubleshooting - Common issues and solutions 
Jupyter Notebooks#
- Tracing Guardrails Quickstart - A quickstart guide to tracing Guardrails requests in sequential and parallel modes. 
- Tracing Guardrails with Jaeger - A guide to visualize Guardrails traces in Jaeger.