Overview of NVIDIA NeMo Guardrails Library#
The NVIDIA NeMo Guardrails library (PyPI | GitHub) is an open-source Python package for adding programmable guardrails to LLM-based applications. It intercepts inputs and outputs, applies configurable safety checks, and blocks or modifies content based on defined policies.
NeMo Guardrails Library within the NVIDIA NeMo Software Stack#
NVIDIA NeMo is a suite of microservices, tools, and libraries for building, deploying, and scaling LLM-based applications.
NeMo Guardrails is part of the NVIDIA NeMo software stack and is the component responsible for adding programmable guardrails to LLM-based applications. The NeMo Guardrails library provides the tools to build guardrails and integrate them into your LLM-based applications at development time. The NeMo Guardrails microservice, part of the NeMo microservices platform, is a production-ready container image built on top of this library and designed for Kubernetes deployment with Helm charts.
|  | NeMo Guardrails Library | NeMo Guardrails Microservice |
|---|---|---|
| Distribution | PyPI (`nemoguardrails`) | Container image (backed by this library) |
| Deployment | Self-managed Python environment | Kubernetes with Helm |
| Scaling | Application-level | Managed by orchestrator |
| Configuration | YAML + Colang | Same YAML + Colang format |
Configurations are portable between the library and microservice, so you can develop locally with the library and deploy to production with the microservice.
Architecture#
The NeMo Guardrails library is designed to be integrated into LLM-based applications. As shown in the following diagram, it sits between your application code and the LLM: it intercepts inputs and outputs, applies configurable safety checks (optionally delegating to the NemoGuard NIM microservices), and blocks or modifies content based on defined policies.
%%{init: {'theme': 'neutral', 'themeVariables': { 'background': 'transparent' }}}%%
flowchart TB
A("Application Code")
B("NeMo Guardrails Library")
C("Large Language Model (LLM)")
A <--> B
subgraph NemoGuard["NemoGuard NIMs"]
direction TB
D("NemoGuard Content Safety")
E("NemoGuard Topic Control")
F("NemoGuard Jailbreak Detection")
end
B <--> NemoGuard
NemoGuard <--> C
style A fill:#d8d8e8,stroke:#999
style B fill:#f0f7e6,stroke:#76b900,stroke-width:2px
style C fill:#d8d8e8,stroke:#999
style D fill:#f0f7e6,stroke:#76b900
style E fill:#f0f7e6,stroke:#76b900
style F fill:#f0f7e6,stroke:#76b900
Application code interacting with LLMs through the NeMo Guardrails library.
Use Cases#
The following are the most common use cases of the NeMo Guardrails library for protecting your LLM applications.
🛡️ Add Content Safety
Content safety guardrails help ensure that both user inputs and LLM outputs are safe and appropriate. The NeMo Guardrails library provides multiple approaches to content safety:
LLM self-checking: Use the LLM itself to check inputs and outputs for harmful content.
NVIDIA safety models: Integrate with Llama 3.1 NemoGuard 8B Content Safety for robust content moderation.
Community models: Use LlamaGuard, Fiddler Guardrails, and other community content safety solutions.
Third-party APIs: Integrate with ActiveFence, Cisco AI Defense, and other moderation services.
For practical, end-to-end examples, refer to the content safety tutorials in this documentation.
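The following is a minimal sketch of the LLM self-checking approach, using the built-in self check input and self check output flows. The model name, prompt wording, and policy text are illustrative assumptions rather than required values, and the main model needs valid credentials (for example, an OpenAI API key) to run.

```python
from nemoguardrails import LLMRails, RailsConfig

# Illustrative sketch: the model and the prompt wording are assumptions.
yaml_content = """
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

rails:
  input:
    flows:
      - self check input
  output:
    flows:
      - self check output

prompts:
  - task: self_check_input
    content: |
      Your task is to check if the user message below complies with the policy:
      no harmful, abusive, or explicit content.

      User message: "{{ user_input }}"

      Should the user message be blocked (Yes or No)?
      Answer:
  - task: self_check_output
    content: |
      Your task is to check if the bot message below complies with the policy:
      no harmful, abusive, or explicit content.

      Bot message: "{{ bot_response }}"

      Should the bot message be blocked (Yes or No)?
      Answer:
"""

config = RailsConfig.from_content(yaml_content=yaml_content)
rails = LLMRails(config)
response = rails.generate(messages=[{"role": "user", "content": "Hello!"}])
print(response["content"])
```

When a self-check answers Yes, the corresponding rail blocks the message and the bot returns a refusal instead of the original content.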
🔒 Add Jailbreak Protection
Jailbreak protection helps prevent adversarial attempts to bypass safety measures and manipulate the LLM into generating harmful or unwanted content. The NeMo Guardrails library provides multiple layers of jailbreak protection:
Self-check jailbreak detection: Use the LLM to identify jailbreak attempts.
Heuristic detection: Use pattern-based detection for common jailbreak techniques.
NVIDIA NemoGuard: Integrate with NemoGuard Jailbreak Detection NIM for advanced threat detection.
Third-party integrations: Use Prompt Security, Pangea AI Guard, and other services.
For a practical, end-to-end example, refer to the jailbreak protection tutorial in this documentation.
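As a sketch of the heuristic approach, the following configuration enables the built-in jailbreak detection heuristics input flow. The threshold values are illustrative, and the perplexity-based checks need additional dependencies (a local language model for scoring), so treat this as an outline rather than a drop-in configuration.

```python
from nemoguardrails import RailsConfig

# Illustrative sketch: the main model and the threshold values are assumptions.
yaml_content = """
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

rails:
  config:
    jailbreak_detection:
      length_per_perplexity_threshold: 89.79
      prefix_suffix_perplexity_threshold: 1845.65
  input:
    flows:
      - jailbreak detection heuristics
"""

config = RailsConfig.from_content(yaml_content=yaml_content)
```

Prompts flagged by the heuristics are rejected before they reach the main LLM.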
🎯 Control Topic Conversation
Topic control guardrails ensure that conversations stay within predefined subject boundaries and prevent the LLM from engaging in off-topic discussions. This is implemented through:
Dialog rails: Pre-define conversational flows using the Colang language.
Topical rails: Control what topics the bot can and cannot discuss.
NVIDIA NemoGuard: Integrate with NemoGuard Topic Control NIM for semantic topic detection.
For a practical, end-to-end example, refer to the topic control tutorial in this documentation.
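The following sketch shows a dialog rail written in Colang that keeps the bot on topic. The example utterances, flow name, and refusal message are illustrative assumptions.

```python
from nemoguardrails import LLMRails, RailsConfig

# Illustrative sketch: the model, example utterances, and refusal are assumptions.
yaml_content = """
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct
"""

colang_content = """
define user ask off topic
  "What do you think about politics?"
  "Can you give me investment advice?"

define bot refuse off topic
  "Sorry, I can only help with questions about our products."

define flow off topic
  user ask off topic
  bot refuse off topic
"""

config = RailsConfig.from_content(
    colang_content=colang_content, yaml_content=yaml_content
)
rails = LLMRails(config)
```

When an incoming message is semantically close to the ask off topic examples, the dialog rail routes the conversation to the refusal instead of letting the LLM answer freely.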
🔐 Detect and Mask PII
Personally Identifiable Information (PII) detection helps protect user privacy by detecting and masking sensitive data in user inputs, LLM outputs, and retrieved content. The NeMo Guardrails library supports PII detection through multiple integrations:
Presidio-based detection: Use Microsoft Presidio for detecting entities such as names, email addresses, phone numbers, social security numbers, and more.
Private AI: Integrate with Private AI for advanced PII detection and masking.
AutoAlign: Use AutoAlign PII detection with customizable entity types.
GuardrailsAI: Access GuardrailsAI PII validators from the Guardrails Hub.
PII detection can be configured to either detect and block content containing PII or to mask PII entities before processing.
For more information, refer to the Presidio Integration and Sensitive Data Detection section in the built-in Guardrails library.
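As a sketch of the Presidio-based approach, the following configuration masks a few entity types in user input. The entity selection is an illustrative assumption, and the Presidio dependencies must be installed separately.

```python
from nemoguardrails import RailsConfig

# Illustrative sketch: the entity list is an assumption; Presidio and its
# models must be installed for the sensitive data flows to run.
yaml_content = """
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

rails:
  config:
    sensitive_data_detection:
      input:
        entities:
          - PERSON
          - EMAIL_ADDRESS
          - PHONE_NUMBER
  input:
    flows:
      - mask sensitive data on input
"""

config = RailsConfig.from_content(yaml_content=yaml_content)
```

Using the detection flow instead (detect sensitive data on input) blocks matching requests rather than masking them.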
🤖 Add Agentic Security
Agentic security provides specialized guardrails for LLM-based agents that use tools and interact with external systems. This includes:
Tool call validation: Use execution rails to validate tool inputs and outputs before and after invocation.
Agent workflow protection: Integrate with LangGraph for multi-agent safety.
Secure tool integration: Review guidelines for safely connecting LLMs to external resources (refer to Security Guidelines).
Action monitoring: Use detailed logging and tracing to monitor agent actions.
Key security considerations for agent systems:
Isolate all authentication information from the LLM.
Validate and sanitize all tool inputs.
Apply execution rails to tool calls.
Monitor agent behavior for unexpected actions.
For more information, refer to the Tools Integration Guide, Security Guidelines, and LangGraph Integration.
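For the action monitoring point above, the library's explain utility gives a per-request view of the LLM calls made while processing a message, which you can feed into your own logging. The configuration path and user message below are assumptions.

```python
from nemoguardrails import LLMRails, RailsConfig

# Assumes a guardrails configuration already exists at ./config.
config = RailsConfig.from_path("./config")
rails = LLMRails(config)

response = rails.generate(
    messages=[{"role": "user", "content": "Look up the weather for Berlin."}]
)

# Inspect what happened during the last call: which LLM calls were made,
# with which prompts and completions.
info = rails.explain()
info.print_llm_calls_summary()
for llm_call in info.llm_calls:
    print(llm_call.prompt)
    print(llm_call.completion)
```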
🔧 Build Your Own or Use Third-party Guardrail Solutions
The NeMo Guardrails library provides extensive flexibility for creating custom guardrails tailored to your specific requirements. You can either build your own guardrails or use third-party guardrails. If you have a script or tool that runs a custom guardrail, you can use it in NeMo Guardrails by following one of these approaches:
Python actions: Create custom actions in Python for complex logic and external integrations. For more information, refer to the Custom Actions documentation.
LangChain tool integration: Register LangChain tools as custom actions. For more information, refer to Tools Integration with the NeMo Guardrails Library.
Third-party API integration: Integrate external moderation and validation services. For a complete list of supported third-party guardrail services, refer to the Third-Party APIs section in the built-in Guardrails library.
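As a sketch of the Python-action approach, the following registers a custom check as an action and wires it into an input rail with Colang. The blocked-terms list, the action name, and the flow are illustrative assumptions; the same pattern works for wrapping a third-party moderation API, with the action calling the external service instead of scanning a list.

```python
from typing import Optional

from nemoguardrails import LLMRails, RailsConfig

# Hypothetical check: the term list and action name are assumptions.
BLOCKED_TERMS = ["confidential", "proprietary"]

async def check_blocked_terms(context: Optional[dict] = None) -> bool:
    user_message = (context or {}).get("user_message", "")
    return any(term in user_message.lower() for term in BLOCKED_TERMS)

yaml_content = """
models:
  - type: main
    engine: openai
    model: gpt-3.5-turbo-instruct

rails:
  input:
    flows:
      - check blocked terms
"""

colang_content = """
define bot refuse to respond
  "I'm sorry, I can't help with that request."

define flow check blocked terms
  $is_blocked = execute check_blocked_terms
  if $is_blocked
    bot refuse to respond
    stop
"""

config = RailsConfig.from_content(
    colang_content=colang_content, yaml_content=yaml_content
)
rails = LLMRails(config)
rails.register_action(check_blocked_terms, name="check_blocked_terms")
```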
🔌 Integrate NeMo Guardrails Library into Your Application
You can integrate the NeMo Guardrails library into your application in either of the following ways:
Python SDK: Use the Python SDK to add guardrails directly into your Python application.
from nemoguardrails import LLMRails, RailsConfig

config = RailsConfig.from_path("./config")
rails = LLMRails(config)
response = rails.generate(
    messages=[{"role": "user", "content": "Hello!"}]
)

The generate method accepts the same message format as the OpenAI Chat Completions API.

Server API: You can also set up a guardrails server after developing your guardrails with the Python SDK. Start a local NeMo Guardrails server with the following command:

nemoguardrails server --config ./config --port 8000

The server exposes an HTTP API compatible with OpenAI’s /v1/chat/completions endpoint. You can then use the guardrails from your application by sending requests to that endpoint.
Tools#
The following are the tools you can use to interact with the NeMo Guardrails library.
Python SDK#
from nemoguardrails import LLMRails, RailsConfig
config = RailsConfig.from_path("./config")
rails = LLMRails(config)
response = rails.generate(
messages=[{"role": "user", "content": "Hello!"}]
)
The generate method accepts the same message format as the OpenAI Chat Completions API.
CLI Server#
nemoguardrails server --config ./config --port 8000
The server exposes an HTTP API compatible with OpenAI’s /v1/chat/completions endpoint.
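A minimal sketch of calling that endpoint from Python, assuming the server started above is listening on localhost port 8000. The config_id field is an assumption about how the request identifies the guardrails configuration to use; adjust it to match your server setup.

```python
import requests

# Assumes the server from the command above runs on localhost:8000.
response = requests.post(
    "http://localhost:8000/v1/chat/completions",
    json={
        "config_id": "config",  # assumption: identifier of the served guardrails configuration
        "messages": [{"role": "user", "content": "Hello!"}],
    },
    timeout=30,
)
print(response.json())
```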