Skip to content

About Auditing Models

NVIDIA NeMo Auditor is released with early access availability and is subject to limited support and potential API changes in future releases.

NVIDIA NeMo Auditor audits LLMs by probing them with adversarial prompts and detecting failures such as jailbreaks, prompt injection, encoding bypasses, and unsafe output generation. It is powered by garak, NVIDIA's open-source LLM vulnerability scanner, and integrates with NeMo Platform so audits can target any model reachable through the Inference Gateway.

Tutorials SDK Resources


Typical Workflow

A typical audit looks like the following:

  1. Create an audit target for the model you want to test.
  2. Create an audit configuration that selects which garak probes and detectors to run, along with reporting settings.
  3. Run the audit and inspect the resulting JSONL, HTML, and hitlog reports.

The plugin exposes both synchronous and asynchronous Python entry points for each step.


Setup

Before you can run audits, you need a working NeMo Platform install with the auditor plugin enabled and a garak interpreter on disk.

  • Follow Setup to install the platform and start local services.
  • Install garak in a Python virtual environment. By default the plugin invokes ~/.auditor/.venv/bin/python -m garak; override the interpreter path with NEMO_AUDITOR_GARAK_PYTHON if you installed it elsewhere.
  • Configure at least one Inference Gateway provider so audits can route requests to the model under test.

Task Guides

  • Audit Targets


    Define the model under test — generator type, model identifier, and inference endpoint.

  • Audit Configurations


    Choose probes, detectors, and reporting settings for the audit.

  • Run an Audit Locally


    End-to-end walkthrough: create entities, run the audit in-process, read the report artifacts.

  • SDK Resources


    Reference for the client.auditor SDK surface: configs, targets, and run().

References

  • Configuration Schema


    Field reference for AuditConfig and its system, run, plugins, and reporting sub-models.

  • Target Schema


    Field reference for AuditTarget (type, model, options).

  • Selecting Probes


    probe_spec, probe_tags, and detector_spec syntax with worked examples.

  • Inference Gateway


    How nmp_uri_spec resolves a target's URI through a NeMo Platform provider.