Skip to content

Tutorials

Learn how to run inference through the NeMo Platform.

Prerequisites

  • NMP_BASE_URL environment variable set to your platform URL
  • Appropriate API credentials for an external provider (e.g. an NGC API key for NVIDIA Build)

Guides

  • Run Inference — Route requests via model entity, provider, or OpenAI routing