Gradient Docs
Gradient HomeHelp DeskCommunitySign up free
Gradient Next
Gradient Next
  • About Gradient
  • Get Started
    • Quick Start
      • Install the Gradient CLI
    • Core Concepts
    • Organizing Projects
      • Secrets
      • Storing an API key as a Secret
    • Tutorials
      • Gradient Notebooks Tutorial
      • Gradient Workflows Tutorial
      • Gradient Deployments Tutorial
    • FAQ
    • Common Errors
  • Gradient Platform
    • Gradient Notebooks
      • Runtimes
      • Files and storage
      • Machines
      • Terminal
      • Shortcuts
      • Sharing
      • TensorBoard
      • Run on Gradient
    • Gradient Workflows
      • Basic operations
      • Understanding Inputs & Outputs
      • Workflow Spec
      • Gradient Actions
      • Environment Variables
      • Using YAML for Data Science
    • Gradient Deployments
      • Basic operations
      • Deployment Spec
  • Artifacts
    • Container Management
      • Custom Containers
    • Data
      • Versioned Data
        • Public Datasets Repository
        • Storage Providers
      • Persistent Storage
    • Models
      • Managing Models
        • Model Types & Metadata
        • Public Models
    • Code
    • Metrics
      • Push Metrics
      • View & Query Metrics
  • Gradient Cluster
    • Overview
      • Setup
        • Managed Private Clusters
        • Self-Hosted Clusters
          • Pre-installation steps
          • Gradient Installer CLI
          • Terraform
            • Pre-installation steps
            • Install on AWS
            • Install on bare metal / VMs
            • Install on NVIDIA DGX
          • Let's Encrypt DNS Providers
          • Updating your cluster
      • Usage
  • More
    • SDK
      • Projects Client
      • Models Client
      • Deployments Client
      • Workflows Client
      • SDK Examples
      • Full SDK Reference
    • Machine Types
      • Machine Tiers
      • Free Machines (Free Tier)
    • Your Account
      • Teams
        • Creating a Team
        • Upgrading to a Team Plan
      • Hotkeys
      • Billing & Subscriptions
        • Storage Billing
      • Public Profiles
    • Release notes
    • Roadmap
Powered by GitBook
On this page
  • Overview
  • Where to start
  • Key concepts
  1. Gradient Platform

Gradient Deployments

Gradient Deployments helps you perform effortless model serving.

PreviousUsing YAML for Data ScienceNextBasic operations

Last updated 3 years ago

Gradient Deployments is currently in Preview. If you would like to provide feedback on the deployments product, please .

Overview

After Notebooks and Workflows, the third major component of the Gradient Platform is Deployments.

Deployments is used to run container images and to serve machine learning models using a high-performance, low-latency micro-service with a RESTful API.

This allows the model to be run on new unseen data in production, also known as model inference.

Where to start

The best place to start learning how to deploy models on Gradient is the official Gradient Deployments Tutorial:

Key concepts

Deployments has a number of concepts to allow different configuration.

Deployment Spec

Deployment Run

A deployment can have multiple runs at the same time. Any update to the deployment spec can create a new deployment run. Once the latest deployment run is ready, the previous deployment run will scale down.

A deployment spec is used to represent the desired state of your deployment. With the you can use a to change the desired state of your deployment.

Gradient Deployments Tutorial
Gradient CLI
YAML file
reach out
Creating a new deployment in the Gradient console.
Example deployment with YAML spec.