Deployment Autoscaling
Deployment Autoscaling

--minInstanceCount TEXT Minimal instance count
--maxInstanceCount TEXT Maximal instance count
--scaleCooldownPeriod TEXT Scale cooldown period
--metric TEXT Autoscaling metrics. Example:
my_metric/targetAverage:21.37
--resource TEXT Autoscaling resources. Example:
cpu/target:60gradient deployments create --deploymentType TFServing --modelId <ModelID> --name <name> --machineType c5.xlarge --minInstanceCount 1 --maxInstanceCount 5 --resource cpuPercentage/targetAverage:90 --imageUrl tensorflow/serving --instanceCount 1 ... Algorithm Details
Last updated