This operation shuts down an inference deployment, making it unavailable for handling requests. The deployment will scale down to 0 replicas, overriding any minimum replica settings.
API key for authentication. Make sure to include the word apikey
, followed by a single space and then your token.
Example: apikey 1234$abcdef
Project ID
1
Inference instance name.
"my-instance"
No Content