Managing custom foundation model deployments from deployment space
Copy link to section
You can access, update, scale, delete, and monitor the performance of your custom model deployment in your deployment space.
Accessing deployment details from deployment space
Copy link to section
Follow these steps to review or update deployment details:
From the Deployments tab of your deployment space, click a deployment name.
Click the Deployment details tab to access information that is related to your custom foundation model deployment.
Note: If your organization is using any of the use cases to track and govern assets, deployment information for a tracked asset is recorded in a factsheet in the associated use case.
Updating deployment details from deployment space
Copy link to section
You can update the details for your custom foundation model deployment, such as name, serving name, description, and hardware specifications. For more information, see Updating a deployment.
Scaling a deployment in a deployment space
Copy link to section
You can scale your deployment by increasing the number of copies that are created for your deployment. For more information, see Scaling a deployment.
Deleting a deployment from a deployment space
Copy link to section
You can delete your custom foundation model deployment when you don't need it anymore, to free up resources. For more information, see Deleting a deployment.
Note:
In workflows where your custom foundation model is used periodically, consider assigning your model the same serving name each time you deploy it. This way, after you delete and then re-deploy the model, you can keep using the
same endpoint in your code.
Monitoring deployment performance from a deployment space
Copy link to section
You can evaluate your custom foundation model deployment to measure performance and understand model predictions by provisioning a watsonx.governance instance and configuring monitors for fairness, quality, drift, and explainability. For more
information, see Evaluating deployments in spaces with watsonx.governance.
Managing a custom foundation model deployment programmatically
Copy link to section
Prerequisites
Copy link to section
You can access, update, scale, delete, and monitor the performance of your custom model deployment programmatically.
To update or delete a deployment programmatically, first get the list of deployed models to find the correct metadata for the deployment.
Getting the list of deployed models
Copy link to section
Get the list of deployments for the specified project ID. To filter for all deployments that point to custom foundation models, use the type=custom_foundation_model query parameter. Refer to this example code:
Monitoring deployment performance from a deployment space
Copy link to section
You can evaluate your custom foundation model deployment to measure performance and understand model predictions by provisioning a watsonx.governance instance and configuring monitors for fairness, quality, drift, and explainability. For more
information, see Evaluating deployments in spaces with watsonx.governance.
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.