About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Last updated: Feb 14, 2025
When you deploy certain assets in watsonx.ai Runtime, you can choose the type, size, and power of the hardware configuration that matches your computing needs.
Creating hardware specifications for deployments
You can create hardware specifications for your deployments in the following ways:
- Python client library: Use the
function from the Python client library. For more information, see Python client library referencehardware_specifications.store
- Data and AI Common Core API: Use
from the Environments list in the Data and AI Common Core API to create a hardware specification. For more information, see Environments API reference.POST /v2/hardware_specifications
Deployment types that require hardware specifications
Selecting a hardware specification is available for all batch deployment types. For online deployments, you can select a specific hardware specification if you're deploying:
- Python Functions
- Tensorflow models
- Models with custom software specifications
Hardware configurations available for deploying assets
: 1x4 = 1 vCPU and 4 GB RAMXS
: 2x8 = 2 vCPU and 8 GB RAMS
: 4x16 = 4 vCPU and 16 GB RAMM
: 8x32 = 8 vCPU and 32 GB RAML
: 16x64 = 16 vCPU and 64 GB RAMXL
You can use the
configuration to deploy:XS
- Python functions
- Python scripts
- R scripts
- Models based on custom libraries and custom images
For Decision Optimization deployments, you can use these hardware specifications:
S
M
L
XL
Learn more
Parent topic: Managing predictive deployments
Was the topic helpful?
0/1000