Watson Machine Learning plans and compute usage

You use Watson Machine Learning resources, measured in capacity unit hours (CUH), when you train AutoAI models, run deep learning experiments, and request predictions from deployed models. This topic describes the plans you can choose and the services each plan includes, and lists the default computing environments to help you select a plan that matches your needs.

Watson Machine Learning plans

Watson Machine Learning plans govern how you are billed for models you train and deploy with Watson Machine Learning. Choose a plan based on your needs:

  • Lite is a free plan with limited capacity. Choose this plan if you are evaluating Watson Machine Learning and want to try out the capabilities. Note that HIPAA support is not available with the Lite plan.
  • Standard is a pay-as-you-go plan that gives you the flexibility to build, deploy, and manage models to match your needs. Note that Decision Optimization and HIPAA support are not available with the Standard plan.
  • Professional is a high-capacity, flat-rate enterprise plan designed to support all of an organization’s machine learning needs.

For details on pricing, see Watson Machine Learning: Pricing.

This table provides details for plan allowances and restrictions.

Feature | Lite | Standard | Professional
Max published models | 200 | 1000 | 1000
Deployed models | 5 | 1000 | 1000
Predictions | 5000 per month | Billed per prediction | 2 million, then billed per 1,000
Capacity unit hours (CUH) | 50 per month | Billed per CUH | 1,000, then billed for additional CUH
HIPAA readiness | | | Available if provisioned on IBM Cloud - Dallas region
Decision Optimization | Supported | Not supported | Supported
AutoAI Experiments | | |
Batch scoring | | |
Deep learning training | Max 8 K80 GPUs in parallel | Unlimited | Unlimited
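
As a rough illustration of how the CUH allowances in this table translate into runtime, the following Python sketch estimates how many hours an environment could run before a plan's included CUH is used up. The function and dictionary names are hypothetical and are not part of any Watson Machine Learning API. The rate of 20 capacity units per hour in the first example is the AutoAI 8 vCPU rate listed later in this topic.

```python
# Included CUH per plan, copied from the allowance table above.
# Lite's allowance is per month. Standard is billed per CUH and has
# no included amount, so it is left out of this lookup.
INCLUDED_CUH = {"Lite": 50, "Professional": 1000}


def hours_within_allowance(plan: str, units_per_hour: float,
                           cuh_used: float = 0.0) -> float:
    """Hours an environment can run before the included CUH is used up.

    Hypothetical helper for illustration only.
    """
    remaining = max(INCLUDED_CUH[plan] - cuh_used, 0.0)
    return remaining / units_per_hour


# AutoAI on 8 vCPU and 32 GB RAM (20 units per hour) on a fresh Lite plan.
print(hours_within_allowance("Lite", 20))  # 2.5

# One K80 GPU (2 units per hour) on Professional with 400 CUH already used.
print(hours_within_allowance("Professional", 2, cuh_used=400))  # 300.0
```

The sketch ignores the one-minute minimum per operation, so real consumption for many short runs would be higher than this estimate suggests.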

 

Watson Machine Learning compute usage environments

Machine Learning compute usage is measured as the number of capacity unit hours (CUH) consumed by an active machine learning instance.

The rate of capacity units per hour consumed is determined by the computing requirements of your Machine Learning assets and models. For example, a model with a large, complex data set will consume more training resources than a model with a smaller, simpler data set.

Compute time is calculated to the millisecond. However, there is a one-minute minimum for each distinct operation. That is, a training run that takes 12 seconds is billed as one minute toward the capacity unit hour quota, while a training run that takes 83.555 seconds is billed exactly as calculated.
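
The one-minute minimum can be sketched in Python as follows. This is a minimal illustration, not the service's billing code: the function name is hypothetical, and the rate of 2 capacity units per hour is the single K80 GPU rate from the deep learning table in this topic.

```python
MIN_BILLED_SECONDS = 60.0  # one-minute minimum for each distinct operation


def billed_cuh(duration_seconds: float, units_per_hour: float) -> float:
    """CUH charged for one operation (hypothetical helper).

    The duration is assumed to be already measured to the millisecond.
    """
    billed_seconds = max(duration_seconds, MIN_BILLED_SECONDS)
    return units_per_hour * billed_seconds / 3600.0


# A 12-second run is billed as a full minute.
print(f"{billed_cuh(12, 2):.6f}")      # 0.033333

# An 83.555-second run exceeds the minimum and is billed as measured.
print(f"{billed_cuh(83.555, 2):.6f}")  # 0.046419
```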

These tables show the capacity units per hour calculation for machine learning environments, by usage type.

Capacity units per hour for deep learning experiments

Capacity type | Capacity units per hour
1 (one) NVIDIA K80 GPU | 2
1 (one) NVIDIA V100 GPU | 8

 

Capacity units per hour for batch scoring

Capacity type | Capacity units per hour
Extra small: 1x4 = 1 vCPU and 4 GB RAM | 0.5
Small: 2x8 = 2 vCPU and 8 GB RAM | 1
Medium: 4x16 = 4 vCPU and 16 GB RAM | 2
Large: 8x32 = 8 vCPU and 32 GB RAM | 4
Extra large: 16x64 = 16 vCPU and 64 GB RAM | 8

 

Capacity units per hour for AutoAI experiments

Capacity type | Capacity units per hour
AutoAI: 4 vCPU and 16 GB RAM | 10
AutoAI: 8 vCPU and 32 GB RAM | 20

 

Capacity units per hour for Decision Optimization

Capacity type | Capacity units per hour
Decision Optimization: 2 vCPU and 8 GB RAM | 30
Decision Optimization: 4 vCPU and 16 GB RAM | 40
Decision Optimization: 16 vCPU and 64 GB RAM | 60

Note: Decision Optimization is supported on the Watson Machine Learning Lite and Professional plans. It is not supported on the Standard plan.
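
The rate tables above can be captured as a simple lookup for estimating consumption. The Python sketch below is illustrative only: the dictionary keys and the function name are hypothetical labels, not identifiers used by the service, and multiplying the per-GPU rate by the number of GPUs is an assumption based on the rates being listed per single GPU.

```python
# Capacity units per hour, copied from the tables above.
# The keys are hypothetical labels chosen for this sketch.
RATES = {
    "deep_learning": {"K80": 2, "V100": 8},
    "batch_scoring": {"XS": 0.5, "S": 1, "M": 2, "L": 4, "XL": 8},
    "autoai": {"4x16": 10, "8x32": 20},
    "decision_optimization": {"2x8": 30, "4x16": 40, "16x64": 60},
}


def estimate_cuh(usage_type: str, capacity: str, hours: float,
                 count: int = 1) -> float:
    """Estimate CUH for `count` units of one capacity type over `hours`.

    Assumes usage scales linearly with the number of units (for
    example, GPUs running in parallel).
    """
    return RATES[usage_type][capacity] * count * hours


# Two V100 GPUs for 1.5 hours.
print(estimate_cuh("deep_learning", "V100", 1.5, count=2))  # 24.0

# An extra small batch scoring environment for 3 hours.
print(estimate_cuh("batch_scoring", "XS", 3))  # 1.5
```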

For details on how resources are consumed, see Monitoring account resource usage.

 

Track runtime usage for machine learning by project

You can view the machine learning environment runtimes that are currently active in a project, and monitor usage for your machine learning assets from the project Environments page.

Track runtime usage for an account

The CUH consumed by the service runtimes in a project are billed to the account that the project creator selected in their profile settings when the project was created. This can be the account of the project creator or another account that the project creator has access to. If other users are added to the project and use runtimes, their usage is also billed to the account that the project creator chose when the project was created.

You can track the runtime usage for an account on the Environment Runtimes page if you are the IBM Cloud account owner, the account administrator, or the Watson Machine Learning service owner.

To view the total runtime usage across all of the projects and see how much of your plan you have currently used, choose Manage > Environment Runtimes.

A list of the active runtimes billed to your account is displayed. You can see who created the runtimes, when, and for which service instances, as well as the capacity units that were consumed by the active runtimes at the time you view the list.