Some service plans charge for compute usage and other types of resource usage. If you are the IBM Cloud account owner or administrator, you can monitor resource usage to ensure that plan limits are not exceeded.
For Lite plans, you cannot exceed the limits of the plan. You must wait until the start of your next billing month to use resources that are calculated monthly. Alternatively, you can upgrade to a paid plan.
For most paid plans, you pay for the resources that the service's tools and processes consume each month.
To see the costs of your plan, log in to IBM Cloud, open your service instance from your IBM Cloud dashboard, and click Plan.
- Capacity unit hours (CUH) for compute usage
- Resource units for foundation model inferencing
- Monitor monthly billing
Capacity unit hours (CUH) for compute usage
Many tools consume compute resources, and that usage is measured in capacity unit hours (CUH). A capacity unit hour is a specific amount of compute capability with a set cost.
What is measured for CUH consumption?
Resources, whether measured in capacity unit hours (CUH) or resource units (RU), are consumed for running assets, not for working in tools. That is, there is no consumption charge for defining an experiment in AutoAI, but there is a charge for running the experiment to train the experiment pipelines. Similarly, there is no charge for creating a deployment space or defining a deployment job, but there is a charge for running a deployment job or inferencing against a deployed asset. Assets that run continuously, such as Jupyter notebooks, RStudio assets, and Bash scripts, consume resources for as long as they are active.
How compute usage is calculated
Different types of processes and different levels of compute power are billed at different rates of capacity units per hour. For example, the hourly rate for a data profiling process is 6 capacity units.
Compute usage for watsonx.ai Studio is charged by the minute, with a minimum charge of 10 minutes (0.16 hours). Compute usage for watsonx.ai Runtime is charged by the minute with a minimum charge of one minute.
Compute usage is calculated by adding the minimum number of minutes billed for each process plus the number of minutes the process runs beyond the minimum minutes, then multiplying the total by the capacity unit rate for the process.
The following table shows examples of how the billed CUH is calculated.
Rate | Usage time | Calculation | Total CUH billed |
---|---|---|---|
1 CUH/hour | 1 hour | 1 hour * 1 CUH/hour | 1 CUH |
2 CUH/hour | 45 minutes | 0.75 hours * 2 CUH/hour | 1.5 CUH |
6 CUH/hour | 5 minutes | 0.16 hours * 6 CUH/hour | 0.96 CUH. The minimum charge for watsonx.ai Studio applies. |
6 CUH/hour | 30 minutes | 0.5 hours * 6 CUH/hour | 3 CUH |
6 CUH/hour | 1 hour | 1 hour * 6 CUH/hour | 6 CUH |
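The calculation described above can be sketched in Python. This is a minimal illustration, not an official billing formula: the function name `billed_cuh` and the `MIN_BILLED_MINUTES` mapping are hypothetical, and rounding partial minutes up to the next whole minute is an assumption. The sketch uses exact minutes, so the 10-minute minimum at 6 CUH/hour yields 1.0 CUH, whereas the table rounds 10 minutes down to 0.16 hours and shows 0.96 CUH.

```python
import math

# Minimum billed minutes per service, as stated in the text above.
MIN_BILLED_MINUTES = {
    "watsonx.ai Studio": 10,
    "watsonx.ai Runtime": 1,
}


def billed_cuh(rate_cuh_per_hour, minutes_used, service="watsonx.ai Studio"):
    """Estimate the CUH billed for one process run."""
    # Assumption: a partial minute is billed as a whole minute.
    whole_minutes = math.ceil(minutes_used)
    # Apply the service's minimum charge.
    billed_minutes = max(whole_minutes, MIN_BILLED_MINUTES[service])
    # Convert billed minutes to hours and multiply by the hourly CUH rate.
    return billed_minutes * rate_cuh_per_hour / 60


print(billed_cuh(2, 45))                                # 45 minutes at 2 CUH/hour
print(billed_cuh(6, 5))                                 # Studio minimum of 10 minutes applies
print(billed_cuh(6, 5, service="watsonx.ai Runtime"))   # Runtime minimum is 1 minute
```

Keeping the minimums in a lookup table makes it easy to compare how the same short run would be billed under each service.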
Processes that consume capacity unit hours
Some types of processes, such as AutoAI and Federated Learning, have a single compute rate for the runtime. However, with many tools you have a choice of compute resources for the runtime. The notebook editor, Data Refinery, SPSS Modeler, and other tools have different rates that reflect the memory and compute power for the environment. Environments with more memory and compute power consume capacity unit hours at a higher rate.
This table shows each process that consumes CUH, the workspace where it runs, the service that CUH is billed against, and whether you can choose from more than one environment. Follow the links to view the available CUH rates for each process.
Tool or Process | Workspace | Service that provides CUH | Multiple CUH rates? |
---|---|---|---|
Notebook editor | Project | watsonx.ai Studio, Analytics Engine (Spark) | Multiple rates |
Invoking the machine learning API from a notebook | Project | watsonx.ai Runtime | Multiple rates |
Data Refinery | Project | watsonx.ai Studio | Multiple rates |
SPSS Modeler | Project | watsonx.ai Studio | Multiple rates |
RStudio IDE | Project | watsonx.ai Studio | Multiple rates |
AutoAI experiments | Project | watsonx.ai Runtime | Multiple rates |
Decision Optimization experiments | Spaces | watsonx.ai Runtime | Multiple rates |
Running deployments | Spaces | watsonx.ai Runtime | Multiple rates |
Profiling | Project | watsonx.ai Studio | One rate |
Synthetic Data Generator | Project | watsonx.ai Studio | One rate |
Orchestration Pipelines | Project and Space | watsonx.ai Studio | Multiple rates |
Monitoring compute usage
You can monitor compute usage for all services at the account level. To view the monthly CUH usage for a service, open the service instance from your IBM Cloud dashboard and click Plan.
You can also monitor compute usage in a project on the Environments page on the Manage tab.
To see the total amount of capacity unit hours that are used and that are remaining for watsonx.ai Studio and watsonx.ai Runtime, look at the Environment Runtimes page. From the navigation menu, select Administration > Environment runtimes. The Environment Runtimes page shows details of the CUH used by environments. You can calculate the amount of CUH you use for data flows and profiling by subtracting the amount used by environments from the total amount used.
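The subtraction described above can be written as a short Python helper. The function name and the sample figures are hypothetical and only illustrate the arithmetic.

```python
def non_environment_cuh(total_cuh_used, environment_cuh_used):
    """CUH used by data flows and profiling: total minus environment usage."""
    if environment_cuh_used > total_cuh_used:
        raise ValueError("environment usage cannot exceed total usage")
    return total_cuh_used - environment_cuh_used


# Hypothetical figures read from the plan and Environment Runtimes pages.
print(non_environment_cuh(120.0, 95.5))
```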
Resource units for foundation model inferencing
Calling a foundation model to generate output in response to a prompt is known as inferencing. Foundation model inferencing is measured in resource units (RU). Each RU equals 1,000 tokens. A token is a basic unit of text (typically 4 characters or 0.75 words) used in the input or output for a foundation model prompt. For details on tokens, see Tokens.
Resource unit billing is based on the rate of the foundation model class multiplied by the number of tokens. Foundation models are classified into three classes. See Resource unit metering.
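As a rough sketch of the arithmetic, the following Python converts token counts to resource units and applies a per-RU rate. The function names and the rate value are hypothetical; the actual rate for each model class is listed in Resource unit metering. The sketch also assumes that input and output tokens are both counted toward the total.

```python
TOKENS_PER_RU = 1000  # From the text: each RU equals 1,000 tokens.


def resource_units(input_tokens, output_tokens):
    """Convert a prompt's token counts to resource units (RU)."""
    # Assumption: input and output tokens both count toward usage.
    return (input_tokens + output_tokens) / TOKENS_PER_RU


def estimated_charge(input_tokens, output_tokens, rate_per_ru):
    """Multiply RU consumed by the rate for the model's class."""
    return resource_units(input_tokens, output_tokens) * rate_per_ru


# Hypothetical rate; look up the real rate for the model class you use.
HYPOTHETICAL_RATE_PER_RU = 0.5

print(resource_units(1500, 1000))
print(estimated_charge(1500, 1000, HYPOTHETICAL_RATE_PER_RU))
```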
Monitoring token usage for foundation model inferencing
You can monitor foundation model token usage in a project on the Environments page on the Manage tab.
Monitor monthly billing
You must be an IBM Cloud account owner or administrator to see resource usage information.
To view a summary of your monthly billing, from the navigation menu, choose Administration > Account and billing > Billing and usage. The IBM Cloud usage dashboard opens. To view the usage for each service, in the Usage summary section, click View usage.
Learn more
- Choosing compute resources for running tools in projects
- Upgrade services
- Environments compute usage
- watsonx.ai Studio offering plans
- watsonx.ai Runtime plans and compute usage