Asset types and properties
You create content, in the form of assets, when you work with tools in collaborative workspaces. An asset is an item that contains information about a data set, a model, or another item that works with data.
You add assets by importing them or creating them with tools. You work with assets in collaborative workspaces. The workspace that you use depends on your tasks.
-
Projects Where you collaborate with others to work with data and create assets. Most tools are in projects and you run assets that contain code in projects. For example, you can import data, prepare data, analyze data, or create models in projects. See Projects.
-
Deployment spaces Where you deploy and run assets that are ready for testing or production. You move assets from projects into deployment spaces and then create deployments from those assets. You monitor and update deployments as necessary. See Deployment spaces.
You can create many different types of assets, but all assets have some common properties:
Asset types
To create most types of assets, you must use a specific tool.
The following table lists the types of assets that you can create, the tools you need to create them, and the workspaces where you can add them.
Asset type | Description | Tools to create it | Workspaces |
---|---|---|---|
AutoAI experiment | Automatically generates candidate predictive model pipelines. | AutoAI | Projects |
Connected data asset | Represents data that is accessed through a connection to a remote data source. | Connected data tool | Projects, Spaces |
Connection | Contains the information to connect to a data source. | Connection tool | Projects, Spaces |
Data asset from a file | Represents a file that you uploaded from your local system. | Upload pane | Projects, Spaces |
Data Refinery flow | Prepares data. | Data Refinery | Projects, Spaces |
Decision Optimization | Solves optimization problems | Decision Optimization | Projects |
Federated Learning experiment | Trains a common model on a set of remote data sources. | Federated Learning | Projects |
Folder asset | Represents a folder in IBM Cloud Object Storage. | Connected data tool | Projects, Spaces |
Jupyter notebook | Runs Python or R code to analyze data or build models. | Jupyter notebook editor, AutoAI, Prompt Lab | Projects |
Model | Contains information about a saved or imported model. | Various tools that run experiments or train models | Projects, Spaces |
Model use case | Tracks the lifecycle of a model from request to production. | watsonx.governance | Inventory |
Pipeline | Automates the model lifecycle. | Orchestration Pipelines | Projects |
Prompt template | A single prompt. | Prompt Lab | Projects |
Prompt session | The history of a working session in the Prompt Lab. | Prompt Lab | Projects |
Python function | Contains Python code to support a model in production. | Jupyter notebook editor | Projects, Spaces |
Script | Contains a Python or R script to support a model in production. | Jupyter notebook editor, RStudio | Projects, Spaces |
SPSS Modeler flow | Runs a flow to prepare data and build a model. | SPSS Modeler | Projects |
Visualization | Shows visualizations from a data asset. | Visualization page in data assets | Projects |
Synthetic data flow | Generates synthetic tabular data. | Synthetic Data Generator | Projects |
Tuned model | A tuned foundation model. | Tuning Studio | Projects |
Tuning experiment | A tuning experiment that builds a tuned foundation model. | Tuning Studio | Projects |
Vector index | An index that stores contextual information for foundation model prompts. | Connected data tool | Projects |
Common properties for assets
Assets accumulate information in properties when you create them, use them, or when they are updated by automated processes. Some properties are provided by users and can be edited by users. Other properties are automatically provided by the system. Most system-provided properties can't be edited by users.
The Last modified field for an asset tracks both user actions and system actions. System actions often occur in the background and might involve only changes to the asset's internal metadata.
Common properties for assets everywhere
Most types of assets have the properties that are listed in the following table in all the workspaces where those asset types exist.
Property | Description | Editable? |
---|---|---|
Name | The asset name. Can contain up to 255 characters. Supports multibyte characters. Cannot be empty, contain Unicode control characters, or contain only blank spaces. Asset names do not need to be unique within a project or deployment space. | Yes |
Description | Optional. Supports multibyte characters and hyperlinks. | Yes |
Creation date | The timestamp of when the asset was created or imported. | No |
Creator or Owner | The username or email address of the person who created or imported the asset. | No |
Last modified date | The timestamp of when the asset was last modified. | No |
Last editor | The username or email address of the person who last modified the asset. | No |
Common properties for assets that run in tools
Some assets are associated with running a tool. For example, an AutoAI experiment asset runs in the AutoAI tool. Assets that run in tools are also known as operational assets. Every time that you run assets in tools, you start a job. You can monitor and schedule jobs. Jobs use compute resources. Compute resources are measured in capacity unit hours (CUH) and are tracked. Depending on your service plans, you can have a limited amount of CUH per month, or pay for the CUH that you use every month.
For many assets that run in tools, you have a choice of the compute environment configuration to use. Typically, larger and faster environment configurations consume compute resources faster.
In addition to basic properties, most assets that run in tools contain the following types of information in projects:
Properties | Description | Editable? | Workspaces |
---|---|---|---|
Environment definition | The environment template, hardware specification, and software specification for running the asset. See Environments. | Yes | Projects, Spaces |
Settings | Information that defines how the asset is run. Specific to each type of asset. | Yes | Projects |
Associated data assets | The data that the asset is working on. | Yes | Projects |
Jobs | Information about how to run the asset, including the environment definition, schedule, and notification options. See Jobs. | Yes | Projects, Spaces |
Data asset types and their properties
Data asset types contain metadata and other information about data, including how to access the data.
How you create a data asset depends on where your data is:
- If your data is in a file, you upload the file from your local system to a project or deployment space.
- If your data is in a remote data source, you first create a connection asset that defines the connection to that data source. Then, you create a data asset by selecting the connection, the path or other structure, and the table or file that contains the data. This type of data asset is called a connected data asset.
The following graphic illustrates how data assets from files point to uploaded files in Cloud Object Storage. Connected data assets require a connection asset and point to data in a remote data source.
You can create the following types of data assets in a project or deployment space:
-
Data asset from a file Represents a file that you uploaded from your local system. The file is stored in the object storage container on the IBM Cloud Object Storage instance that is associated with the workspace. The contents of the file can include structured data, unstructured textual data, images, and other types of data. You can create a data asset with a file of any format. However, you can do more actions on CSV files than other file types. See Properties of data assets.
You can create a data asset from a file by uploading a file in a workspace. You can also create data files with tools and convert them to assets. For example, you can create data assets from files with the Data Refinery, Jupyter notebook, and RStudio tools.
-
Connected data asset Represents a table, file, or folder that is accessed through a connection to a remote data source. The connection is defined in the connection asset that is associated with the connected data asset. You can create a connected data asset for every supported connection. When you access a connected data asset, the data is dynamically retrieved from the data source. See Properties of data assets.
You can import connected data assets from a data source with the connected data tool in a workspace.
-
Folder asset Represents a folder in IBM Cloud Object Storage. A folder data asset is special case of a connected data asset. You create a folder data asset by specifying the path to the folder and the IBM Cloud Object Storage connection asset. You can view the files and subfolders that share the path with the folder data asset. The files that you can view within the folder data asset are not themselves data assets. For example, you can create a folder data asset for a path that contains news feeds that are continuously updated. See Properties of data assets.
You can import folder assets from IBM Cloud Object Storage with the connected data tool in a workspace.
-
Connection asset Contains the information necessary to create a connection to a data source. See Properties of connection assets.
You can create connections with the connection tool in a workspace.
Learn more about creating and importing data assets:
Properties of data assets from files and connected data assets
In addition to basic properties, data assets from files and connected data assets have the properties or pages that are listed in the following table.
Property or page | Description | Editable? | Workspaces |
---|---|---|---|
Tags | Optional. Text labels that users create to simplify searching. A tag consists of one string of up to 255 characters. It can contain spaces, letters, numbers, underscores, dashes, and the symbols # and @. | Yes | Projects |
Format | The MIME type of a file. Automatically detected. | Yes | Projects, Spaces |
Source | Information about the data file in storage or the data source and connection. | No | Projects, Spaces |
Asset details | Information about the size of the data, the number of columns and rows, and the asset version. | No | Projects, Spaces |
Preview asset | A preview of the data that includes a limited set of columns and rows from the original data source. See Asset contents or previews. | No | Projects, Spaces |
Profile page | Metadata and statistics about the content of the data. See Profile. | Yes | Projects |
Visualizations page | Charts and graphs that users create to understand the data. See Visualizations. | Yes | Projects |
Feature group page | Information about which columns in the data asset are used as features in models. See Managing feature groups. | Yes | Projects, Spaces |
Properties of connection assets
The properties of connection assets depend on the data source that you select when you create a connection. See Connection types. Connection assets for most data sources have the properties that are listed in the following table.
Properties | Description | Editable? | Workspaces |
---|---|---|---|
Connection details | The information that identifies the data source. For example, the database name, hostname, IP address, port, instance ID, bucket, endpoint URL, and so on. | Yes | Projects, Spaces |
Credential setting | Whether the credentials are shared across the platform (default) or each user must enter their personal credentials. Not all data sources support personal credentials. | Yes | Projects, Spaces |
Authentication method | The format of the credentials information. For example, an API key or a username and password. | Yes | Projects, Spaces |
Credentials | The username and password, API key, or other credentials, as required by the data source and the specified authentication method. | Yes | Projects, Spaces |
Certificates | Whether the data source port is configured to accept SSL connections and other information about the SSL certificate. | Yes | Projects, Spaces |
Private connectivity | The method to connect to a database that is not externalized to the internet. See Connecting to data behind a firewall. | Yes | Projects, Spaces |
Learn more
Parent topic: Overview of IBM watsonx.ai