0 / 0
Quick start: Virtualize data

Quick start: Virtualize data

You can use Data Virtualization to create a virtual table to segment or combine data from one or more tables. Data Virtualization connects multiple data sources into a single self-balancing collection of data sources or databases. Read about the Data Virtualization tool, then watch a video and take a tutorial that’s suitable for users with some knowledge of virtualizing data, but does not require coding.

Required service
Data Virtualization
Optional services
Watson Studio
IBM Knowledge Catalog

Your basic workflow includes these tasks:

  1. Provision the service and create your service credentials.
  2. Create databases in multiple data sources and collect database details and credentials.
  3. Add connections to your data sources.
  4. Create virtual objects by combining data from all your data sources.
  5. Manage access to your virtual objects.
  6. Add vitualized data to your catalogs and projects.
  7. Monitor your service instance with IBM Db2 Data Management Console.

Read about Data Virtualization

With the Data Virtualization service, you can connect to multiple data sources, create and govern virtual assets, and consume the virtualized data.

Read more about Data Virtualization

Watch a video about Data Virtualization

Watch Video Watch this video to see how to virtualize data to a project or catalog using the Data Virtualization service.

This video provides a visual method to learn the concepts and tasks in this documentation.


Try a tutorial to virtualize data

In this tutorial, you will complete these tasks:

This tutorial will take approximately 30 minutes to complete.





Tips for completing this tutorial Here are some tips for successfully completing this tutorial.

Use the video picture-in-picture

Tip: Start the video, then as you scroll through the tutorial, the video moves to picture-in-picture mode. Close the video table of contents for the best experience with picture-in-picture. You can use picture-in-picture mode so you can follow the video as you complete the tasks in this tutorial. Click the timestamps for each task to follow along.

The following animated image shows how to use the video picture-in-picture and table of contents features:

How to use picture-in-picture and chapters

Get help in the community

If you need help with this tutorial, you can ask a question or find an answer in the Cloud Pak for Data Community discussion forum.

Set up your browser windows

For the optimal experience completing this tutorial, open Cloud Pak for Data in one browser window, and keep this tutorial page open in another browser window to switch easily between the two applications. Consider arranging the two browser windows side-by-side to make it easier to follow along.

Side-by-side tutorial and UI

Tip: If you encounter a guided tour while completing this tutorial in the user interface, click Maybe later.



Task 1: Open a project

preview tutorial video To preview this task, watch the video beginning at 00:10.

You need a project to store the virtualized data. Follow these steps to open an existing project or create a new project.

  1. From the Cloud Pak for Data navigation menu Navigation menu, choose Projects > View all projects

  2. If you have an existing project, open it.

  3. If you don't have an existing project, then click New project.

  4. Select Create an empty project.

  5. Enter a name and optional description for the project.

  6. Choose an existing object storage service instance or create a new one.

  7. Click Create.

For more information or to watch a video, see Creating a project.

Checkpoint icon Check your progress

The following image shows a new, empty project.

The following image shows a new, empty project.




Task 2: Provision the required services

preview tutorial video To preview this task, watch the video beginning at 00:32.

This tutorial requires the Data Virtualization service, and optional services Watson Studio and IBM Knowledge Catalog. Follow these steps to create these services:

  1. From the Navigation Menu Navigation menu, click Services > Service instances.

  2. If you have a Data Virtualization service listed, then there is no need to provision another instance. Otherwise, follow these steps:

    1. Click Add service.

    2. Select Data Virtualization.

    3. Select the Lite plan for Data Virtualization.

    4. Click Create.

  3. Verify that the services are provisioned on your Service instances page.

For more information, see Data Virtualization on Cloud Pak for Data as a Service.

Checkpoint icon Check your progress

The following image shows the provisioned services.

The following image shows the provisioned services.




Task 3: Add a connection to the Db2 Warehouse data source

preview tutorial video To preview this task, watch the video beginning at 00:58.

Before you can virtualize the data, you need create a connection to the data source. Follow these steps to create a connection in Data Virtualization:

  1. From the Navigation Menu Navigation menu, select Data > Data virtualization. The list of configured Data sources displays.

  2. Click Add connection > New connection.

  3. Select Db2 Warehouse on Cloud, and click Select.

  4. Complete the connection details using the following information:

    • Name: Db2 Warehouse
    • Database: BLUDB
    • Hostname or IP address: db2w-ruggyab.us-south.db2w.cloud.ibm.com
    • Port: 50001
    • Username: CPDEMO
    • Password: DataFabric@2022IBM
    • Select the Port is SSL-enabled checkbox.
  5. Click Test.

  6. Click Create.

For more information, see Adding and connecting to data sources in Data Virtualization.

Checkpoint icon Check your progress

The following image shows the Data Sources page.

The following image shows the Data Sources page.




Task 4: Add tables to your virtualized data

preview tutorial video To preview this task, watch the video beginning at 01:45.

With the connection defined, you can virtualize data from that data source. Follow these steps to add the tables to your virtualized data.

  1. From the Data Virtualization menu, select Virtualization > Virtualize, and wait for the available tables to load.

  2. Locate and select the customers and sales tables from the list, and click Add to cart.

  3. Click View cart.

  4. Clear the Assign to project field. This will add the two tables to your list of virtualized data, but not add them to a project. Later, you will add virtualized data to your project.

  5. Click Virtualize.

  6. Click Confirm.

  7. Click Go to virtualized data.

For more information, see Creating virtual objects in Data Virtualization.

Checkpoint icon Check your progress

The following image shows the My virtualized data page.

The following image shows the My virtualized data page.




Task 5: Publish virtualized data to a catalog and project

preview tutorial video To preview this task, watch the video beginning at 02:43.

Next, follow these steps to join two tables to create a virtualized asset and publish that to a catalog and project:

  1. On the Virtualized data screen, select the customers and sales tables from the list, and click Join.

  2. For each table, search for salesrep.

  3. Connect the SALESREP_ID columns in the two tables.

  4. Click Next.

  5. Review the joined table, and click Next.

  6. For the view name, type joined_customers_sales_table.

  7. Select a project from the list.

  8. Check the Publish to catalog option, and select a catalog.

  9. Click Create view.

  10. When the process completes, you can either view the project or the catalog to preview the virtualized data. You will need an IBM Cloud API key to view the data in the project or catalog. See Creating an IBM Cloud API key.

For more information, see Governing virtual data in Data Virtualization.

Checkpoint icon Check your progress

The following image shows the virtualized data asset in the catalog.

The following image shows the virtualized data asset in the catalog.



Next steps

Now your virtual data is ready to be used. For example, you can do any of these tasks:

Additional resources

Parent topic: Quick start tutorials

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more