Getting started with preparing data

To get started with preparing, transforming, and integrating data, understand the overall workflow, choose a tutorial, and check out other learning resources for working in Cloud Pak for Data as a Service.

Overview of the data preparation workflow

Prerequisite: Sign up for or join a Cloud Pak for Data as a Service account.

Your data preparation workflow has these basic steps:

  1. Create a project.

  2. If necessary, create the service instance that provides the tool you want to use and associate it with the project.

  3. Add data to your project. You can add data files from your local system, data from a remote data source that you connect to, data from a catalog, or sample data from the Gallery.

  4. Choose a tool to analyze your data. Each of the tutorials describes a tool.

  5. Run or schedule a job to prepare your data.

Tutorials

Each of these tutorials provides a description of the tool, a video, the instructions, and additional learning resources:

Tutorial | Description | Expertise for tutorial
Refine and visualize data with Data Refinery | Prepare and visualize tabular data with a graphical flow editor. | Select operations to manipulate data.
Transform data with DataStage | Design a data integration flow to filter and sort tables with a graphical flow editor. | Drop data and operation nodes on a canvas and select properties.
Virtualize data | Create a virtualized table by joining two tables. | Select tables and connect the primary key columns.

Additional resources

Guided tutorials

Click Take a guided tutorial on the Cloud Pak for Data as a Service home page. After you create the sample project, choose Explore and prepare data to cleanse and visualize data.

Documentation

Videos

Samples

  • Gallery of samples provides sample notebooks, data sets, and projects that you can import.

Training
