Getting started with preparing data
To get started with preparing, transforming, and integrating data, understand the overall workflow, choose a tutorial, and check out other learning resources for working in Cloud Pak for Data as a Service.
Overview of the data preparation workflow
Prerequisite Sign up for or join a Cloud Pak for Data as a Service account
Your data preparation workflow has these basic steps:
-
Create a project.
-
If necessary, create the service instance that provides the tool you want to use and associate it with the project.
- Add data to your project. You can add data files from your local system, data from a remote data source that you connect to, data from a catalog, or sample data from the Gallery.
-
Choose a tool to analyze your data. Each of the tutorials describes a tool.
-
Run or schedule a job to prepare your data.
Tutorials
Each of these tutorials provides a description of the tool, a video, the instructions, and additional learning resources:
Tutorial | Description | Expertise for tutorial |
---|---|---|
Refine and visualize data with Data Refinery | Prepare and visualize tabular data with a graphical flow editor. | Select operations to manipulate data. |
Transform data with DataStage | Design a data integration flow to filter and sort tables with a graphical flow editor. | Drop data and operation nodes on a canvas and select properties. |
Virtualize data | Create a virtualized table by joining two tables. | Select tables and connect the primary key columns. |
Additional resources
Guided tutorials
Click Take a guided tutorial on the Cloud Pak for Data as a Service home page. After you create the sample project, choose Explore and prepare data to cleanse and visualize data.
Documentation
Videos
- A comprehensive set of videos that show many common data preparation tasks in Cloud Pak for Data as a Service.
Samples
- Gallery of samples provides sample notebooks, data sets, and projects that you can import.
Training
- Watson Studio Methodology is an IBM Training e-Learning course that provides an in-depth look at Watson Studio.
- Take control of your data with Watson Studio is a learning path that consists of step-by-step tutorials that explain the process of working with data using Watson Studio.
Parent topic: Getting started