Getting and preparing data in a project

After you create a project, or join one, the next step is to add data to the project and prepare the data for analysis.

You can add data assets from your local system, from a catalog, from the Gallery, or from connections to data sources.

You can add these types of data assets to a project:

  • Data assets from files on your local system, including structured data, unstructured data, and images. The files are stored in the project's IBM Cloud Object Storage bucket.
  • Connection assets that contain information for connecting to data sources. You can add connections to IBM or third-party data sources. See Supported connections.
  • Connected data assets that specify a table, view, or file that is accessed through a connection to a data source.
  • Connected folder assets that specify a path in IBM Cloud Object Storage.
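The four asset types differ mainly in what the asset stores and where the underlying data lives. The following is a minimal sketch of that distinction in Python. The class, field, and method names are hypothetical and chosen only for illustration; they are not part of any IBM API.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class AssetType(Enum):
    """Hypothetical labels for the asset types listed above."""
    DATA_ASSET = "data asset"              # file uploaded from the local system
    CONNECTION = "connection"              # information for connecting to a data source
    CONNECTED_DATA = "connected data"      # table, view, or file accessed through a connection
    CONNECTED_FOLDER = "connected folder"  # path in IBM Cloud Object Storage


@dataclass(frozen=True)
class ProjectAsset:
    """Illustrative record only; the field names are assumptions."""
    name: str
    asset_type: AssetType
    connection: Optional[str] = None  # name of the connection asset, if one is used
    path: Optional[str] = None        # table, view, file, or folder path, if any

    def stored_in_project_bucket(self) -> bool:
        # Per the list above, only files added from the local system are
        # stored in the project's own Cloud Object Storage bucket.
        return self.asset_type is AssetType.DATA_ASSET

    def needs_connection(self) -> bool:
        # Connected data assets are accessed through a connection.
        return self.asset_type is AssetType.CONNECTED_DATA


uploaded = ProjectAsset("sales.csv", AssetType.DATA_ASSET)
remote = ProjectAsset(
    "orders table",
    AssetType.CONNECTED_DATA,
    connection="warehouse connection",  # hypothetical connection name
    path="SALES.ORDERS",                # hypothetical table path
)
```

In this sketch, `uploaded` is a copy that lives in the project, while `remote` only points at data that stays in its source.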

To get started quickly, choose a learning path and take a tutorial. See Getting started with preparing data.

To see a preview of a data asset's contents and a profile of its textual content, click the asset name to open it.
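To give a rough idea of what a preview and a profile can contain, the sketch below computes both for a small CSV string using only the Python standard library. It is an illustration under stated assumptions, not the product's implementation: the function name `preview_and_profile` and the chosen statistics (non-empty count, distinct count, most common value) are hypothetical, and the profile that the project shows may report different information.

```python
import csv
import io
from collections import Counter


def preview_and_profile(csv_text, preview_rows=5):
    """Return the first rows of a CSV plus simple per-column statistics.

    Hypothetical helper for illustration only.
    """
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    preview = rows[:preview_rows]

    profile = {}
    columns = list(rows[0].keys()) if rows else []
    for column in columns:
        # Skip empty or missing cells when profiling a column.
        values = [row[column] for row in rows if row[column]]
        counts = Counter(values)
        profile[column] = {
            "non_empty": len(values),
            "distinct": len(counts),
            "most_common": counts.most_common(1)[0][0] if counts else None,
        }
    return preview, profile


# Made-up sample data.
sample = "city,country\nParis,FR\nLyon,FR\n,DE\n"
preview, profile = preview_and_profile(sample, preview_rows=2)
```

Here `preview` holds the first two rows as dictionaries, and `profile["country"]` summarizes three non-empty values with two distinct entries.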

To refine data by cleansing and shaping it, first add the data to the project, then open the data asset and click Prepare data.

To curate data, add the data to the project through metadata import and then enrich those data assets.

To transform data, choose New asset > DataStage to create a DataStage flow and subsequent job.

To mask data, choose New asset > Data privacy to create and deliver a masked data set.

To create and manage master data, choose New asset > IBM Match 360 with Watson to create a master data configuration asset to use with IBM Match 360 with Watson.

Learn more