Getting and preparing data in a project

After you create or join a project, the next step is to add data to the project and prepare that data for analysis.

You can add data assets from your local system, from a catalog, from the Gallery, or from connections to data sources.

You can add these types of data assets to a project:

  • Data assets created from files on your local system, including structured data, unstructured data, and images. The files are stored in the project’s IBM Cloud Object Storage bucket.
  • Connection assets that contain information for connecting to data sources. You can add connections to IBM or third-party data sources.
  • Connected data assets that specify a table, view, or file that is accessed through a connection to a data source.
  • Folder assets that specify a path in IBM Cloud Object Storage.
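
The difference between these asset types comes down to what each one stores and where the data actually lives. The following minimal Python sketch models that distinction. It is illustrative only: the class names, field names, and the `cos://` notation are hypothetical and are not part of any product API.

```python
from dataclasses import dataclass


@dataclass
class FileDataAsset:
    """Hypothetical model: an uploaded file kept in the project's bucket."""
    name: str
    bucket: str
    object_key: str


@dataclass
class ConnectionAsset:
    """Hypothetical model: holds connection details only, no data."""
    name: str
    host: str


@dataclass
class ConnectedDataAsset:
    """Hypothetical model: one table, view, or file reached through a connection."""
    name: str
    connection: ConnectionAsset
    path: str


@dataclass
class FolderAsset:
    """Hypothetical model: a path inside an object storage bucket."""
    name: str
    bucket: str
    path: str


def locate(asset) -> str:
    """Return an illustrative string showing where the asset's data lives."""
    if isinstance(asset, FileDataAsset):
        return f"cos://{asset.bucket}/{asset.object_key}"
    if isinstance(asset, FolderAsset):
        return f"cos://{asset.bucket}/{asset.path}"
    if isinstance(asset, ConnectedDataAsset):
        # The data stays in the remote source; the asset only points at it.
        return f"{asset.connection.name}:{asset.path}"
    if isinstance(asset, ConnectionAsset):
        # A connection has no data path of its own, only an endpoint.
        return asset.host
    raise TypeError(f"unsupported asset type: {type(asset).__name__}")


warehouse = ConnectionAsset(name="warehouse", host="db.example.com")
orders = ConnectedDataAsset(name="orders", connection=warehouse, path="SALES.ORDERS")
print(locate(orders))
```

In this sketch, a connected data asset always refers to a connection, which mirrors the description above: the connection carries the access details, and the connected data asset names the specific table, view, or file.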

For each data asset, you can see a preview of its contents and a profile of its textual content.

If you plan to refine data by cleansing and shaping it, first add the data to the project, then open the data asset and click Refine.

If you plan to create and manage master data, choose Add to project > MDM configuration to create a master data configuration asset.

If you plan to ingest and process streaming data, choose Add to project > Streams flow to create connections and streams flows.

If you plan to transform data, choose Add to project > DataStage flow to create a DataStage flow and subsequent job.

