0 / 0
Adding data to a project
Adding data to a project

Adding data to a project

After you create a project, the next step is to add data assets to it so that you can work with data. All the collaborators in the project are automatically authorized to access the data in the project.

Assets can have duplicate names, however, you can't add the same asset multiple times with the same name.

You can add these types of data assets to projects:

Add local files

You can add a file as a data asset from your local system to your project. You must have the Editor or Admin role in the project. The maximum size for files that you can load with the Watson Studio UI is 5 GB. You can load larger files to a project with APIs.

Important You can't add executable files to a project. All other types files that you add to a project are not checked for malicious code. You must ensure that your files do not contain malware or other types of malicious software that other collaborators might download.

To add data files to a project:

  1. From your project's Assets page, click the Upload asset to project icon (Shows the find data icon.). You can also click the Find and add data icon (Shows the find data icon.) from within a notebook or canvas.

  2. In the Data pane that opens, browse for the files or drag them onto the pane. You must stay on the page until the load is complete. You can cancel an ongoing load process if you want to stop loading a file.

The files are saved in the object storage that is associated with your project and are listed as data assets on the Assets page of your project.

When you click the data asset name, you can see this information about data assets from files:

  • The asset name and description
  • The tags for the asset
  • The name of the person who created the asset
  • The size of the data
  • The date when the asset was added to the project
  • The date when the asset was last modified
  • A preview of the data, for CSV, Avro, Parquet, TSV, Microsoft Excel, PDF, text, JSON, and image files
  • A profile of the data, for CSV, Avro, Parquet, Microsoft Word, PDF, text, and HTML files

You can update the contents of a data asset from a file by adding a file with the same name and format to the project and then choosing to replace the existing data asset.

You can remove the data asset by choosing the Remove option from the action menu next to the asset name. Choose the Refine option to refine the data with Data Refinery.

Add Gallery data sets

You can add data sets from the Gallery to your project:

  1. In the Gallery, find the card for the data set that you want to add.
  2. Click the Add to Project icon from the action bar, select the project, and click Add.

Watch this short video to see how to load and analyze public data sets.

This video provides a visual method as an alternative to following the written steps in this documentation.

Add files from the project storage

The storage for the project contains the files you uploaded to the project, but it can also contain other files. For example, you can save a DataFrame in a notebook in the project storage. You can add those files as data assets to your project.

To add data assets from the project storage:

  1. From the Assets tab of your project, click Add asset.
  2. Select the asset and click Add.

Next steps