Connecting to Cloud Object Storage in Watson Query

To create an IBM Cloud Object Storage connection in Watson Query, you must provide specific connection details.

For more information, see Data sources in object storage in Watson Query.

Before you begin

To access data that is stored in Cloud Object Storage, you must create a connection to the data source where the files are located. Watson Query integrates with object storage connections and supports the following data formats: PARQUET (or PARQUETFILE), Optimized Row Columnar (ORC), comma-separated values (CSV), tab-separated values (TSV), and JSON. No other file formats are supported.
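As a quick pre-check before you upload files, the following minimal sketch maps a file name to one of the supported formats. The extension-to-format mapping and the `guess_format` helper are assumptions made for illustration; Watson Query itself might not rely on file extensions to identify a format.

```python
from pathlib import PurePosixPath

# Assumed mapping from file extension to the formats listed above.
# This mapping is illustrative only and is not taken from Watson Query.
SUPPORTED_FORMATS = {
    ".parquet": "PARQUET",
    ".orc": "ORC",
    ".csv": "CSV",
    ".tsv": "TSV",
    ".json": "JSON",
}


def guess_format(object_key):
    """Return the supported format name for an object key, or None."""
    suffix = PurePosixPath(object_key).suffix.lower()
    return SUPPORTED_FORMATS.get(suffix)


# Hypothetical object keys used only to exercise the helper.
for key in ["sales/2023.parquet", "logs/events.JSON", "report.xlsx"]:
    print(key, "->", guess_format(key) or "not supported")
```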

Text-based files in a supported format, such as CSV and TSV, cannot contain column headers. Before you virtualize data, remove the column headers from the files that you store in Cloud Object Storage.
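One way to remove a header row before uploading is sketched below. It assumes that the header occupies exactly the first line of the file and contains no embedded line breaks; the `strip_header` helper and the sample data are hypothetical.

```python
import io


def strip_header(src, dst):
    """Copy the text stream src to dst, skipping the first line."""
    next(src, None)  # discard the header row; no error if the file is empty
    for line in src:
        dst.write(line)


# In-memory sample data; with real files, pass objects returned by open().
source = io.StringIO("id,name\n1,Ada\n2,Grace\n")
target = io.StringIO()
strip_header(source, target)
print(target.getvalue(), end="")
```

With files on disk, the same function can be called with two handles, for example `open("in.csv", newline="")` and `open("out.csv", "w", newline="")`, so that line endings are copied unchanged.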

About this task

For more information, see Getting started with Cloud Object Storage.

Authentication for Watson Query object storage connections works the same way on all supported platforms (Ceph, IBM Cloud Object Storage, and Amazon S3).

Procedure

To get started, create a Cloud Object Storage connection.

  1. On the navigation menu, click Data > Data virtualization. The service menu opens to the Data sources page by default.

  2. Click Add connection > New connection to see a list of data sources that can be added to Watson Query.

  3. Select the Cloud Object Storage data source connection.

  4. Enter the connection name and description.

  5. To configure a Cloud Object Storage connection, you must provide values for the following parameters.
    • Bucket
    • Endpoint URL
    • Access key
    • Secret key
  6. To find the values for the Access key and Secret key, follow these steps.

    Note: The Secret key field is not enabled until you enter an Access key.
    1. Go to the Cloud Object Storage Resource list at https://cloud.ibm.com/resources.
    2. Expand the Storage resource.
    3. Click the Cloud Object Storage service, and then click Service credentials.
    4. Expand the Key name that you want to use. Copy the following values without the quotation marks.
      • Access key: access_key_id
      • Secret key: secret_access_key

    For more information, see Service credentials.

  7. Click Create to add the connection to the data source environment.
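The credential lookup in step 6 can also be done from a copy of the service credential JSON. The following minimal sketch assumes that the credential was created with HMAC keys and that the two values are nested under a `cos_hmac_keys` object; the `connection_parameters` helper, the bucket name, the endpoint URL, and the key values are placeholders for illustration.

```python
import json


def connection_parameters(credential_json, bucket, endpoint_url):
    """Collect the four connection form values from a service credential."""
    credential = json.loads(credential_json)
    # Assumption: HMAC keys are nested under "cos_hmac_keys".
    hmac_keys = credential.get("cos_hmac_keys")
    if not hmac_keys:
        raise ValueError("The credential does not contain HMAC keys.")
    return {
        "Bucket": bucket,
        "Endpoint URL": endpoint_url,
        "Access key": hmac_keys["access_key_id"],
        "Secret key": hmac_keys["secret_access_key"],
    }


# Placeholder credential; real values come from the Service credentials page.
sample = """
{
  "cos_hmac_keys": {
    "access_key_id": "EXAMPLE_ACCESS_KEY",
    "secret_access_key": "EXAMPLE_SECRET_KEY"
  }
}
"""
params = connection_parameters(sample, "my-bucket", "https://cos.example.com")
for field, value in params.items():
    # Avoid printing the secret in real use; it is masked here.
    print(field, "=", "****" if field == "Secret key" else value)
```

Parsing the JSON yields the values without the surrounding quotation marks, which matches what the connection form expects.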

Results

You can now use your Cloud Object Storage connection as a data source in Watson Query.