Integrating with AWS

You can configure an integration with the Amazon Web Services (AWS) platform to allow Cloud Pak for Data as a Service users to access data sources from AWS. Before proceeding, make sure you have proper permissions (you’ll need to be able to create certain services and create credentials in the account, for example).

After configuring an integration, you’ll see it under Service instances. For example, you’ll see a new AWS tab that lists your instances of Redshift and S3.

To configure an integration with AWS:

  1. Log on to the AWS Console at https://aws.amazon.com/console/.

  2. From the account drop-down at the upper-right, select My Security Credentials.
  3. Under Access keys for CLI, SDK, & API access, click Create access key.
  4. Copy the key ID and secret.

    Important! Write down your key ID and secret and store them in a safe place.

  5. In Cloud Pak for Data as a Service, under Administrator > Cloud integrations, go to the AWS tab, enable integration, and then paste the access key ID and access key secret in the appropriate fields.

Configuring firewall access

You must also configure access so Cloud Pak for Data as a Service can access data through the firewall.

For AWS Redshift firewall:

  1. Navigate to your provisioned Amazon Redshift instance.
  2. Go to Properties and then scroll down to Network and Security.
  3. Click the VPC security group.

    AWS VPC security group

  4. Select the active/default security group.

    AWS active security group

  5. Under Inbound rules, copy the IP ranges.

    AWS IP ranges

  6. Open Cloud Pak for Data as a Service and go to Adminster > Cloud Integrations. Under Firewall integration, paste the IP addresses. Use CIDR notation for AWS ranges.

You can now create connections, preview data from AWS data sources, and access AWS data in Notebooks, Data Refinery, SPSS Modeler, and other tools in projects and in catalogs. You can see your AWS instances under Service instances > AWS.

Next steps