Integrating with Google Cloud Platform
You can configure an integration with the Google Cloud Platform (GCP) to allow Cloud Pak for Data as a Service users to access data sources from GCP. Before proceeding, make sure you have proper permissions.
After configuring an integration, you’ll see it under Service instances. For example, you’ll see a new GCP tab that lists your BigQuery datasets and Storage buckets.
To configure an integration with GCP:
- Log on to the Google Cloud Platform at https://console.cloud.google.com.
- Go to IAM & Admin > Service Accounts.
Open your project and then click CREATE SERVICE ACCOUNT.
Specify a name and description for the new service account and click CREATE. Specify other options as desired and click DONE.
Click the actions menu next to the service instance and select Create key. For key type, select JSON and then click CREATE. The JSON key file will be downloaded to your machine.
Important! Store the key file in a secure location.
In Cloud Pak for Data as a Service, under Administrator > Cloud integrations, go to the GCP tab, enable integration, and then paste the contents from the JSON key file into the text field. Only certain properties from the JSON will be stored, and the
private_keyproperty will be encrypted.
Go back to Google Cloud Platform and edit the service account you created previously. Add the following roles:
Now you’ll see BigQuery datasets and Storage buckets under Service instances, and they’ll also be available as sources for creating new connections:
You can now preview data from GCP data sources, and access GCP data in Notebooks, Data Refinery, SPSS Modeler, and other tools in projects and in catalogs. You can see your GCP instances under Service instances > GCP.