Connecting to data sources in Watson Query
Watson Query supports many relational and nonrelational data sources that you can add to your data source environment. Watson Query connects to relational data sources by using the Java™ Database Connectivity (JDBC) protocol.
Learn how you can connect to your data sources.
- Adding a connection to a data source
- Adding a data source from an existing platform connection
- Accessing data sources by using remote connectors in Watson Query
- Adding platform connections
Before you begin
- If you want to enforce governance for your published objects, set up a governed catalog to publish your assets to. For more information, see Governing virtual data in Watson Query.
- Review the list of data sources that are supported for Watson Query. For more information, see Supported data sources in Watson Query.
- For certain data sources, such as Amazon S3, Ceph®, IBM Cloud Object Storage, Google BigQuery, and Snowflake, you must complete specific steps.
- After a data source is added, any user with virtualize permissions (Watson Query Manager or Engineer roles) can create virtual tables. The user can create virtual tables by using any of the added data sources, no matter which user added the data source. For more information, see Managing users and roles.
- Review limitations and restrictions for data type mapping in Watson Query. For more information, see Supported data sources in Watson Query.
- Review Data source connection access restrictions in Watson Query to plan who you want to be able to access the data source connection and what privileges you want them to have.
Adding a connection to a data source
To add a connection to a data source in your environment, complete the following steps.
On the navigation menu, click Data sources page by default.
. The service menu opens to theClick Watson Query.
to see a list of data sources that can be added toSelect the type of data source that you want to connect to and then click Select.
The type of connection that you create determines the information that you must provide.
Typically, a connection requires a URL, a hostname, and port number.
Specify the required information based on the connection that you selected:
- The connection name and description.
- The name of the database
- The hostname or IP address and port number of the database, which is required to access the connection.
- The username and password that allow access to the
data source.Note: The username and password that is specified here refer to an ID with read-only access to the data source. This user is required for accessing data from the data source and does not necessarily correspond to a Cloud Pak for Data username or a Watson Query user ID.
For some data sources, you can use the Cloud Pak for Data credentials to access the data source. To do so, select the corresponding checkbox.
- Any additional properties required to create the connection.
If you want to use SSL to connect to the database, copy the content of the SSL certificate and paste it in the corresponding box.
- Add collaborators to the data source connection to determine who can access it. On the
Add collaborators and add the connection to a remote connector page, the
creator is listed as a collaborator. Choose from the following options to add additional
collaborators to the connection:
- Select Skip to create the data source connection without any additional collaborators or remote connectors. This means only the creator of the connection can view and use it.
- Select and select the users and user groups that you want to add as collaborators. Any users that you select, and any users that belong to groups that you select, can access the connection
- Select and select the roles that you want to add as collaborators. You can add the Engineer role, the Admin role, or both. Any users that have the roles that you select can access the connection.
For more information about collaborators, see Collaborators.
-
Click Create to add the connection to the data source environment.
- Manage access for the connection to determine what database tasks the
collaborators can perform on the connection.
- On the Data sources page, click the vertical overflow menu () and select Manage access. On the Manage access page, you can see the collaborators and their currently assigned privileges.
- You can add additional collaborators from the Manage access page.
- Apply your changes.
- Transfer ownership of the data source connection. See Transferring ownership of data sources in Watson Query.
Adding a data source from an existing platform connection
To add a data source from an existing platform connection, complete the following steps.
On the navigation menu, click Data sources page by default.
. The service menu opens to the-
Click the Add connection drop-down menu and click Existing platform connection to see a list of data sources that can be added to Watson Query.
-
Select the data source that you want to add and click Add.
- Add collaborators to the data source connection to determine who can access
it. On the Add collaborators and add the connection to a remote connector page,
the creator is listed as a collaborator. Choose from the following options to add additional
collaborators to the connection:
- Select Skip to create the data source connection without any additional collaborators or remote connectors. This means only the creator of the connection can view and use it.
- Select and select the users and user groups that you want to add as collaborators. Any users that you select, and any users that belong to groups that you select, can access the connection
- Select and select the roles that you want to add as collaborators. You can add the Engineer role, the Admin role, or both. Any users that have the roles that you select can access the connection.
For more information about collaborators, see Collaborators.
-
Optional: Select a remote connector to associate to the data source and click Add to connector.
For more information, see Accessing data sources by using remote connectors in Watson Query.
- Click Add to add the connection. Note: When you add data source connections in Watson Query, you might need to refresh twice on the Virtualize page. The first refresh notification is displayed when new data source connections are added. Click Refresh to reload tables, including those from new connections. After tables reload, a second notification appears. Click Refresh again to update your table list with newly loaded tables.
- Manage access for the connection to determine what database tasks the
collaborators can perform on the connection.
- On the Data sources page, click the vertical overflow menu () and select Manage access. On the Manage access page, you can see the collaborators and their currently assigned privileges.
- You can add additional collaborators from the Manage access page.
- Apply your changes.
- Transfer ownership of the data source connection. See Transferring ownership of data sources in Watson Query.
Learn more