Cloudera Impala connection
To access your data in Cloudera Impala, create a connection asset for it.
Cloudera Impala provides SQL queries directly on your Apache Hadoop data stored in HDFS or HBase.
Supported versions: Cloudera Impala 1.3+
Create a connection to Cloudera Impala
To create the connection asset, you need these connection details:
- Database name
- Hostname or IP address
- Port number
- Username and password
- SSL certificate (if required by the database server)
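Before you open the connection form, it can help to collect these values in one place and check that none are missing. The following is a minimal sketch of that idea only, not part of the platform: the class name `ImpalaConnectionDetails`, its field names, the `problems` method, and the sample values (including port 21050, a port commonly used by Impala JDBC/ODBC clients) are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ImpalaConnectionDetails:
    """Hypothetical container for the values the connection form asks for."""
    database: str
    host: str                               # hostname or IP address
    port: int
    username: str
    password: str
    ssl_certificate: Optional[str] = None   # PEM text, only if the server requires SSL

    def problems(self) -> List[str]:
        """Return human-readable problems; an empty list means nothing obvious is missing."""
        found = []
        for name in ("database", "host", "username", "password"):
            if not getattr(self, name).strip():
                found.append(f"{name} is empty")
        if not 1 <= self.port <= 65535:
            found.append(f"port {self.port} is outside 1-65535")
        if self.ssl_certificate is not None and "BEGIN CERTIFICATE" not in self.ssl_certificate:
            found.append("ssl_certificate does not look like a PEM certificate")
        return found


# Sample values are placeholders, not real connection details.
details = ImpalaConnectionDetails(
    database="default",
    host="impala.example.com",
    port=21050,
    username="analyst",
    password="change-me",
)
print(details.problems())  # prints []
```

The check is deliberately shallow: it only confirms that the required values are present and that the port is in a valid range. It does not contact the database server.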
Private connectivity: To connect to a database that is not exposed to the internet (for example, a database behind a firewall), you must set up a secure connection.
Choose the method for creating a connection based on where you are in the platform:
- In a project: Click Assets > New asset > Data access tools > Connection. See Adding a connection to a project.
- In a catalog: Click Add to catalog > Connection. See Adding a connection asset to a catalog.
- In a deployment space: Click Add to space > Connection. See Adding connections to a deployment space.
- In the Platform assets catalog: Click New connection. See Adding platform connections.
Next step: Add data assets from the connection
Where you can use this connection
You can use Cloudera Impala connections in the following workspaces and tools:
- Data Refinery (Watson Studio or Watson Knowledge Catalog)
- Metadata import (Watson Knowledge Catalog)
- SPSS Modeler (Watson Studio)
- Platform assets catalog
- Other catalogs (Watson Knowledge Catalog)
Watson Query service: You can connect to this data source from Watson Query.
Cloudera Impala setup
You can use this connection only for source data. You cannot write data or export data with this connection.
Running SQL statements
To ensure that your SQL statements run correctly, refer to the Impala SQL Language Reference for the correct syntax.
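Because this connection is for source data only, it can be useful to screen a statement before you run it. The sketch below is one possible pre-check, not a platform feature: the function name `is_read_only` and the keyword list are assumptions, and the check looks only at the first keyword of the statement. It is intentionally conservative and does not replace the Impala SQL Language Reference for syntax questions.

```python
import re

# Leading keywords treated as read-only in this sketch. The list is an
# assumption for illustration, not a statement of everything Impala accepts.
READ_ONLY_KEYWORDS = {"SELECT", "SHOW", "DESCRIBE", "EXPLAIN"}


def is_read_only(statement: str) -> bool:
    """Return True if the first keyword of the statement is in READ_ONLY_KEYWORDS."""
    match = re.match(r"\s*([A-Za-z]+)", statement)
    return bool(match) and match.group(1).upper() in READ_ONLY_KEYWORDS


# The table name "sales" is a placeholder.
print(is_read_only("SELECT * FROM sales LIMIT 10"))    # prints True
print(is_read_only("INSERT INTO sales VALUES (1)"))    # prints False
```

A statement that starts with a comment or with any keyword outside the list is reported as not read-only, so the check can reject some valid queries but is unlikely to let a write statement through.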
Parent topic: Supported connections