Last updated: Aug 16, 2024
To access your data in Apache Impala, create a connection asset for it.
Apache Impala provides high-performance, low-latency SQL queries on data that is stored in popular Apache Hadoop file formats.
Supported versions
Apache Impala 1.3+
Create a connection to Apache Impala
To create the connection asset, you need these connection details:
- Database (optional): If you do not enter a database name, you must enter the catalog name, schema name, and the table name in the properties for SQL queries.
- Hostname or IP address
- Port number
- Username and password
- SSL certificate (if required by the database server)
Choose the method for creating a connection based on where you are in the platform
- In a project
- Click Assets > New asset > Connect to a data source. See Adding a connection to a project.
- In a deployment space
- Click Import assets > Data access > Connection. See Adding data assets to a deployment space.
- In the Platform assets catalog
- Click New connection. See Adding platform connections.
Next step: Add data assets from the connection
Where you can use this connection
You can use Apache Impala connections in the following workspaces and tools:
Projects
- Data Refinery
- SPSS Modeler
- Synthetic Data Generator
Catalogs
- Platform assets catalog
Apache Impala setup
Restriction
You can use this connection only for source data. You cannot write to data or export data with this connection.
Running SQL statements
To ensure that your SQL statements run correctly, refer to the Impala SQL Language Reference for the correct syntax.
Learn more
Parent topic: Supported connections