To access your data with the Apache Cassandra for DataStage connection, create a connection asset for it. This connection works with the DataStage Apache Cassandra for DataStage connector, which has exclusive features for DataStage.
Apache Cassandra is an open source, distributed NoSQL database.
Supported databases
- Apache Cassandra 3.11.x and 4.0.x
- Azure Cosmos DB
- DataStax Enterprise (DSE) 6.8.x
Create a connection with Apache Cassandra for DataStage
To create the connection asset, you need these connection details:
- Cluster contact points: The list of cluster seed nodes. Enter the IP addresses or hostnames of the Cassandra cluster nodes. Include the port numbers if they differ from the default Cassandra port. By default, Cassandra uses 7000 for cluster communication or 7001 if SSL is enabled. Separate the contact points with a semicolon (;) character. Example: hostA:portA;hostB:portB
- Local datacenter: The name of the datacenter that is local to the defined contact points.
- Protocol version: Select the CQL protocol version to connect to the target Cassandra cluster. Consult your Cassandra administrator for the protocol information. In most cases the Default protocol is the best choice.
  - Default: Protocol that is automatically used in the connection setup.
  - DSE_V1 (DSE 5.1)
  - DSE_V2 (DSE 6.0/6.7/6.8)
  - Newest beta: Newest protocol in beta
  - V1 (1.2.0, not supported)
  - V2 (2.0.0, not supported)
  - V3 (2.1.0)
  - V4 (2.2.0, 3.0.0)
  - V5 (3.10.0)
  - V6 (4.0.0, beta)
- Compression type: The type of compression for transport-level requests and responses.
  - No compression (default)
  - LZ4
  - Snappy
- Credentials: Authentication method
  - None: Unrestricted access to the database for all users.
  - Password authentication: Username and password to connect to the database.
- SSL/TLS: Select Use SSL/TLS to secure the connection between the client and the Cassandra cluster.
  - Use client-to-node encryption: The traffic between the client and the cluster nodes is encrypted. The client verifies the identity of the Cassandra nodes that it connects to.
  - Keystore path: The path to your keystore file.
  - Keystore password: The password that was used to generate the keystore.
  - Use client certificate authentication: Cassandra nodes verify the identity of the client.
  - Truststore path: The path to your truststore file.
  - Truststore password: The password that was used to generate the truststore.
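The contact points format described above (semicolon-separated entries, each with an optional port) can be checked before you paste the value into the connection form. The following Python sketch is illustrative only and is not part of the connector: the function name `parse_contact_points` and the constants are hypothetical, the default ports are the values stated in this topic, and bracketed IPv6 addresses are not handled.

```python
from typing import List, Tuple

# Defaults as stated in this topic: 7000, or 7001 if SSL is enabled.
DEFAULT_PORT = 7000
DEFAULT_SSL_PORT = 7001


def parse_contact_points(value: str, use_ssl: bool = False) -> List[Tuple[str, int]]:
    """Split 'hostA:portA;hostB:portB' into a list of (host, port) pairs."""
    default_port = DEFAULT_SSL_PORT if use_ssl else DEFAULT_PORT
    points: List[Tuple[str, int]] = []
    for entry in value.split(";"):
        entry = entry.strip()
        if not entry:
            continue  # tolerate a trailing semicolon or stray whitespace
        host, sep, port_text = entry.rpartition(":")
        if not sep:
            # No colon in the entry: it is a bare host, so use the default port.
            host, port = entry, default_port
        else:
            if not host or not port_text.isdigit():
                raise ValueError(f"invalid contact point: {entry!r}")
            port = int(port_text)
            if not 1 <= port <= 65535:
                raise ValueError(f"port out of range in: {entry!r}")
        points.append((host, port))
    if not points:
        raise ValueError("at least one contact point is required")
    return points
```

For example, `parse_contact_points("hostA:9042;hostB")` yields one pair with the explicit port and one pair with the default port, and an entry such as `hostA:abc` raises a `ValueError`.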
Choose the method for creating a connection based on where you are in the platform:
- In a project: Click Assets > New asset > Connect to a data source. See Adding a connection to a project.
- In a deployment space: Click Import assets > Data access > Connection. See Adding data assets to a deployment space.
- In the Platform assets catalog: Click New connection. See Adding platform connections.
Next step: Add data assets from the connection
Where you can use this connection
You can use the Apache Cassandra for DataStage connection in the following workspaces and tools:
Projects
- DataStage (DataStage service). See Connecting to a data source in DataStage.
Catalogs
- Platform assets catalog
Learn more
- Apache Cassandra documentation
- Azure Cosmos DB for Apache Cassandra documentation
- DataStax docs for Apache Cassandra
Related connection: Apache Cassandra connection
Parent topic: Supported connections