Last updated: Jun 07, 2024
To access your data from a storage service that is compatible with the Amazon S3 API, create a connection asset for it.
Create a Generic S3 connectionCopy link to section
Copy link to section
To create the connection asset, you need these connection details:
- Endpoint URL: The endpoint URL to access to S3
- Bucket(optional): The name of the bucket that contains the files
- Region (optional): S3 region. Specify a region that matches the regional endpoint.
- Access key: The access key (username) that authorizes access to S3
- Secret key: The password associated with the Access key ID that authorizes access to S3
- The SSL certificate of the trusted host. The certificate is required when the host certificate is not signed by a known certificate authority.
- Disable chunked encoding: Select if the storage does not support chunked encoding.
- Enable global bucket access: Consult the documentation for your S3 data source for whether to select this property.
- Enable path style access: Consult the documentation for your S3 data source for whether to select this property.
Choose the method for creating a connection based on where you are in the platformCopy link to section
Copy link to section
- In a project
- Click Assets > New asset > Connect to a data source. See Adding a connection to a project.
- In a deployment space
- Click Import assets > Data access > Connection. See Adding data assets to a deployment space.
- In the Platform assets catalog
- Click New connection. See Adding platform connections.
Next step: Add data assets from the connectionCopy link to section
Copy link to section
Where you can use this connectionCopy link to section
Copy link to section
You can use the Generic S3 connection in the following workspaces and tools:
Projects
- Data Refinery
- Decision Optimization
Catalogs
- Platform assets catalog
Generic S3 connection setupCopy link to section
Copy link to section
For setup information, consult the documentation of the S3-compatible data source that you are connecting to.
Supported file typesCopy link to section
Copy link to section
The Generic S3 connection supports these file types: Avro, CSV, delimited text, Excel, JSON, ORC, Parquet, SAS, SAV, SHP, and XML.
Table formatsCopy link to section
Copy link to section
In addition to Flat file, the Generic S3 connection supports these Data Lake table formats: Delta Lake and Iceberg.
Related connection: Amazon S3 connection
Parent topic: Supported connections