Connecting to Amazon S3 in Data Virtualization
The Amazon S3 connector requires specific information to create a connection to it in Data Virtualization.
For more information, see Data sources in object storage in Data Virtualization.
Before you begin
To access data that is stored in Cloud Object Storage, you must create a connection to the data source where the files are located. Data Virtualization integrates with object storage connections and supports PARQUET (or PARQUETFILE), Optimized Row Columnar (ORC), comma-separated values (CSV), tab-separated values (TSV), and JSON data formats. All other file formats are not supported.
About this task
For more information, see Getting started with Cloud Object Storage.
Authenticating for Data Virtualization object storage connections is standard for all supported platforms (Ceph, IBM Cloud Object Storage, Amazon S3).
Procedure
To connect to Amazon S3 in Data Virtualization, follow these steps.
On the navigation menu, click Data sources page appears.
. TheClick
to view a list of data sources.-
Select the Amazon S3 data source connection.
-
Enter the connection name and description.
-
To configure an Amazon S3 connection, you must provide values for the following parameters.
- Bucket
- Endpoint URL
Note: Only Access key and Secret key authentication methods are supported when you create a connection to Cloud Object Storage. - Select your authentication method.
To find the values for the Access key and Secret key see Understanding and getting your AWS credentials.
-
Click Create to add the connection to the data source environment.