To access your data in Apache HDFS, create a connection asset for it.
Apache Hadoop Distributed File System (HDFS) is a distributed file system that is designed to run on commodity hardware. Apache HDFS was formerly Hortonworks HDFS.
Create a connection to Apache HDFS
To create the connection asset, you need these connection details. The WebHDFS URL is required.
The properties available in the connection form depend on whether you select Connect to Apache Hive, which lets you write tables to the Hive data source.
WebHDFS URL: The URL used to access HDFS.
Hive host: Hostname or IP address of the Apache Hive server.
Hive database: The database in Apache Hive.
Hive port number: The port number of the Apache Hive server. The default value is 10000.
Hive HTTP path: The path of the endpoint such as gateway/default/hive when the server is configured for HTTP transport mode.
SSL certificate (if required by the Apache Hive server).
For Private connectivity, to connect to a database that is not exposed to the internet (for example, one that is behind a firewall), you must set up a secure connection.
Choose the method for creating a connection based on where you are in the platform