0 / 0
Go back to the English version of the documentation
Microsoft Azure Data Lake Store connection
Microsoft Azure Data Lake Store connection

Microsoft Azure Data Lake Store connection

To access your data in Microsoft Azure Data Lake Store, create a connection asset for it.

Azure Data Lake Store (ADLS) is a scalable data storage and analytics service that is hosted in Azure, Microsoft's public cloud. The Microsoft Azure Data Lake Store connection supports access to both Gen1 and Gen2 Azure Data Lake Storage repositories.

Create a connection to Microsoft Azure Data Lake Store

To create the connection asset, you need these connection details:

  • WebHDFS URL: The WebHDFS URL for accessing HDFS.
    To connect to a Gen 2 ADLS, use the format, https://<account-name>.dfs.core.windows.net/<file-system>
    Where <account-name> is the name you used when you created the ADLS instance.
    For <file-system>, use the name of the container you created. For more information, see the Microsoft Data Lake Storage Gen2 documentation.

  • Tenant ID: The Azure Active Directory tenant ID
  • Client ID: The client ID for authorizing access to Microsoft Azure Data Lake Store
  • Client secret: The authentication key that is associated with the client ID for authorizing access to Microsoft Azure Data Lake Store

For Private connectivity, to connect to a database that is not externalized to the internet (for example, behind a firewall), you must set up a secure connection.

Choose the method for creating a connection based on where you are in the platform

In a project Click Assets > New asset > Data access tools > Connection. See Adding a connection to a project.


In a catalog Click Add to catalog > Connection. See Adding a connection asset to a catalog.


In a deployment space Click Add to space > Connection. See Adding connections to a deployment space.


In the Platform assets catalog Click New connection. See Adding platform connections.

Next step: Add data assets from the connection

Where you can use this connection

You can use Microsoft Azure Data Lake Store connections in the following workspaces and tools:

Projects

  • DataStage (DataStage service)
  • Metadata import (Watson Knowledge Catalog)
  • SPSS Modeler (Watson Studio)

Catalogs

  • Platform assets catalog
  • Other catalogs (Watson Knowledge Catalog)

Azure Data Lake Store authentication setup

To set up authentication, you need a tenant ID, client (or application) ID, and client secret.

Supported file types

The Microsoft Azure Data Lake Store connection supports these file types: Avro, CSV, Delimited text, Excel, JSON, ORC, Parquet, SAS, SAV, SHP, and XML.

Learn more

Azure Data Lake

Parent topic: Supported connections