IBM Cloud Databases for PostgreSQL lineage configuration

Last updated: Apr 15, 2025

To import lineage metadata from IBM Cloud Databases for PostgreSQL, create a data source definition, a connection, and a metadata import job.

Complete these steps:

  1. Create a data source definition.
  2. Create a connection to the data source in a project.
  3. Create a metadata import.

Creating a data source definition

Create a data source definition. Select PostgreSQL as the data source type.

Creating a connection to IBM Cloud Databases for PostgreSQL

Create a connection to the data source in a project. For connection details, see IBM Cloud Databases for PostgreSQL connection.

Creating a metadata import

Create a metadata import. The following options are specific to the IBM Cloud Databases for PostgreSQL data source:

Include and exclude lists

You can include or exclude assets up to the schema level. Provide databases and schemas in the format database/schema. Each part is evaluated as a regular expression. Assets that are added to the data source later are also included or excluded if they match the conditions specified in the lists. Example values:

  • myDB/: all schemas in the myDB database.
  • myDB2/.*: all schemas in the myDB2 database.
  • myDB3/mySchema1: the mySchema1 schema from the myDB3 database.
  • myDB4/mySchema[1-5]: any schema in the myDB4 database with a name that starts with mySchema and ends with a digit between 1 and 5.
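
The matching behavior behind these example values can be approximated locally before you configure an import. The following is a minimal sketch, not the product's implementation. It assumes that each part must match the whole database or schema name, and that an empty schema part (as in myDB/) means all schemas. The function name entry_matches is hypothetical.

```python
import re


def entry_matches(entry: str, database: str, schema: str) -> bool:
    """Check one include/exclude entry of the form database/schema."""
    # Split the entry at the first slash into its two regular expressions.
    db_pattern, _, schema_pattern = entry.partition("/")
    # Assumption: an empty schema part, as in "myDB/", selects every schema.
    if schema_pattern == "":
        schema_pattern = ".*"
    # Assumption: each part must match the entire name, not a substring.
    return (
        re.fullmatch(db_pattern, database) is not None
        and re.fullmatch(schema_pattern, schema) is not None
    )


# Try the example values from the list above.
for entry, db, schema in [
    ("myDB/", "myDB", "sales"),
    ("myDB2/.*", "myDB2", "hr"),
    ("myDB3/mySchema1", "myDB3", "mySchema2"),
    ("myDB4/mySchema[1-5]", "myDB4", "mySchema3"),
]:
    print(entry, db, schema, entry_matches(entry, db, schema))
```

If the product treats a pattern as a substring match instead, replace re.fullmatch with re.search in the sketch to see how the results change.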

External inputs

If you use external SQL scripts for IBM Cloud Databases for PostgreSQL, you can add them in a .zip file as an external input. You can organize the .zip file into subfolders that represent databases and schemas. After the scripts are scanned, they are added under the respective databases and schemas in the selected catalog or project. The .zip file can have the following structure:

    <database_name>
        <schema_name>
           <script_name.sql>
    <database_name>
        <script_name.sql>
    <script_name.sql>
    replace.csv

The replace.csv file contains placeholder replacements for the scripts that are added in the .zip file. For more information about the format, see Placeholder replacements.
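
As an illustration of the layout above, the following sketch assembles such a .zip file with the Python standard library. The function name build_external_input, the database, schema, and script names, and the SQL text are all hypothetical. The content of replace.csv is passed through unchanged because its format is described in Placeholder replacements and is not assumed here.

```python
import io
import zipfile
from typing import Dict, Optional


def build_external_input(
    scripts: Dict[str, str], replace_csv: Optional[str] = None
) -> bytes:
    """Pack SQL scripts into a .zip archive and return its bytes.

    Keys of `scripts` are paths inside the archive, for example
    "<database_name>/<schema_name>/<script_name.sql>".
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        for path, sql_text in scripts.items():
            archive.writestr(path, sql_text)
        # replace.csv sits at the top level of the archive, if provided.
        if replace_csv is not None:
            archive.writestr("replace.csv", replace_csv)
    return buffer.getvalue()


# Hypothetical scripts at schema level, database level, and top level.
zip_bytes = build_external_input(
    {
        "myDB/mySchema1/load_orders.sql": "INSERT INTO orders SELECT * FROM staging_orders;",
        "myDB/refresh_views.sql": "REFRESH MATERIALIZED VIEW mySchema1.order_totals;",
        "cleanup.sql": "DELETE FROM staging_orders;",
    }
)

with open("external_input.zip", "wb") as out_file:
    out_file.write(zip_bytes)
```

The resulting external_input.zip can then be supplied as the external input when you create the metadata import.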

Advanced import options

Extract extended attributes
You can extract extended attributes of columns, such as primary key, unique, and referential integrity constraints. By default, these attributes are not extracted.
Extraction mode
You can decide which extraction mode to run for the imported metadata. You have the following options:
  • Prefetch: use it for relational databases.
  • Parallel bulk: use it for analytical processing engines.
  • Single-thread: use it to avoid parallelism and large queries during extraction. When you select this mode, performance might be low.
Transformation logic extraction
You can enable building transformation logic descriptions from SQL code in SQL scripts.

Learn more

Parent topic: Supported connectors for lineage import