Defining data definitions in DataStage
When transforming or cleansing data, you must define the data that you are working with.
You define the data by creating or importing data definitions. You can save the data definitions for use in your flow designs.
Data definitions specify the data to be used at each stage of a flow. Data definitions are shared by all the flows in a project.
When you develop a flow, you typically load your stages with data definitions. You do this in the relevant Columns section when you edit stage properties.
Creating data definitions
- Open an existing project or create a project. Then, open a DataStage flow or create one.
- Open a connector or stage in the DataStage flow. On the Output tab, go to the Columns section and click Edit.
- Click the Save columns as new data definition icon. Enter a name for your data definition and click Next.
- Select the columns that you want to include in the data definition. Then, click Create.
- Click Apply and return.
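To make the result of these steps concrete, the following Python sketch models a data definition as a named subset of a stage's output columns. It is a conceptual illustration only: the `Column` and `DataDefinition` classes, the `save_columns_as_definition` function, and the sample column names are all hypothetical and are not part of any DataStage API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    """One output column of a stage (hypothetical model, not a DataStage type)."""
    name: str
    data_type: str
    nullable: bool = True


@dataclass(frozen=True)
class DataDefinition:
    """A named, reusable set of columns (hypothetical model)."""
    name: str
    columns: tuple


def save_columns_as_definition(name, stage_columns, selected_names):
    """Build a data definition from the selected subset of a stage's columns."""
    known = {col.name for col in stage_columns}
    missing = [n for n in selected_names if n not in known]
    if missing:
        raise ValueError(f"Unknown columns: {missing}")
    wanted = set(selected_names)
    # Keep the columns in the order in which the stage lists them.
    kept = tuple(col for col in stage_columns if col.name in wanted)
    return DataDefinition(name=name, columns=kept)


# Sample data, invented for illustration.
stage_columns = [
    Column("customer_id", "INTEGER", nullable=False),
    Column("email", "VARCHAR"),
    Column("signup_date", "DATE"),
]
customer_def = save_columns_as_definition(
    "customer_basic", stage_columns, ["customer_id", "email"]
)
print([col.name for col in customer_def.columns])
```

In this sketch, only the selected columns become part of the saved definition, which mirrors the column selection step in the procedure.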
Importing data definitions
- Open an existing project or create a project. Then, open a DataStage flow or create one.
- Open a connector or stage in the DataStage flow. On the Output tab, go to the Columns section and click Edit.
- Click the Import existing data definition icon.
- Click the Data definition section, select the data definition that you want to import, and click Next.
- Select the columns that you want to import, click Import, and then click Apply and return.
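The import steps can be pictured as copying selected columns from a saved definition into a stage. The Python sketch below is a conceptual model only: the `project_definitions` store, the `import_columns` function, and the sample columns are hypothetical and do not correspond to a DataStage API. The choice to copy each column, so that later edits in a stage leave the saved definition unchanged, is an assumption of this sketch and not documented product behavior.

```python
# Hypothetical project-level store: definition name -> list of column dicts.
project_definitions = {
    "customer_basic": [
        {"name": "customer_id", "type": "INTEGER"},
        {"name": "email", "type": "VARCHAR"},
        {"name": "signup_date", "type": "DATE"},
    ]
}


def import_columns(definitions, definition_name, selected_names):
    """Return copies of the selected columns from a saved data definition."""
    columns = definitions[definition_name]
    wanted = set(selected_names)
    unknown = wanted - {col["name"] for col in columns}
    if unknown:
        raise KeyError(f"Columns not in {definition_name!r}: {sorted(unknown)}")
    # Assumption of this sketch: copy each column so that editing a stage's
    # columns does not alter the saved definition.
    return [dict(col) for col in columns if col["name"] in wanted]


# Two stages, possibly in different flows, import from the same definition.
stage_a_output = import_columns(
    project_definitions, "customer_basic", ["customer_id", "email"]
)
stage_b_output = import_columns(
    project_definitions, "customer_basic", ["customer_id", "signup_date"]
)
print([col["name"] for col in stage_a_output])
print([col["name"] for col in stage_b_output])
```

Because both stages read from one project-level store, the sketch also reflects the idea that data definitions are shared by all the flows in a project.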
Creating a data definition DataStage component
You can create a data definition as a reusable component to use in DataStage flows.
- Open an existing project or create a project.
- Click New Asset + and select Create reusable DataStage components.
- Select Data definition as the DataStage component type.
- Enter a name for the data definition, then click Create.
- Provide information for columns and extended properties. Then, click Save.
You can manage all your DataStage components from the Assets tab.