Batch deployment input details for Decision Optimization models

Last updated: Oct 09, 2024

Follow these rules when you are specifying input details for batch deployments of Decision Optimization models.

Data type summary table:

Data	Description
Type	inline and data references
File formats	Refer to Model input and output data file formats.

Data Sources

Input/output inline data:

Inline input data is converted to CSV files and used by engine.
CSV output data is converted to output inline data.
Base64-encoded raw data is supported as input and output.

Input/output data references:

Tabular data is loaded from CSV, XLS, XLSX, JSON files or database data sources supported by the WDP connection library, converted to CSV files, and used by engine.
CSV output data is converted to tabular data and saved to CSV, XLS, XLSX, JSON files or database data sources supported by the WDP connection library.
Raw data can be loaded and saved from/to any file data sources supported by the WDP connection library.
No support for .zip files.
The environment variables parameter of deployment jobs is not applicable.

If you are specifying input/output data references programmatically:

Data source reference type depends on the asset type. Refer to the Data source reference types section in Adding data assets to a deployment space.
For S3 or Db2, connection details must be specified in the input_data_references.connection parameter, in the deployment job’s payload.
For S3 or Db2, location details such as table name, bucket name, or path must be specified in the input_data_references.location.path parameter, in the deployment job’s payload.
For data_asset, a managed asset can be updated or created. In case of creation you can set the name and description for the created asset.

You can use a pattern in ID or connection properties. For example:

To collect all output CSV as inline data:

"output_data": [ { "id":".*\\.csv"}]

To collect job output in a particular S3 folder:

"output_data_references": [ {"id":".*", "type": "s3", "connection": {...}, "location": { "bucket": "do-wml", "path": "${job_id}/${attachment_name}" }}]

Note:

Support for s3 and db2 values for scoring.input_data_references.type and scoring.output_data_references.type is deprecated and will be removed in the future. Use connection_asset or data_asset instead. See the documentation for the Watson Machine Learning REST API or Watson Machine Learning Python client library for details and examples.

For details on deploying decision optimization solutions, refer to Model input and output data adaptation.

Parent topic: Batch deployment input details by framework

Was the topic helpful?

0/1000