0 / 0
Managing data for model evaluations

Managing data for model evaluations

To enable model evaluations in Watson OpenScale, you must prepare your data for logging to generate insights.

You must provide your model data in a supported format to enable model evaluations. Your model transactions are processed logged in the data mart. The data mart is the logging database that stores the data that is used for model evaluations. The following sections describe the different types of data that are logged for model evaluations:

Training data

You must provide training data to generate the statistics that you need to configure model evaluations. Training data contains labeled feature columns that Watson OpenScale measures to determine their impact on model outcomes and a prediction column that contains the outcome that the model is trained to predict. The following example shows training data from the German Credit Risk dataset:

CSV file of training data for Watson OpenScale

To enable model evaluations, you must connect your training data to Watson OpenScale. The training data for your model must be provided in a format that Watson OpenScale understands. For more information, see Managing training data.

Payload data

Payload data contains the input and output transactions for your deployment. To configure evaluations, the payload data from your model is stored in a payload logging table. The payload logging table contains the feature and prediction columns that exist in your training data and a prediction probability column that contains the model's confidence in the prediction that it provides. The table also includes timestamp and ID columns to identify each scoring request that you send to the service that you're using as shown in the following example:

Python SDK sample output of payload logging table

You must send scoring requests to provide a log of your model transactions. For more information, see Managing payload data.

Feedback data

Feedback data is labeled data that matches the structure of training data and includes known model outcomes that are compared to your model predictions to measure the accuracy of your model. You must upload feedback data regularly to Watson OpenScale to continuously measure the accuracy of your model predictions. For more information, see Managing feedback data.

Learn more

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more