0 / 0
Deploying a prompt template
Last updated: Dec 12, 2024
Deploying a prompt template

Deploy a prompt template so you can add it to a business workflow or so you can evaluate the prompt template to measure performance.

Promote a prompt template to a deployment space

To promote a prompt template to a deployment space, your prompt template must contain at least one variable.

Restriction:

You cannot promote prompt templates that do not contain any variables to your deployment space.

To deploy prompt templates with no variables, you must export the Project that contains the prompt template asset and import the Project to your deployment space. For more information, see Importing a prompt template to a deployment space.

Note these additional considerations for deploying a prompt template:

  • You can only create deployments of prompt templates that reference model deployments in the same project or space where you created the model deployment. If you promote a template that references a deployed model to a space, a copy of the model asset is automatically added to the space. You must deploy the model in the space before you deploy the prompt template.
  • If you export a prompt template that references a deployed model, the associated model asset is automatically exported with the prompt template. When you import the template into a new container, the model is also imported. The model must be deployed in the new container before you deploy the prompt template.

Before you begin

  1. Save a prompt template that contains at least one variable as a project asset. See Building reusable prompts.

  2. You must set up your task credentials by generating an API key. For more information, see Managing task credentials.

Procedure

To deploy a prompt template, complete the following steps:

  1. Open the project containing the prompt template.
  2. Click Promote to space for the template.

Promoting a prompt template to a deployment space

  1. In the Target deployment space field, choose a deployment space or create a new space. Note the following:

    The deployment space must be associated with a machine learning instance that is in the same account as the project where the prompt template was created.

    If you don't have a deployment space, choose Create a new deployment space, and then follow the steps in Creating deployment spaces.

    If you plan to evaluate the prompt template in the space, the recommended Deployment stage type for the space is Production. For more information on evaluating, see Evaluating a prompt template in a deployment space.

    Note: The deployment space stage cannot be changed after the space is created.

  2. Tip: Select View deployment in deployment space after creating. Otherwise, you need to take more steps to find your deployed asset.

  3. From the Assets tab of the deployment space, click Deploy. You create an online deployment, which means you can send data to the endpoint and receive a response in real-time.

    Deploying a prompt template

  4. Optional: In the Deployment serving name field, add a unique label for the deployment.

    The serving name is used in the URL for the API endpoint that identifies your deployment. Adding a name is helpful because the human-readable name that you add replaces a long, system-generated unique ID that is assigned otherwise.

    The serving name also abstracts the deployment from its service instance details. Applications refer to this name, which allows for the underlying service instance to be changed without impacting users.

    The name can have up to 36 characters. The supported characters are [a-z,0-9,_].

    The name must be unique across the IBM Cloud region. You might be prompted to change the serving name if the name you choose is already in use.

Importing a prompt template to a deployment space

To deploy prompt templates with no variables, you must export the Project that contains the prompt template asset and import the Project to your deployment space.

Before you begin

Export your Project that contains the prompt template asset as a ZIP file.

Procedure

To import a prompt template asset to a deployment space, follow these steps:

  1. From your deployment space toolbar, select the Export/Import space Import/Export icon dropdown menu and click Import space.

  2. Upload the ZIP file that contains your prompt template asset exported from your Project and click Done.

    Importing a prompt template to a deployment space

Testing the deployed prompt template

After the deployment successfully completes, click the deployment name to view the deployment.

Deploying a prompt template

  • API reference tab includes the API endpoints and code snippets that you need to add this prompt template to an application.
  • Test tab supports testing the prompt template. Enter test data as text, streamed text, or in a JSON file.

If the watsonx.governance service is enabled, you also see these tabs:

  • Evaluate provides the tools for evaluating the prompt template in the space. Click Activate to choose the dimensions to evaluate. For details, see Evaluating prompt templates.
  • AI Factsheets displays all of the metadata that is collected for the prompt template. Use these details for tracking the prompt template for governance and compliance goals. See Tracking prompt templates.

Deploying a prompt template programmatically

When you deploy a prompt template programmatically, you must follow these steps:

  1. Move the model asset and the prompt template to a deployment space. For example, if you are using the watsonx.ai API, fetch the template and parse the model_id stored in prompt.model_id. If you are using Data and AI Common Core API, use entity.wx_prompt.prompt.model_id. The ID is a string of keys split with :. The last item in the string is the model resource key, which refers to the model asset associated with the template.
  2. Deploy the model, using the model resource key from the first step to locate the model to deploy for the template. For example, if you are using the Data and AI Common Core API API, the search request looks for the space or project ID (XXX), the user's token (YYY), and the model resource key associated with the prompt template (ZZZ).
curl --location 'https://api.dataplatform.cloud.ibm.com/v2/asset_types/wml_model/search?space_id=XXX' \
--header 'Authorization: Bearer YYY' \
--header 'Content-Type: application/json' \
--data '{
    "query": "(asset.resource_key:ZZZ)",
    "include": "entity",
    "limit": 100
}'
  1. After the model is successfully deployed, deploy the prompt template

Learn more

Parent topic: Deploying and managing assets

Generative AI search and answer
These answers are generated by a large language model in watsonx.ai based on content from the product documentation. Learn more