Deploying AI services from Prompt Lab with notebooks

Last updated: Nov 07, 2024

To customize the programming logic of your generative AI application in an AI service, save and deploy your work with auto-generated deployment notebooks.

Before you begin

You must have an existing IBM Cloud API key or generate a new API key.
You must set up your task credentials by generating an API key. For more information, see Managing task credentials.

Saving your work in a deployment notebook

Follow these steps to create an online deployment for an AI service from the watsonx.ai Prompt lab user interface:

From your project, open the prompt lab and chat with documents to build a RAG solution. For more information, see Chatting with documents and images.
Click the Save icon and select Save as from the dropdrown menu.
In the Save you work dialog box, select Deployment notebook.

Note:
The deployment notebook contains the code to test, promote, and deploy an AI service. For deploying your application, you must save your work in a deployment notebook. You cannot use a standard notebook to deploy an AI service asset.
In the Define details section, enter a name and an optional description for your deployment notebook.
Click Save.

Running the deployment notebook

When you save your work in a deployment notebook, watsonx.ai automatically generates a notebook which contains the code to test, promote, and deploy an AI service. The AI service captures the logic to implement RAG pattern for grounded chats. To create an online deployment for your AI service, run the cells in the deployment notebook.

You can customize the auto-generated deployment notebook for your generative AI applications by clicking the Edit icon.

Learn more

Deploying AI services with direct coding

Parent topic: Deploying AI services with Prompt Lab

Was the topic helpful?

0/1000