You can build a customized AI service which is tailored to your generative AI application from the ground up. For example, if you are deploying an asset that uses retrieval augmented generation (RAG), you can capture the logic for retrieving answers
from the grounding documents in the AI service.
Process overview
Copy link to section
The following graphic illustrates the process of coding AI services.
You can create a notebook that contains the AI service and connections within the Project. The AI service captures the logic of your RAG application and contains the generateration function, which is a deployable unit of code. The generation
function is promoted to the deployment space, which is used to create a deployment. The deployment is exposed as a REST API endpoint that can be accessed by other applications. You can send a request to the REST API endpoint to use the deployed
AI service for inferencing. The deployed AI service processes the request and returns a response.
Tasks for creating and deploying AI services
Copy link to section
Here are the steps that you must follow to create, deploy, and manage AI services:
Create AI service: Define an AI service in a notebook by using Python. The AI service must meet specific requirements for deploying as an AI service.
Test AI service: Test the coding logic of your AI service locally.
Create AI service assets: After creating and testing the AI service, you must package the AI service as a deployable asset.
Set up the environment Download the test dataset. Defining the foundation model on watsonx Set up connectivity information to Elasticsearch Generate a retrieval-augmented response to a question Creating an AI service
Testing AI service function locally Deploying the AI service
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.