You can use pre-defined templates to deploy your AI services in watsonx.ai. AI service templates provide a pre-built foundation for AI applications, enabling developers to focus on the core logic of their application, rather than starting from scratch.
Standardizing AI service deployments with templates
AI service templates are pre-built, reusable, and customizable components that provide a structured approach to deploying and managing generative AI applications. They offer a standardized way to package, deploy, and integrate AI models with other applications and systems, so that developers can focus on building and training models rather than on the underlying infrastructure and deployment logistics. By providing a pre-defined structure, configuration, and set of tools, AI service templates simplify deployment, reduce the risk of errors, and improve the efficiency and consistency of AI development and deployment.
Components of AI service templates
The components of an AI service template are as follows:
- Source directory: The source directory contains the code used by the deployed functions (from the `ai_service.py` file). Upon deployment, the source directory is packaged and sent to the cloud as a package extension.
- Core application logic: The core application logic is contained in the `ai_service.py` file. This file encompasses the functions to be deployed, including the application's core logic, input schema definition, and authentication code.
- Configuration file: The `config.toml` file stores the deployment metadata and configuration settings for the model.
- Tests: The `tests/` directory contains the unit tests for the template, including tests for tools and utility functions.
- Deployment scripts: The `scripts/deploy.py` script deploys the template on IBM Cloud. The `examples/execute_ai_service_locally.py` script runs the AI service locally, and the `examples/query_existing_deployment.py` script runs inference against an existing deployment.
- Project configuration: The `pyproject.toml` file manages dependencies and packages for the project.
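To give a feel for how the pieces above fit together, the following is a minimal, hypothetical sketch of the kind of core function that `ai_service.py` contains: an outer function that receives deployment context and custom configuration, and an inner function that serves requests. All names, signatures, and the input schema here are illustrative assumptions, not the actual watsonx.ai API.

```python
# Hypothetical sketch of an ai_service.py core function.
# The outer function receives deployment context plus custom
# key-value data; it returns the callable that serves requests.
# Names and schema are illustrative, not the watsonx.ai API.

def deployable_ai_service(context, **custom):
    # custom would hold key-value data, e.g. from config.toml
    model_id = custom.get("model_id", "example/model")

    def generate(payload):
        # Assumed input schema: {"messages": [{"role": ..., "content": ...}]}
        messages = payload.get("messages", [])
        if not messages:
            return {"error": "messages must not be empty"}
        last = messages[-1]["content"]
        # The real core logic would call the model here; this sketch echoes.
        return {"model": model_id, "reply": f"echo: {last}"}

    return generate

if __name__ == "__main__":
    service = deployable_ai_service(context=None, model_id="demo/model")
    print(service({"messages": [{"role": "user", "content": "hi"}]}))
```

The nested-function shape matters: everything captured by the inner function at deployment time (model IDs, clients, credentials resolved from configuration) is available at inference time without being re-read on every request.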
Deploying AI services with templates
Follow these steps to deploy AI services with templates:
- Prepare the template: Clone the template repository, install the required dependencies and tools, such as `Pipx` or `Poetry`, set up the environment on your local system, and activate the virtual environment. This ensures that the template is properly configured and ready for deployment.
- Configure the template: Fill in the `config.toml` file with the necessary credentials and configuration settings, and customize the application logic as needed to suit the specific requirements of the AI service. The configuration file stores deployment metadata and configuration settings for the model, and is used to tweak the model for local runs. You can also provide additional key-value data to the app by using Parameter sets or the `CUSTOM` object in the `config.toml` file. To learn more about storing and managing parameter sets, see Parameter sets in the watsonx.ai Python client library documentation. For handling external credentials, you can use IBM Secrets Manager, a service that enables you to securely store and manage sensitive information, such as API keys and passwords. By keeping your credentials out of your code and configuration files, Secrets Manager improves the security of your application. For more information, see the IBM Cloud Secrets Manager API documentation.
- Test the template: Before deploying the template, test it to ensure that it works correctly. Run the unit tests to verify that the template functions as expected, and test the application locally by running the `examples/execute_ai_service_locally.py` script with sample inputs.
- Deploy the template: After the template is tested and validated, deploy it by using the `scripts/deploy.py` script, which automates the deployment process and creates a package extension. To handle external credentials during deployment, you can use Secrets Manager: create a secret that contains the credentials that you want to use, then reference that secret during deployment so that your app is deployed with the credentials from the secret. The deployment process can take a few minutes to complete, after which you receive a deployment ID.
- Inference the template: Run inference against the deployed AI service by using the `examples/query_existing_deployment.py` script to test the AI service with sample inputs and verify the output. You can also use the user interface to inference the deployment and interact with the AI service. To provide additional key-value data to the application at inference time, use Parameter sets or the `CUSTOM` object in the `config.toml` file. These key-value pairs are passed to your application during deployment and can be accessed during inference.
Sample template
To learn how to deploy AI services by using templates, see the following sample template:
| Template | Description |
| --- | --- |
| LangGraph LLM app template | A LangGraph LLM app template with function calling capabilities |
Parent topic: Deploying AI services