IBM watsonx.ai is a studio of integrated tools for working with generative AI capabilities that are powered by foundation models and for building machine learning models. IBM watsonx.ai provides a secure and collaborative environment where you can access your organization's trusted data, automate AI processes, and deliver AI in your applications.
Watch this short video that introduces watsonx.ai.
Looking for watsonx.data? Go to IBM watsonx.data documentation.
You can accomplish the following goals with IBM watsonx.ai:
- Build machine learning models
- Build models by using open source frameworks and code-based, automated, or visual data science tools.
- Experiment with foundation models
- Test prompts to generate, classify, summarize, or extract content from your input text. Choose from IBM models or open source models from Hugging Face.
- Manage the AI lifecycle
- Manage and automate the full AI model lifecycle with all the integrated tools and runtimes to train, validate, and deploy AI models.
Watsonx.ai provides these tools for working with data and models:
What you can use | What you can do | Best to use when |
---|---|---|
Data Refinery | Access and refine data from diverse data source connections. Materialize the resulting data sets as snapshots in time that might combine, join, or filter data for other data scientists to analyze and explore. |
You need to visualize the data when you want to shape or cleanse it. You want to simplify the process of preparing large amounts of raw data for analysis. |
Prompt Lab | Experiment with IBM and open source foundation models by inputting prompts. | You want to find the right model for your use case. |
AutoAI | Use AutoAI to automatically select algorithms, engineer features, generate pipeline candidates, and train model pipeline candidates. Then, evaluate the ranked pipelines and save the best as models. Deploy the trained models to a space, or export the model training pipeline that you like from AutoAI into a notebook to refine it. |
You want an advanced and automated way to build a good set of training pipelines and models quickly. You want to be able to export the generated pipelines to refine them. |
Notebooks and scripts | Use notebooks and scripts to write your own feature engineering, model training, and evaluation code in Python or R. Use training data sets that are available in the project, or connections to data sources such as databases, data lakes,
or object storage. Code with your favorite open source frameworks and libraries. |
You want to use Python or R coding skills to have full control over the code that is used to create, train, and evaluate the models. |
SPSS Modeler flows | Use SPSS Modeler flows to create your own model training, evaluation, and scoring flows. Use training data sets that are available in the project, or connections to data sources such as databases, data lakes, or object storage. | You want a simple way to explore data and define model training, evaluation, and scoring flows. |
RStudio | Analyze data and build and test models by working with R in RStudio. | You want to use a development environment to work in R. |
Decision Optimization | Prepare data, import models, solve problems and compare scenarios, visualize data, find solutions, produce reports, and save models to deploy with Watson Machine Learning. | You need to evaluate millions of possibilities to find the best solution to a prescriptive analytics problem. |
Federated learning | Train a common model that uses distributed data. | You need to train a model without moving, combining, or sharing data that is distributed across multiple locations. |
Watson Pipelines | Use pipelines to create repeatable and scheduled flows that automate notebook, Data Refinery, and machine learning pipelines, from data ingestion to model training, testing, and deployment. | You want to automate some or all of the steps in an MLOps flow. |
Synthetic Data Generator | Generate synthetic tabular data based on production data or a custom data schema using visual flows and modeling algorithms. | You want to mask or mimic production data or you want to generate synthetic data from a custom data schema. |
Security and privacy of your data and models
Your work on watsonx.ai, including your data and the models that you create, are private to your account:
- Your data is accessible only by you. Your data is used to train only your models. Your data will never be accessible or used by IBM or any other person or organization. Your data is stored in dedicated storage buckets from your IBM Cloud Object Storage service instance. Data is encrypted at rest and in motion.
- The models that you create are accessible only by you. Your models will never be accessible or used by IBM or any other person or organization. Your models are secured in the same way as your data.
Learn more about security and your options:
Underlying architecture
Watsonx includes the following functionality as the secure and scalable foundation for your organization to collaborate efficiently:
- Software and hardware
- Watsonx is fully managed by IBM on IBM Cloud. Software updates are automatic. Scaling of compute resources and storage is automatic.
- Storage
- A IBM Cloud Object Storage service instance is automatically provisioned for you to provide storage.
- Compute resources
- You can choose the appropriate runtime for your jobs. Compute resource usage is billed based on the rate for the runtime environment and its active duration.
- Security, compliance, and isolation
- The data security, network security, security standards compliance, and isolation of watsonx are managed by IBM Cloud. You can set up extra security and encryption options.
- User management
- You add users and user groups and manage their account roles and permissions with IBM Cloud Identity and Access Management. You assign roles within each collaborative workspace across the platform.
- Global search
- You can search for assets across the platform.
- Shared connections to data sources
- You can share connections with others across the platform in the Platform assets catalog.
- Samples
- You can experiment with IBM-curated sample data sets, notebooks, projects, and models.
Watsonx.ai on the watsonx platform includes the Watson Studio and Watson Machine Learning services.
Workspaces and assets
Watsonx is organized as a set of collaborative workspaces where you can work with your team or organization. Each workspace has a set of members with roles that provide permissions to perform actions. Most users work with assets, which are the items that users add to the platform. Data assets contain metadata that represents data, while assets that you create in tools, such as models, run code to work with data. You build assets in projects, and manage the deployment of completed assets in deployment spaces.
Projects and tools
Projects are where your data science and model builder teams work with data to create assets, such as, saved prompts, notebooks, models, or pipelines. Your first project, which is known as your sandbox project, is created automatically when you sign up for watsonx.ai.
The following image shows what the Overview page of a project might look like.
Deployment spaces
Deployment spaces are where your ModelOps team deploys models and other deployable assets to production and then tests and manages deployments in production. After you build models and deployable assets in projects, you promote them to deployment spaces.
The following image shows what the Overview page of a deployment space might look like.
Samples
The platform includes an integrated collection of samples that provides models, data assets, prompts, notebooks, and sample projects. Sample notebooks provide examples of data science and machine learning code. Sample projects contain sets of data, models, other assets, and detailed instructions on how to solve a particular business problem.
The following image shows what Samples looks like.