IBM Knowledge Catalog on Cloud Pak for Data as a Service
Last updated: Feb 21, 2025
Description
IBM Knowledge Catalog, a core service of Cloud Pak for Data as a Service, connects people to the data and knowledge
that they need. The platform is supported by a data governance framework to ensure that data access
and data quality are compliant with your business rules and standards. IBM Knowledge Catalog delivers automated enrichment of data assets with business metadata to
align company policies and vocabularies to data in support of AI, analytics, and compliance use
cases.
IBM Knowledge Catalog provides the data governance and privacy capabilities of the
data fabric architecture.
You develop a knowledge core by curating data assets and enriching them with governance artifacts
that describe their properties and meaning. Data stewards and data engineers curate data by
importing metadata, preparing the data assets, enriching the data assets by assigning governance
artifacts, and publishing the assets into catalogs. Some governance artifacts are predefined and are
automatically assigned to data assets. Data stewards can create or import a business vocabulary to
further enrich data assets during data curation. Knowledge Accelerators provide sets of ready-to-use business
vocabulary for specific industries. You use categories to control who can create and use governance
artifacts, and for what purpose.
You can create data protection rules that define how to protect data. Data protection rules are
enforced automatically in a uniform manner in governed catalogs. You can configure data protection
rules to mask sensitive data based on the content, format, or meaning of the data, or the identity
of the users who access the data. When you mask data, you unlock the data for users who are not
authorized to view sensitive data and avoid the need to maintain multiple copies of the data.
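The idea behind a data protection rule can be sketched in a few lines of Python. This is a conceptual illustration only, not the IBM Knowledge Catalog API: the `ProtectionRule` class, the `apply_rules` function, the `data_steward` group, and the `Email Address` classification label are all hypothetical names chosen for the example. It assumes a rule that masks a column based on the meaning of its data and the identity of the user who reads it.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set


@dataclass
class ProtectionRule:
    """Hypothetical stand-in for a data protection rule (not an IBM API)."""
    name: str
    # Decides whether the rule applies, given a column's classification
    # and the groups of the user who is requesting the data.
    applies_to: Callable[[str, Set[str]], bool]
    # Transforms a sensitive value into its masked form.
    mask: Callable[[str], str]


def redact(value: str) -> str:
    """Replace every character with 'X', preserving the value's length."""
    return "X" * len(value)


def apply_rules(
    row: Dict[str, str],
    classifications: Dict[str, str],
    user_groups: Set[str],
    rules: List[ProtectionRule],
) -> Dict[str, str]:
    """Return a copy of the row with the first matching rule applied per column."""
    masked = {}
    for column, value in row.items():
        data_class = classifications.get(column, "")
        for rule in rules:
            if rule.applies_to(data_class, user_groups):
                value = rule.mask(value)
                break
        masked[column] = value
    return masked


# Assumed rule: hide email addresses from anyone who is not a data steward.
rules = [
    ProtectionRule(
        name="mask-email-for-non-stewards",
        applies_to=lambda dc, groups: dc == "Email Address"
        and "data_steward" not in groups,
        mask=redact,
    )
]

row = {"name": "Ada", "email": "ada@example.com"}
classifications = {"email": "Email Address"}

analyst_view = apply_rules(row, classifications, {"analyst"}, rules)
steward_view = apply_rules(row, classifications, {"data_steward"}, rules)
```

In this sketch both users read the same stored row. Only the view differs, which is why masking avoids the need to keep a separate sanitized copy of the data.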
You provide a self-service way to find and share assets across your enterprise with catalogs:
- Collaborators in a catalog have access to data assets without needing separate credentials or
  being able to see the credentials. Collaborators have roles that control what activities they can
  perform in the catalog.
- Data assets contain information about how to access the data, data classifications, assigned
  business terms and other governance artifacts, relationships with other assets, and ratings and
  reviews. Data assets can be relational data or unstructured data, such as PDF or Microsoft Office
  documents.
- Other types of assets in catalogs include operational assets, which data scientists create with
  tools to work with data, such as models, notebooks, and dashboards.
- Search based on data asset metadata and properties and AI-powered recommendations help users
  find the data that they need.
Data scientists find assets in catalogs and then copy the assets into projects where they analyze
data and build models with Watson Studio and Watson Machine Learning tools.
Use IBM Manta Data Lineage for advanced metadata import.
Table 2. Related services. The following related services are often used with this service and
provide complementary features, but they are not required.
Use built-in search, automatic metadata propagation, and simultaneous highlighting of
compilation errors to create, edit, load, and run jobs that transform and tailor information for
your enterprise.
Compatible data sources
See Connectors for a list of data source services that are compatible.