The HDBSCAN node in watsonx.ai exposes the core features and
commonly used parameters of the HDBSCAN library. The node is implemented in Python, and you can use
it to cluster your dataset into distinct groups when you don't know what those groups are at first.
Unlike most learning methods in watsonx.ai, HDBSCAN
models do not use a target field. This type of learning, with no target field, is called
unsupervised learning. Rather than trying to predict an outcome, HDBSCAN tries to
uncover patterns in the set of input fields. Records are grouped so that records within a group or
cluster tend to be similar to each other, but records in different groups are dissimilar. The
HDBSCAN algorithm views clusters as areas of high density separated by areas of low density. Due to
this rather generic view, clusters found by HDBSCAN can be any shape, as opposed to k-means which
assumes that clusters are convex shaped. Outlier points that lie alone in low-density regions are
also marked. HDBSCAN also supports scoring of new samples.1
To use the HDBSCAN node, you must set up an upstream Type node. The HDBSCAN node will read input
values from the Type node (or from the Types of an upstream import node).
For more information about HDBSCAN clustering algorithms, see the HDBSCAN
documentation. 1
About cookies on this siteOur websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising.For more information, please review your cookie preferences options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.