Text Mining model nuggets

You can run a Text Mining node to automatically generate a concept or category model nugget using the Generate directly option in the node settings. Or you can use a more hands-on, exploratory approach using the Build interactively mode to generate model nuggets from within the interactive workbench.

Text Mining nugget: Concept model

A Text Mining concept model nugget is created whenever you successfully run a Text Mining node where you've selected the option to Generate a model directly in the node settings. Use a text mining concept model nugget for the real-time discovery of key concepts in other text data, such as scratch-pad data from a call center.

The concept model nugget itself comprises a list of concepts, which have been assigned to types. You can select any or all of the concepts in that model for scoring against other data. When you run a flow containing a Text Mining model nugget, new fields are added to the data according to the build mode selected in the settings of the Text Mining modeling node prior to building the model.

If the model nugget was generated using translated documents, the scoring will be performed in the translated language. Similarly, if the model nugget was generated using English as the language, you can specify a translation language in the model nugget, since the documents will then be translated into English.

Text Mining model nuggets are placed in the Outputs pane at the upper-right are of the application when they're generated.

Text Mining nugget: Category model

A Text Mining category model nugget is created whenever you generate a category model from within the interactive workbench. This modeling nugget contains a set of categories, whose definition is made up of concepts, types, TLA patterns, and/or category rules. The nugget is used to categorize survey responses, blog entries, other web feeds, and any other text data.

If you launch an interactive workbench session in the modeling node, you can explore the extraction results, refine the resources, and fine-tune your categories before you generate category models. When you run a flow containing a Text Mining model nugget, new fields are added to the data according to the build mode selected in the settings of the Text Mining node prior to building the model.

If the model nugget was generated using translated documents, the scoring will be performed in the translated language. Similarly, if the model nugget was generated using English as the language, you can specify a translation language in the model nugget, since the documents will then be translated into English.

Text Mining model nuggets are placed in the Outputs pane at the upper-right are of the application when they're generated.