The Categories and concepts view

The interactive workbench includes several views. With the Categories and concepts view, you can create and explore categories as well as explore and tweak the extraction results.

A category is a group of closely related ideas and patterns to which documents and records are assigned through a scoring process. Concepts are the most basic level of extraction results. You can use them as building blocks, called descriptors, for your categories.

Figure 1. Categories and concepts view

The Categories and concepts view is organized into panes.

Categories pane

Located in the upper left corner, this pane presents a table in which you can manage any categories you build. After extracting the concepts and types from your text data, you can begin building categories by using automatic techniques such as semantic networks and concept inclusion, or by creating them manually. Not all automatic techniques are available for all languages. If you select a category name and click the Settings icon, the category settings open and display all of the descriptors that make up its definition, such as concepts, types, and rules.

When you select a row in the pane, you can then display information about the corresponding documents, records, or descriptors.
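To illustrate how a category definition relates to the records assigned to it, here is a minimal Python sketch. It is not the product's scoring engine. The category names, the descriptor sets, and the `score` function are all hypothetical, and the sketch assumes the simplest possible rule: a record is assigned to a category when it contains at least one of the category's descriptor concepts.

```python
import re

# Hypothetical category definitions: category name -> descriptor concepts.
# Real categories can also use types and rules as descriptors.
CATEGORIES = {
    "Battery issues": {"battery", "charger"},
    "Display issues": {"screen", "brightness"},
}

def score(records, categories=CATEGORIES):
    """Map each category name to the indices of records that contain
    at least one of its descriptors (an assumed, simplified rule)."""
    assigned = {name: [] for name in categories}
    for index, text in enumerate(records):
        # Lowercase word tokens of the record.
        tokens = set(re.findall(r"[a-z]+", text.lower()))
        for name, descriptors in categories.items():
            if tokens & descriptors:
                assigned[name].append(index)
    return assigned
```

With the records "Battery died after a week", "Screen brightness is low", and "Fast shipping", the sketch assigns the first record to "Battery issues", the second to "Display issues", and leaves the third uncategorized.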

Extraction results pane

Located in the lower left corner, this area presents the extraction results. When you run an extraction, the extraction engine reads through the text data, identifies the relevant concepts, and assigns a type to each. Concepts are words or phrases extracted from your text data. Types are semantic groupings of concepts stored in the form of type dictionaries. When the extraction is complete, concepts and types appear with color coding in this pane.
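The relationship between concepts and types can be pictured with a small Python sketch. It is an illustration only and makes strong simplifying assumptions: the type dictionaries, their contents, and the `extract` function are hypothetical, and concepts are matched as single lowercase words, whereas the actual extraction engine relies on linguistic resources and also handles phrases.

```python
import re
from collections import Counter

# Hypothetical type dictionaries: type name -> concept terms.
TYPE_DICTIONARIES = {
    "Positive": {"excellent", "friendly"},
    "Negative": {"slow", "broken"},
    "Product": {"battery", "screen"},
}

def extract(records):
    """Return a Counter mapping (concept, type) to the number of
    records in which that concept occurs."""
    # Invert the dictionaries so each term points to its type.
    lookup = {
        term: type_name
        for type_name, terms in TYPE_DICTIONARIES.items()
        for term in terms
    }
    counts = Counter()
    for text in records:
        # Use a set so a concept counts once per record.
        tokens = set(re.findall(r"[a-z]+", text.lower()))
        for token in tokens:
            if token in lookup:
                counts[(token, lookup[token])] += 1
    return counts
```

For the two records "The battery is excellent" and "Slow screen, broken battery", the concept "battery" is counted in two records under the hypothetical type "Product", and each of the other four concepts is counted once.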

Text mining is an iterative process in which extraction results are reviewed according to the context of the text data, fine-tuned to produce new results, and then reevaluated. You can refine extraction results by modifying the linguistic resources. You can do part of this fine-tuning directly from the Extraction results pane or the Data pane.

Data pane

The Data pane is located on the right. This pane presents a table containing the documents or records that correspond to a selection in another area of the view. Only the text that corresponds to the current selection appears in the Data pane. After you make a selection, click Display to populate the Data pane with the corresponding text.

If you have a selection in another pane, the corresponding documents or records show the concepts highlighted in color to help you identify them in the text. You can also hover over a color-coded item to display a tooltip showing the name of the concept under which it was extracted and the type to which it was assigned.

Searching and finding in the Categories and concepts view

In some cases, you may need to locate information quickly in a particular section. Using the Find toolbar, you can enter the string you want to search for and define other search criteria such as case sensitivity or search direction. Then you can choose the pane in which you want to search.

  1. In the Find field at the top of your screen, type the string you want to search for. You can use the up and down arrow buttons to control the direction of the search.
  2. From the drop-down list, select the name of the pane in which you want to search, and then click one of the arrow buttons. If a match is found, the text is highlighted in the window.
  3. To look for the next match, click the arrow button again.
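The search criteria described above (search string, case sensitivity, and direction) can be sketched in Python as follows. This is an illustration of the idea, not the workbench's implementation: the `find_next` function and its parameters are hypothetical, a pane is modeled as a plain list of strings, and the search is assumed to wrap around when it reaches the end of the pane, which the product may or may not do.

```python
def find_next(rows, query, start=None, forward=True, case_sensitive=False):
    """Return the index of the next row that contains `query`,
    or None if no row matches.

    rows  -- the pane's contents, modeled as a list of strings
    start -- index of the current match, or None to search from the edge
    """
    if not rows or not query:
        return None
    n = len(rows)
    step = 1 if forward else -1
    if start is None:
        # Begin just before the first row (forward) or just after the last (backward).
        start = -1 if forward else n
    needle = query if case_sensitive else query.lower()
    # Visit every row once, wrapping around (an assumed behavior).
    for offset in range(1, n + 1):
        i = (start + step * offset) % n
        text = rows[i] if case_sensitive else rows[i].lower()
        if needle in text:
            return i
    return None
```

Calling `find_next(rows, "battery")` returns the first matching row, and passing that index back as `start` corresponds to clicking the arrow button again to move to the next match.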