Skip to content

Filter, search, and browse documents

Overview

Filtering documents in a categorization project—in list and detail view—is a very useful feature, because it allows you to focus on a restricted set of texts to be analyzed.

Not only is it possible to filter documents by various items, such as categories, annotations, entities and tokens, but also to directly search for them in the search bar.

Filter by Annotation

Filter the annotated or not annotated documents

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with annotations to select the annotations as positive filter, that means annotated documents.

    • Double-click Documents with annotations to select the annotations as negative filter, that means not annotated documents.

  3. Select Filter documents.

To turn the filter off:

  • Double-click Documents with annotations or click Documents with annotations.

Or:

  • Select the X beside the ANN chip.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

To reset filters, select Reset .

Note

  • In list view, you can see some filter chips that can be turned off with the same procedure.
  • If you turn off or reset your filters in the Filter documents dialog, then select Filter documents to apply changes.

Filter documents by the annotation category in the list view

If you want to filter the documents with a specific annotation value, in the left panel, Taxonomy tab, choose the Annotations sub-panel and then double-click the category of interest.

Note

Select Open or Close to expand and collapse the panels.

The central panel displays the filtered document list.

Example

If you select sport in the Annotations sub-panel, the documents annotated with this category are shown in the central panel and the annotation chip is inserted in the search bar as filtering item notification.

Filter by Categories

It is possible to find the categorized documents after at least an experiment on a project.

Filter the categorized documents

To filter the categorized documents:

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with categories to select the annotations as positive filter, that means categorized documents.
    • Double-click Documents with categories to select the annotations as negative filter, that means not categorized documents.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Documents with categories or click Documents with categories.

Or:

  • Select the X beside the CAT chip.

The central panel, if in list view, or the left panel if in detail view, displays the filtered document list.

To reset filters, select Reset .

Note

If you turn off or reset your filters in the Filter documents dialog, then select Filter documents to apply changes.

Filter documents by the category value in the list view

If you want to select the documents with specific categorization values, double-click the categories of interest in the left panel, Taxonomy tab, Categories subpanel.

Note

Select Open or Close to expand and collapse the panels.

The central panel displays the filtered document list.

Example

If you select sport in the Categories subpanel, the documents to which this category has been assigned are shown in the central panel and the categorization chip is inserted in the search bar as filtering item notification.

Filter by Entities

Filter documents by specific entities

  1. Select Filters .
  2. In the Filter documents window, Entities tab, click the entities of interest for positive filtering, double-click for negative filtering.
  3. Select Filter documents.

To turn the filter off:

  • Double-click or click .

Or:

  • Select the X beside the entity chip.

The central panel, if in list view, or the left panel, if in detail view, will display the filtered document list.

To reset filters, select Reset .

Note

If you turn off or reset your filters in the Filter documents dialog, then select Filter documents to apply changes.

Example

To filter the documents that contain People and Company, but not Mass media:

  1. Click People and Company.
  2. Double-click Mass media.
  3. Select Filter documents.

Filter documents by specific entity value in list view

If you want to filter the documents that contain specific entity values:

  1. In the left panel, select the Entities tab.
  2. Expand the entity of interest, then double-click the occurrence.

The central panel displays the filtered document list.

Example

If you want to select the documents containing Barack Obama, which is a human entity, in the Entities tab, expand People , then double-click Barack Obama.

The documents that contain this entity value are shown in the central panel and the related chip is inserted in the search bar as filtering item notification.

Filter documents by specific token values in the list view

If you want to filter the documents that contain a specific token value:

  1. In the left panel, select the Tokens tab.
  2. Expand the token type, then double-click the occurrence of interest.

The central panel displays the filtered document list.

Example

If you want to select the documents related to politics, in the Tokens tab, expand Main Topics , then double-click politics.

The documents related to this topic are shown in the central panel and the related chip is inserted in the search bar as filtering item notification.

Filtering by item combination in the list view

It is possible to filter documents combining more than an item and using boolean operators.

Example

If you want to select the documents related to Barack Obama's presidential actiivity:

  1. In the Entities tab, expand People , then double-click Barack Obama.
  2. In the Tokens tab, expand Keywords , then double-click president.

The central panel shows the documents related to Barack Obama and president and the related chips are inserted in the search bar as filtering items notifications. Note that the Boolean operator AND is added.

Filter by validated documents

To filter the validated documents:

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Validated documents to select the validation as positive filter, that means validated documents.
    • Double-click Validated documents to select the validation as negative filter, that means not validates documents.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Validated documents or click Validated documents to turn the filter off.

Or:

  • Select the X beside the Validated chip.

The central panel, if in list view, or the left panel if in detail view, displays the filtered document list.

To reset filters, select Reset .

Note

If you turn off or reset your filters in the Filter documents dialog, then select Filter documents to apply changes.

Filter by suggestions

You can apply this filter only for a taxonomy created with the Building magic taxonomy procedure which suggests categories for documents.

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with suggestions to select the annotations as positive filter, that means documents that contain suggestions.
    • Double-click Documents with suggestions to select the annotations as negative filter, that means documents with no suggestions.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Documents with suggestions or click Documents with suggestions.

Or:

  • Select the X beside the SUG chip .

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

To reset filters, select Reset .

Note

If you turn off or reset your filters in the Filter documents dialog, then select Filter documents to apply changes.

Search for documents

Please refer to the search operation in the corpus environment. The logic is the same.

Browse documents

In detail view it is possible to browse documents by using Next document next and Previous document prev.

In list vs. not in list

When applying filters or performing searches in detail view, documents are marked by two different icons: