Skip to content

Filter, search, and browse documents

Overview

Filtering documents in a categorization project—in list and detail view—is a very useful feature, because it allows you to focus on a restricted set of texts to be analyzed.

Not only is it possible to filter documents by various items, such as categories, annotations, entities and tokens, but also to directly search for them in the search bar.

Filter by Annotation

Filter the annotated or non-annotated documents

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with annotations to select the annotations as positive filter, that means annotated documents.
    • Double-click Documents with annotations to select the annotations as negative filter, that means non-annotated documents.

    To turn the filter off:

    • Double-click Documents with annotations or click Documents with annotations.

    Or:

    • Select the X beside the ANN chip.
  3. Select Filter documents.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

To reset filters, select Reset filters .

Note

You can also remove the filters in the Filter documents window with the procedures above. In this case, select Filter documents to apply changes.

Filter documents by the annotation in the list view

To filter documents with a specific annotation:

  1. Hover over the annotation of interest in the Taxonomy panel to the right side.
  2. Select Search annotation ; the central panel displays the filtered document list.

Example

If you select politics in the Taxonomy panel, the documents annotated with this category are shown in the central panel and the annotation chip is displayed in the search bar as filtering item notification.

Or:

  1. Double-click the annotations value of your interest in Annotation sub-panel to the left side; the central panel displays the filtered document list..

Example

If you double-click politics and economy in the Annotations sub-panel, the documents annotated with this category are shown in the central panel and the annotation chips are displayed in the search bar as filtering item notification.

Filter documents by the annotation in the detail view

To filter documents with a specific annotation:

  1. Hover over the annotation of interest in the Taxonomy panel to the right side.
  2. Select Search annotation ; the central panel displays the filtered document list.

Filter by Categories

It is possible to find the categorized documents after at least an experiment on a project.

Filter the categorized documents

To filter the categorized documents:

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with categories to select the annotations as positive filter, that means categorized documents.
    • Double-click Documents with categories to select the annotations as negative filter, that means not categorized documents.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Documents with categories or click Documents with categories.

Or:

  • Select the X beside the CAT chip.

The central panel, if in list view, or the left panel if in detail view, displays the filtered document list.

To reset filters, select Reset filters .

Filter documents by the category in the list view

To filter documents with a specific category:

  1. Hover over the annotation of interest in the Taxonomy panel to the right side.
  2. Select Search annotation ; the central panel displays the filtered document list.

Or:

  1. Double-click the category value of your interest in Category sub-panel to the left side; the central panel displays the filtered document list..

Filter documents by the category in the detail view

To filter documents with a specific category:

  1. Hover over the annotation of interest in the Taxonomy panel to the right side.
  2. Select Search annotation ; the central panel displays the filtered document list.

Filter by Entities

Filter documents by specific entities

  1. Select Filters .
  2. In the Filter documents window, Entities tab, click the entities of interest for positive filtering, double-click for negative filtering.
  3. Select Filter documents.

To turn the filter off:

  • Double-click or click .

Or:

  • Select the X beside the entity chip.

The central panel, if in list view, or the left panel, if in detail view, will display the filtered document list.

To reset filters, select Reset filters .

Example

To filter the documents that contain People and Companies, but not Mass media:

  1. Click People and Companies.
  2. Double-click Mass media.

Filter documents by specific entity values in list view

If you want to filter the documents that contain specific entity values:

  1. In the left panel, select the Entities tab.
  2. Expand the entity of interest, then double-click the occurrence.

The central panel displays the filtered document list.

Example

If you want to select the documents containing United States, which is a geographic entity, in the Entities tab, expand Geography , then double-click United States.

The documents that contain this entity value are shown in the central panel and the related chip is inserted in the search bar as filtering item notification.

Filter documents by specific entity values in detail view

If you want to filter the documents that contain a specific entity value:

  1. In the right panel, select the Entities tab.
  2. Expand the token type, then double-click the occurrence of interest. The occurrences are set as search criteria in the Search bar.

The document list in the left panel is updated accordingly.

Filter by tokens

Filter documents by specific token values in the list view

If you want to filter the documents that contain a specific token value:

  1. In the left panel, select the Tokens tab.
  2. Expand the token type, then double-click the occurrence of interest.

The central panel displays the filtered document list.

Example

If you want to select the documents related to sport, in the Tokens tab, expand Main Topics , then double-click sport.

The documents related to this topic are shown in the central panel and the related chip is inserted in the search bar as filtering item notification.

Filter documents by specific token values in the detail view

If you want to filter the documents that contain a specific token value:

  1. In the right panel, select the Tokens tab.
  2. Expand the token type, then double-click the occurrence of interest. The occurrences are set as search criteria in the Search bar.

The document list in the left panel is updated accordingly.

Filtering by items combination

It is possible to filter documents combining entities and tokens.

Filter by validated documents

To filter the validated documents:

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Validated documents to select the validations as positive filter, that means validated documents.
    • Double-click Validated documents to select the validations as negative filter, that means not validates documents.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Validated documents or click Validated documents to turn the filter off.

Or:

  • Select the X beside the Validated chip.

The central panel, if in list view, or the left panel if in detail view, displays the filtered document list.

To reset filters, select Reset filters .

Filter by suggestions

You can apply this filter only for a taxonomy created with the Building magic taxonomy procedure which suggests categories for documents.

  1. Select Filters .
  2. In the Filter documents window, Resources tab:

    • Click Documents with suggestions to select the suggestions as positive filter, that means documents that contain suggestions. .
    • Double-click Documents with suggestions to select the suggestions as negative filter, that means documents with no suggestions.

    To turn the filter off:

    • Double-click Documents with suggestions

    Or:

    • Click Documents with suggestions.

    Or:

    • Select the X beside the SUG chip .
  3. Select Filter documents.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

To reset filters, select Reset filters

Search for documents

Refer to the article about search operations. The logic is the same.

Search multiple categories extractions

To search multiple category extractions:

  1. Select the concepts using CTRL+CLICK in the Resources panel of the Resources tab.
  2. Select ; the search criteria is displayed in the search bar and the resulting documents are filtered and displayed in the Documents tab.

    Info

    It is possible to change search criteria directly in the search bar as described in Search.

Browse documents

In the detail view, it is possible to browse documents by using Next document next and Previous document prev.

The number of the current document, which is updated accordingly, and the total number of documents in the dataset are displayed in between .

In the detail view, if a document is related to others, select Related documents related to display them in the Related Documents panel.

  • Select show more to have a preview of a selected document or show less to reduce the view.
  • To open a document in the list, double-click it.
  • To change the sort order in the Related Documents panel, select the preferred option from the drop-down menu at the top right.
  • To show again the original tabs in the right pane, select Value value.

In list vs. not in list

When applying filters or performing searches when in detail view, documents are marked by two different icons: