Filter, search, and browse documents

Overview

Filtering documents in a categorization project, in both list and detail view, lets you focus on a restricted set of texts to analyze.

You can filter documents by items such as categories, annotations, entities and tokens, or search for documents directly in the search bar.

Filter by Annotation

Filter the annotated or not annotated documents

  1. Select Filters.
  2. In the Filter documents window, Resources tab:

    • Click Documents with annotations to apply it as a positive filter, which shows the annotated documents.
    • Double-click Documents with annotations to apply it as a negative filter, which shows the documents that are not annotated.

    To turn the filter off:

    • Double-click Documents with annotations or click Documents with annotations.

    Or:

    • Select the X beside the ANN chip.
  3. Select Filter documents.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered documents list.

To reset filters, select Reset.
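
The positive and negative filter behavior described above can be pictured with a small sketch. It is purely illustrative and does not reflect the product's implementation or any API it exposes: the `Document` class, its `annotations` field, and the `filter_by_annotation` function are hypothetical names assumed for the example.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Document:
    # Hypothetical document model: a name plus the categories annotated on it.
    name: str
    annotations: set[str] = field(default_factory=set)


def filter_by_annotation(docs: list[Document], mode: Optional[str]) -> list[Document]:
    """Return the documents that match the filter mode.

    mode="positive" keeps annotated documents (a single click in the UI),
    mode="negative" keeps documents without annotations (a double-click),
    mode=None means the filter is off, so every document is kept.
    """
    if mode is None:
        return list(docs)
    if mode == "positive":
        return [d for d in docs if d.annotations]
    if mode == "negative":
        return [d for d in docs if not d.annotations]
    raise ValueError(f"unknown filter mode: {mode!r}")


# Made-up sample data for illustration only.
docs = [
    Document("a.txt", {"politics"}),
    Document("b.txt"),
    Document("c.txt", {"sport"}),
]
annotated = [d.name for d in filter_by_annotation(docs, "positive")]
not_annotated = [d.name for d in filter_by_annotation(docs, "negative")]
```

The same three-state idea (positive, negative, off) would apply to the other Resources filters described on this page.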

Filter documents by the annotation category in the list view

To filter the documents with a specific annotation value, go to the Taxonomy tab of the left panel, choose the Annotations sub-panel, then double-click the category of interest.

The central panel displays the filtered documents list.

Example

If you select politics in the Annotations sub-panel, the central panel shows the documents annotated with this category, and the annotation chip appears in the search bar to indicate the active filter.

Filter by Categories

Categorized documents are available only after at least one experiment has been run on the project.

Filter the categorized documents

To filter the categorized documents:

  1. Select Filters.
  2. In the Filter documents window, Resources tab:

    • Click Documents with categories to apply it as a positive filter, which shows the categorized documents.
    • Double-click Documents with categories to apply it as a negative filter, which shows the documents that are not categorized.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Documents with categories or click Documents with categories.

Or:

  • Select the X beside the CAT chip.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered documents list.

To reset filters, select Reset.

Filter documents by the category value in the list view

If you want to select the documents with specific categorization values, double-click the categories of interest in the left panel, Taxonomy tab, Categories sub-panel.

The central panel displays the filtered documents list.

Example

If you select politics in the Categories sub-panel, the central panel shows the documents to which this category has been assigned, and the categorization chip appears in the search bar to indicate the active filter.

Filter by Entities

Filter documents by specific entities

  1. Select Filters.
  2. In the Filter documents window, Entities tab, click the entities of interest for positive filtering, or double-click them for negative filtering.
  3. Select Filter documents.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered documents list.

To turn the filter off:

  • Double-click or click the entity.

Or:

  • Select the X beside the entity chip.

To reset filters, select Reset.

Example

To filter the documents that contain People and Company, but not Mass media:

  1. Click People and Company.
  2. Double-click Mass media.
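
The effect of this example can be sketched with set operations. The sketch assumes that several positive entity filters must all be present in a document and that a negative filter excludes any document containing that entity; the `matches` function and the sample corpus are hypothetical and are not part of the product.

```python
def matches(entity_types: set[str], include: set[str], exclude: set[str]) -> bool:
    """True if a document has every included entity type and none of the excluded ones."""
    return include <= entity_types and entity_types.isdisjoint(exclude)


# Made-up corpus: document name -> entity types found in the document.
corpus = {
    "doc1": {"People", "Company"},
    "doc2": {"People", "Company", "Mass media"},
    "doc3": {"People"},
}

# Click People and Company (positive), double-click Mass media (negative).
selected = sorted(
    name
    for name, types in corpus.items()
    if matches(types, include={"People", "Company"}, exclude={"Mass media"})
)
```

Here only `doc1` is kept: `doc2` contains the excluded Mass media entity and `doc3` lacks Company.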

Filter documents by specific entity values in list view

If you want to filter the documents that contain specific entity values:

  1. In the left panel, select the Entities tab.
  2. Expand the entity of interest, then double-click the occurrence.

The central panel displays the filtered documents list.

Example

If you want to select the documents containing United States, which is a geographic entity, in the Entities tab, expand Geography, then double-click United States.

The central panel shows the documents that contain this entity value, and the related chip appears in the search bar to indicate the active filter.

Filter documents by specific token value in the list view

If you want to filter the documents that contain a specific token value:

  1. In the left panel, select the Tokens tab.
  2. Expand the token type, then double-click the occurrence of interest.

The central panel displays the filtered documents list.

Example

If you want to select the documents related to sport, in the Tokens tab, expand Main Topics, then double-click sport.

The central panel shows the documents related to this topic, and the related chip appears in the search bar to indicate the active filter.

Filter by item combination in the list view

You can filter documents by combining more than one item with Boolean operators.

Example

If you want to select the documents related to the presidential election in the United States:

  1. In the Tokens tab, expand Keywords, then double-click president.
  2. In the Entities tab, expand Geography, then double-click United States.

The central panel shows the documents related to both president and United States, and the related chips appear in the search bar to indicate the active filters. Note that the Boolean operator AND is added.
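
One way to picture the AND combination is as a conjunction of individual filter predicates. The following is a minimal sketch under that assumption; the `has_value` and `combine_and` helpers, the field names, and the sample documents are hypothetical and do not describe how the product evaluates filters.

```python
from typing import Callable

Predicate = Callable[[dict], bool]


def has_value(field_name: str, value: str) -> Predicate:
    """Build a filter that checks whether a document field contains a value."""
    def predicate(doc: dict) -> bool:
        return value in doc.get(field_name, set())
    return predicate


def combine_and(*predicates: Predicate) -> Predicate:
    """Combine filters so that a document must satisfy all of them."""
    return lambda doc: all(p(doc) for p in predicates)


# Made-up documents: each one lists its keywords and geographic entities.
docs = {
    "doc1": {"keywords": {"president", "election"}, "geography": {"United States"}},
    "doc2": {"keywords": {"president"}, "geography": {"France"}},
    "doc3": {"keywords": {"economy"}, "geography": {"United States"}},
}

# Equivalent of the two chips joined by AND in the search bar.
active_filter = combine_and(
    has_value("keywords", "president"),
    has_value("geography", "United States"),
)
result = sorted(name for name, doc in docs.items() if active_filter(doc))
```

Only `doc1` satisfies both conditions, which mirrors how adding a second chip narrows the list.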

Filter by validated documents

To filter the validated documents:

  1. Select Filters.
  2. In the Filter documents window, Resources tab:

    • Click Validated documents to apply it as a positive filter, which shows the validated documents.
    • Double-click Validated documents to apply it as a negative filter, which shows the documents that are not validated.
  3. Select Filter documents.

To turn the filter off:

  • Double-click Validated documents or click Validated documents.

Or:

  • Select the X beside the Validated chip.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered documents list.

To reset filters, select Reset.

Filter by suggestions

This filter applies only to a taxonomy created with the Building magic taxonomy procedure, which suggests categories for documents.

  1. Select Filters.
  2. In the Filter documents window, Resources tab:

    • Click Documents with suggestions to apply it as a positive filter, which shows the documents that contain suggestions.
    • Double-click Documents with suggestions to apply it as a negative filter, which shows the documents with no suggestions.

    To turn the filter off:

    • Double-click Documents with suggestions.

    Or:

    • Click Documents with suggestions.

    Or:

    • Select the X beside the SUG chip.
  3. Select Filter documents.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered documents list.

To reset filters, select Reset.

Search for documents

Please refer to the search operation in the corpus environment. The logic is the same.

Browse documents

In detail view, you can browse documents by using Next document and Previous document.

In detail view, if a document is related to others, select Related documents to display them in the Related Documents panel.

Select show more to have a preview of a selected document or show less to reduce the view.

To open a document in the list, double-click it.

To change the sort order in the Related Documents panel, select the preferred option from the drop-down menu at the top right.

To show the original tabs in the right pane again, select Value.

In list vs. not in list

When you apply filters or perform searches in detail view, documents are marked by two different icons: