Filter and search documents

Overview

Filtering documents in a thesaurus project, in both list and detail view, lets you focus on a restricted set of texts to analyze.

Filter documents based on presence or absence of annotations, extractions or validations

To filter a list of documents based on the presence or absence of annotations, extractions and validations, check the procedure described in the dedicated page. Once you open the Filter documents window, select the Resources tab and check the available options:

  • Validated documents
  • Documents with annotations
  • Documents with extractions

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

Filter by concept

The document lists in the Documents and Experiments panels can be filtered by specific extractions, annotations or thesaurus concepts.

  1. In the left panel, select the Thesaurus tab. It displays concepts that have been annotated in the current library and concepts that were extracted at least once during the selected experiment.
  2. Select Open or Close beside Extractions and Annotations to expand or collapse the lists.
  3. Double-click one or more items in the Extractions and Annotations lists. Double-clicked items become search criteria shown in the search box, where they can be edited as described in the article about search.

Info

In the Thesaurus panel, the number on the right of Extractions and Annotations is the number of concepts with at least one extraction or annotation, not the total number of extractions or annotations.
The number on the right of each concept in the lists is the number of documents in which a concept has been extracted or annotated—possibly multiple times—and not the total number of extractions or annotations of the concept.
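
The difference between these counts can be illustrated with a minimal Python sketch. It is not part of the product: the data and the names (extractions, documents_per_concept) are hypothetical and only model the counting rules described above.

```python
from collections import Counter

# Hypothetical data: for each document, the concepts extracted from it.
# A concept can be extracted several times in the same document.
extractions = {
    "doc1.txt": ["loan", "loan", "mortgage"],
    "doc2.txt": ["loan"],
    "doc3.txt": [],
}

def documents_per_concept(extractions_by_doc):
    """Count, for each concept, the documents that contain it at least once."""
    counts = Counter()
    for concepts in extractions_by_doc.values():
        counts.update(set(concepts))  # set() ignores repeats within one document
    return counts

per_concept = documents_per_concept(extractions)

# Number beside "Extractions": concepts with at least one extraction.
concept_count = len(per_concept)

# Total number of extractions: this is NOT the number that is displayed.
total_extractions = sum(len(concepts) for concepts in extractions.values())
```

In this made-up example, "loan" is extracted three times overall but counts as 2 because it occurs in two documents, and the number beside Extractions would be 2 (two distinct concepts), not 4.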

Filter by entity

Filter documents by entity types

To filter a list of documents based on the presence or absence of specific entity types, check the procedure described in the dedicated page. Once you open the Filter documents window, select the Entities tab and check the entities of interest.

The central panel, if in list view, or the left panel, if in detail view, displays the filtered document list.

Filter by entity value

  • In the list view:

    1. In the left panel, select the Entities tab.
    2. Select an entity type.
    3. Double-click an entity.
    4. Repeat from step 2 or 3 to add more entities.
  • In the detail view:

    1. In the right panel, select the Entities tab.
    2. Double-click an entity.
    3. Repeat from step 2 to add more entities.

Each double-clicked entity becomes a search criterion shown in the search box, where it can be edited as described in the article about search.

Filter documents by entity, extraction, validation, annotation in the Analytics sub-tab

Filters can also be applied in areas of the project other than the list view and the detail view. Check the project Analytics page to see how to apply filters from there.

Filter by token value

  • In the list view:

    1. In the left panel, select the Tokens tab.
    2. Select a token type.
    3. Double-click a token value.
    4. Repeat from step 2 to add more tokens.
  • In the detail view:

    1. In the right panel, select the Tokens tab.
    2. Expand a token type.
    3. Double-click a token.
    4. Repeat from step 2 or 3 to add more tokens.

Double-clicked tokens become search criteria and are displayed in the search box, where they can be edited as described in the article about search.

Filter by Main topics

To filter on a main topic, when in detail view, double-click one or more topics displayed in the Main topics strip above the text of the document.

Filter by document name

To filter documents based on file name, in the list view (or in the Documents Preview panel of the Resources tab) enter the file name or part of it (at least 3 characters) in the search box above the list of documents and press Enter. Only documents whose file name contains the specified string are displayed.
To cancel the filter, select the icon on the right of the search bar.
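
The matching rule can be sketched in a few lines of Python. This is only an illustration of substring filtering, not the product's implementation: the function name filter_by_name is hypothetical, and both the case-insensitive matching and the handling of queries shorter than 3 characters are assumptions.

```python
def filter_by_name(file_names, query, min_length=3):
    """Return the file names that contain `query` as a substring."""
    if len(query) < min_length:
        # Assumption: a query that is too short leaves the list unfiltered.
        return list(file_names)
    needle = query.lower()  # assumption: matching ignores case
    return [name for name in file_names if needle in name.lower()]

# Hypothetical document list.
documents = ["contract_2021.pdf", "Contract_draft.txt", "invoice.pdf"]
matches = filter_by_name(documents, "contr")
```

With this sample list, the query "contr" keeps the two contract files and hides invoice.pdf.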

Filter by language

To filter documents based on their language, in list or in detail view—or in the Documents Preview panel of the Resources tab—select one of the options from the drop-down menu above the document list.

"In list" vs. "Not in list"

When applying filters or performing searches while in detail view, documents are marked by two different icons: