Edit project settings

Overview

The settings of thesaurus projects are the same as those of all other project types, except for the tabs described on this page.

See the dedicated page on how to manage the common settings tabs.

Languages

The Languages tab allows you to set the project languages and choose the favorite language among them.

If you check only one language, it will be automatically set as the favorite.

Thesaurus URI

The Thesaurus URI tab allows you to define the URIs for the thesaurus concepts and choose a favorite URI among them.

  • To add a new URI, select Add new URI prefix and enter the new identifier.
  • To change an existing URI, select the URI and edit it.
  • To delete an existing URI, hover over it and select the trash bin icon.
  • If the URI contains characters that are not allowed, such as white spaces, hover over the URI and select Encode URI to encode it.

    Note

    You can delete an existing URI only if there are at least two URIs defined.

  • Select the URI you want to use and optionally customize it by adding the:

    • Project name
    • Concept label
    • Concept ID

    Note

    If you add the concept ID alone, you can't also add the concept label.

  • When done, select Save.
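
To illustrate what encoding a URI typically means, the following minimal sketch percent-encodes the characters that are not allowed in a URI, such as white spaces. It uses Python's standard urllib.parse.quote function. The encode_uri helper and the example.org address are hypothetical, and the result of the Encode URI command in the application may differ.

```python
from urllib.parse import quote

# Characters that have a structural role in a URI and are left untouched.
# "%" is included so that an already encoded URI is not encoded twice.
RESERVED = ":/?#[]@!$&'()*+,;=%"


def encode_uri(uri: str) -> str:
    """Percent-encode characters that are not allowed in a URI (hypothetical helper)."""
    return quote(uri, safe=RESERVED)


# The white space in the prefix becomes %20.
print(encode_uri("http://example.org/my thesaurus/"))
```

A prefix that contains only allowed characters is returned unchanged, so applying the function more than once is harmless.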

Extraction

The models generated during experiments extract the occurrences of the thesaurus concepts from documents. In the Extraction tab you find the settings that affect extraction and are used for new concepts. These settings can be overridden at the concept level.

Extraction method

The extraction method is the way generated models use concept labels to determine which portions of text to extract.

Possible methods are:

  • Semantic: the model extracts all the portions of text that express the same meaning as the concept labels, in any inflected form. For example, if the label is sandglass, the model will extract sandglass, hourglass, sandglasses, hourglasses.
  • Base form: the label is considered a lemma (the base form or dictionary entry for a term) and all its inflections are extracted. For example, if the label is sandglass, the model will extract sandglass and sandglasses.
  • Exact label: the model will extract exact matches of the labels.
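
The difference between the three methods can be sketched with a toy example built on the sandglass label. The code below is only an illustration: the LEMMAS and SYNONYMS tables are hand-written stand-ins for the linguistic analysis performed by the generated models, and the function names and method identifiers are hypothetical.

```python
# Toy lookup tables (assumptions): real models rely on linguistic analysis.
LEMMAS = {"sandglasses": "sandglass", "hourglasses": "hourglass"}
SYNONYMS = {"sandglass": {"sandglass", "hourglass"}}


def lemma(token: str) -> str:
    """Return the base form of a token, or the token itself if unknown."""
    return LEMMAS.get(token, token)


def matches(token: str, label: str, method: str) -> bool:
    """Decide whether a token expresses the label under the given method."""
    if method == "exact":
        # Exact label: only literal matches of the label.
        return token == label
    if method == "base_form":
        # Base form: any inflection whose lemma is the label.
        return lemma(token) == label
    if method == "semantic":
        # Semantic: any inflection of any term with the same meaning.
        return lemma(token) in SYNONYMS.get(label, {label})
    raise ValueError(f"unknown method: {method}")


def extract(text: str, label: str, method: str) -> list[str]:
    """Return the tokens of the text that match the label."""
    tokens = [t.strip(".,;:!?") for t in text.split()]
    return [t for t in tokens if matches(t, label, method)]


text = "sandglass hourglass sandglasses hourglasses clock"
for method in ("exact", "base_form", "semantic"):
    print(method, extract(text, "sandglass", method))
```

Each method in this sketch matches a superset of the previous one: the exact match finds only sandglass, the base form adds sandglasses, and the semantic match also adds hourglass and hourglasses.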

Context settings

The context is the subdivision of the text or the sequence of subdivisions in which to search for the expressions of the concept to be extracted.

Warning

Not all parts of a text correspond to clauses. For example, a heading like:

Disclaimer

may not be considered a clause, so be aware that if you set Clause as context, there may be portions of the document text in which extraction will not take place.

The context is also used together with co-occurrence constraints, that is, terms that must or must not occur in the vicinity of the term corresponding to the concept to be extracted. Co-occurrence constraints can be set at the concept level.
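
The interplay between context and co-occurrence constraints can be sketched as follows. This is a simplified illustration under stated assumptions: the text is split into sentences with a naive regular expression, which stands in for whatever subdivision is chosen as context, and the split_contexts and extract_in_context functions are hypothetical, not part of the product.

```python
import re


def split_contexts(text: str) -> list[str]:
    """Naive stand-in for the context subdivision: split on sentence-ending punctuation."""
    return [part.strip() for part in re.split(r"[.!?]+", text) if part.strip()]


def extract_in_context(text, term, must=(), must_not=()):
    """Return the contexts containing the term that satisfy the constraints."""
    hits = []
    for context in split_contexts(text):
        words = set(context.lower().split())
        if term not in words:
            continue  # the term does not occur in this context
        if not all(word in words for word in must):
            continue  # a required co-occurring term is missing
        if any(word in words for word in must_not):
            continue  # a forbidden co-occurring term is present
        hits.append(context)
    return hits


text = "The sandglass measures time. The sandglass is a logo in the software."
print(extract_in_context(text, "sandglass", must=["time"]))
print(extract_in_context(text, "sandglass", must_not=["software"]))
```

In both calls only the first sentence is returned, because the constraints are evaluated inside each context separately: a term that appears in a different context does not count as co-occurring.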

Approval status

As in the Edit Concept panel in the Resources tab, you can confirm the validation status of a concept. See the dedicated article for further details.

Suggestions

The Suggestions tab allows you to manage and filter the list of suggestions for your concepts.

Check or uncheck the concept suggestion options to choose which ones to make available, then select Save.

Note

To see the changes take effect, refresh the page.