Edit project settings
Thesaurus projects share some settings with all other project types. The settings that are specific to this type of project are grouped in these tabs of the project settings pop-up:
The settings of thesaurus projects are the same of all other project types, except for the:
The Languages tab allows you to set the project languages and choose the favorite language between them.
If you check only one language, it will be automatically set as the favorite.
The Thesaurus URI tab allows you to define the URIs for the thesaurus's concepts and choose a favorite URI between them.
- To add a new URI, select Add new URI prefix and enter the new identifier.
- To change an existing URI, select the URI and edit it.
- To delete an existing URI, hover over it and select the trash bin icon .
In case of characters that are not allowed in the URI, like white spaces, hover over the URI and select Encode URI to encode it.
You can delete an existing URI only if there are at least two URIs defined.
Select the URI you want to use and optionally customize it by adding the:
- Project name
- Concept label
- Concept ID
If you add the concept ID alone, you can't also add the concept label.
When done, select Save.
The models generated during experiments extract the occurrences of the thesaurus concepts from documents. In the Extraction tab you find the settings that affect extraction and are used for new concepts. These settings can be overridden at the concept level.
Extraction method is the way generated models will use concepts' labels to determine the portions of text to extract.
Possible methods are:
- Semantic: all the portions of text expressing the same meaning of the concept labels, in any inflected form. For example, if the label is sandglass, the model will extract sandglass, hourglass, sandglasses, hourglasses.
- Base form: the label is considered as a lemma (the base form or dictionary entry for a term) and all the inflections of it are extracted. For example, if the label is sandglass the model will extract sandglass and sandglasses.
- Exact label: the model will extract exact matches of the labels.
The context is the subdivision of the text or the sequence of subdivisions in which to search for the expressions of the concept to be extracted.
Not all parts of a text correspond to clauses. For example, a heading like:
may not be considered a clause, so be aware that if you set Clause as context, there may be portions of the document text in which extraction will not take place.
The context also serves in combination with the specification of terms that must or must not co-occur in the vicinity of the term corresponding to the concept to be extracted. Co-occurrence constraints can be set at the concept level.
Like the Edit Concept panel in the Resources tab, you can confirm a concept status in terms of its validation. See the dedicated article for further details.
The Suggestions tab allows you to manage and filter the list of suggestions for your concepts.
Check and uncheck the available concept suggestion options you want to make available, then select Save.
To view the effective changes, refresh the page.