Skip to content

Make experiments

Overview

Once the resources and the concepts have been set up and the documents have been annotated, you can start experiments that consist of creating a model and applying it to a library.

An experiment process is based on:

The library, or data set, consists of an annotated documents set that helps the engine to learn.

The engine parses the test library in order to give the analysis results.

Platform provides one type of engine for thesaurus projects, namely Thesaurus generation.

To start an experiment:

  1. In the upper bar, select Start an experiment .
  2. In the Start an experiment window:

    • Enter the experiment name in Name or leave the bar empty for an automatic one.
    • Select the Test library, then select Next.
  3. Check the summary, then select Start.

The run progress window is displayed during the engine process.

Note

To terminate the process before its end, select Delete experiment.

The process consists of six sequential stages:

  1. Initialization
  2. Model generation preparation
  3. Model generation
  4. Document analysis preparation
  5. Document analysis
  6. Experiment wrap-up

Once the process is completed, the analytics are displayed in the Analytics sub-tab and you can start to interpret the results.

Note

If the experiment fails, The tab Info appears displaying information and the type of errors. You can check also the Activity log tab for further information.

No annotations

The tab is displayed in the following form, if there are not any annotations, and so for this reason there are not quality indicators:

You can see several information, like:

  • Experiment name.
  • Performance date and time.
  • Author of the experiment.
  • Number of analyzed documents.
  • Number of documents with extractions and with annotations.

The Results coverage panel lists the number and percentages of Documents in which some of the Resources have been automatically spotted.

The Top concepts panel lists the the most automatically recognized concepts in your documents, vice versa for the Worst concepts panel.

The Statistics panel is made of the following sub-panels:

  • Documents, listing the analyzed documents.
  • Resources, listing the automatically recognized resources.

Documents

  • Select the expanding arrow and the collapsing one to expand the document and see the automatic extractions occurred.
  • Hover over a document and select Annotate document to open it in the Documents tab, detail view and start annotating it.
  • Hover over a document and select Open document to open it in the Experiments tab, Documents statistics sub-tab.
  • To sort your documents according to the number of automatically recognized concepts, select the column header on the right.

Filter documents with extractions

  • Select Only documents with extractions for positive filtering.
  • Double-click Only documents with extractions for negative filtering.
  • Select Only documents with extractions or double-click Only documents with extractions to turn off the filter.

Resources

  • Hover over a concept and select Search to perform a search according to the concept in focus.
  • Hover over a concept and select Show in resources to show the concept in the Resources panel.
  • Hover over a concept and select the information icon " to show, in case of no annotations, the preferred label only.
  • Select the column header to sort the concepts according to their hits.
  • Select a concept to view its labels and relations under Resource details.
  • Under Resource details, RELATED CONCEPTS, select the resources icon to view the concept it in the Resources tab.