Developing a text intelligence model requires an adequate number of carefully chosen documents that represent all the cases the model must be able to predict.
Annotated corpora are a powerful aid in the selection of documents for your projects.

In Annotate, a corpus is an uploaded collection of texts that has been analyzed and indexed with Natural Language Understanding (NLU) technology. The analysis and indexing let you easily determine whether a corpus fits your needs, for example whether it covers all cases or whether you need more documents.

At the end of the exploration, you will know whether a corpus is a good candidate to become one of the libraries for your project.