SCOPE overview
When defining rules, SCOPE
is the statement that allows the user to define the portion of text in which a single rule or a group of rules has to be instantiated and verified. A rule will generate a hit, only if it recognizes all the elements it contains within the defined interval.
The syntax to define a rule's scope is the following:
SCOPE scopeOption [ON ATOM]
{
rule(s)
}
Defining a scope is mandatory for each rule, but it is up to the user to decide the extent. For greater flexibility, users can choose from a variety of scope options. The options can be divided into two groups: the first relates to the standard textual divisions generated by the semantic disambiguator's text analysis; the second to the textual divisions that can be optionally defined for a specific project and/or text type.
Note
Parts between square brackets ([]
) are optional.
ON ATOM
is optional and lets your rules trigger in function of an atom-based count of the textual elements of the sentence. You can find a practical example in the positional sequences section of this manual.
-
Standard options:
PARAGRAPH
SENTENCE
CLAUSE
PHRASE
-
Custom options:
SECTION
SEGMENT
The different types of standard and custom SCOPE
options can be used by themselves or in combination, depending on the rules' goals.
Within the same project, rules can share the same scope definition; however, this is not mandatory as a different
SCOPE
can be defined for each rule in the project.The value of selecting a wider or narrower scope option can be better appreciated when the rule(s) to be managed contain(s) expressions made of two or more attributes connected by boolean operators. If a rule looks for a single element, the results obtained applying the rule on paragraphs or on single sentences will be the same. However there is a significant difference between a rule that looks for two lemmas using the AND operator within a single sentence as opposed to within two paragraphs. In fact, if the
SCOPE
isSENTENCE
, a rule could trigger only within each single sentence; it will never recognize elements which are contained in two different sentences, not even if they are adjacent. On the other hand, if theSCOPE
isPARAGRAPH
, a rule could trigger on all the text contained within a paragraph, thus going beyond a single sentence scope.
SCOPE
options can sometimes be used in combination with particular features that further restrict rules' actions to specific conditions; these conditions neither depend on the text's structure, nor on the position of the textual elements, but on the content and meaning of the text. These features are called: DOMAIN
constraints and SENTENCE
RELEVANT
constraints. The first verifies whether a document has been associated to any domain during the disambiguation process, the second acts upon the most relevant sentences of a document. In other words, these two syntaxes take into consideration the context in which words occur. For a detailed description, please refer to the dedicated pages.