Non-English Environments

Using the English-Language Query Language

The keywords (operator and modifier names) in search queries are language-specific. If, for example, the Search Cartridge is operated using the German-language setting germanx (an optional so-called locale), the German equivalents of some English operator and modifier names will be identified as operators or modifiers, respectively, even if they were meant to be search words in a query. These words "und", "oder", "nicht" and others) must be quoted when they are to be interpreted as search words. Analogously, English words such as "and", "or", "any", "not" that correspond to operator or modifier names need to be enclosed in quotes if they are meant literally.

It is possible and recommendable to use the English-language operator and modifier names in search queries. This can be done by prefixing each operator or modifier name with a Pound sign (or hash mark, depending on your keyboard layout). In order to search for the phrase "fire department" in a non-English-language environment, enter the following expression:

<#PHRASE> fire department

In explicit syntax the expression looks like this:

<#PHRASE> (fire, department)

Tokenization

In environments in which languages such as English or German are used, the Search Cartridge can interpret space characters as word separators. For these languages no special language-specific tokenization modules are required since the Search Cartridge is able to identify words, phrases, and sentences appropriately. However, for other languages, such as Japanese and Chinese, a tokenization module is required in order to determine the bounds of words. If you are using a special tokenization module, some sections of the following query language descriptions might not apply.