Field Configuration
Token Filters
Token filters apply additional processing to tokens after they have been created.
Stemmer
Stemming is the process of reducing words to their root form. In English, for example, the root form of running
and runs
is run
. The stemmer
filter can
be applied to any tokenizer.
stemmer
Available stemmers are Arabic
, Danish
, Dutch
, English
, Finnish
,
French
, German
, Greek
, Hungarian
, Italian
, Norwegian
,
Portuguese
, Romanian
, Russian
, Spanish
, Swedish
, Tamil
, and
Turkish
.
Remove Long
The remove_long
filter removes all tokens longer than a fixed number of bytes. If not specified,
remove_long
defaults to 255
.
Lowercase
The lowercase
filter lowercases all tokens. If not specified, lowercase
defaults to true
.
Was this page helpful?