mock_items table.
Choosing a Key Field
In theCREATE INDEX statement above, note the mandatory key_field option.
Every BM25 index needs a key_field, which is the name of a column that will function as a row’s unique identifier within the index.
The key_field must:
- Have a
UNIQUEconstraint. Usually this means the table’sPRIMARY KEY. - Be the first column in the column list.
- Be untokenized, if it is a text field.
Tokenizers
Understanding how tokenization works is key to getting the search results you
want, as full text search in ParadeDB (and most search tools) is centered
around token matching. For a refresher, please see the text search
overview.
ngrams tokenizer for description.
Token Filters
After tokens are created, token filters can be configured to apply further processing like lowercasing, stemming, or unaccenting. For example, the following code block adds English stemming todescription: