Diacritic insensitive in like query

Hi @couchbase_fan,

A bit more text specific and efficient/faster way of doing this would be using the “asciifolding” character filters.
You need to create a custom analyser from the FTS web console like below. This one contains only the minimum parts for this demo.

And use this custom analyser for the field to be indexed like below.

This would make all those diacritic variations searchable.
Please note this asciifolding character filter is available on 6.5.0 release.

The problem with edit distance (fuzzy query) based approach would be, its won’t scale when we have more diacritic characters present(>2, which is very normal ) in a search text, and it won’t result in the fastest query time performance.

Cheers!

1 Like