Example from Martin H:
"Katakana: does アメリカ (amerika) find アメリカン (amerikan)? It should, but it doesn't. The practical consequences are minor, though: unlike hiragana, when two katakana words follow each other there is usually a middle dot between them, which is recognized as a word boundary."
We could add bigrams for this use case, or use specialized parsing from Stanford: https://github.com/sul-dlss/sul-solr-configs/blob/master/searchworks-prod/schema.xml#L493
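To illustrate why bigrams would help here, below is a minimal Python sketch, not the actual Solr analysis chain. It assumes plain overlapping character bigrams and a phrase-style match on consecutive tokens; the function names `char_bigrams` and `bigram_phrase_match` are made up for this example.

```python
def char_bigrams(text):
    """Split text into overlapping two-character tokens (hypothetical helper)."""
    if len(text) < 2:
        # Too short to form a bigram: keep the single character, if any.
        return [text] if text else []
    return [text[i:i + 2] for i in range(len(text) - 1)]


def bigram_phrase_match(query, document):
    """True if the query's bigrams appear consecutively in the document's bigrams.

    This only approximates a phrase query over bigram tokens; a real search
    engine would do this against an index with token positions.
    """
    q = char_bigrams(query)
    d = char_bigrams(document)
    return any(d[i:i + len(q)] == q for i in range(len(d) - len(q) + 1))


# Whole-token matching fails because the two katakana words differ...
whole_word_hit = "アメリカ" == "アメリカン"

# ...while the bigram tokens of the query are all found, in order, in the longer word.
bigram_hit = bigram_phrase_match("アメリカ", "アメリカン")
```

With whole-word tokens, アメリカ and アメリカン are simply different terms, so `whole_word_hit` is false. With bigrams, アメリカ becomes アメ / メリ / リカ, which is a prefix of the bigram sequence for アメリカン, so `bigram_hit` is true. The trade-off is a larger index and potentially more false positives on unrelated words that share bigrams.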