Migrate fulltext search from Lucene to PostgreSQL #12261

koppor · 2024-12-02T21:29:25Z

Currently, JabRef employs two different search backends: PostgreSQL and Apache Lucene. PostgreSQL is used for the search within the library (.bib file) and Apache Lucene is used for the fulltext search of PDF files.

It turned out that it is really hard to get "contains" search working properly in Lucene.

The query must be tokenized using the same tokenizer as during indexing, so that the words extracted from the query can be looked up in the index.
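To illustrate the constraint, here is a minimal sketch of a token-based inverted index. It is not Lucene's API: `tokenize`, `build_index`, and `search` are hypothetical helpers, and the tokenizer (lowercase, split on non-word characters) is an assumption chosen for brevity. The point is only that indexing and querying share one tokenizer, and that a substring of a token finds nothing.

```python
import re
from collections import defaultdict


def tokenize(text: str) -> list[str]:
    # Hypothetical tokenizer: lowercase, then split on non-word characters.
    return [t for t in re.split(r"\W+", text.lower()) if t]


def build_index(docs: dict[str, str]) -> dict[str, set[str]]:
    # Map each token to the set of documents that contain it.
    index: dict[str, set[str]] = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index: dict[str, set[str]], query: str) -> set[str]:
    # The query goes through the SAME tokenizer as the documents did.
    tokens = tokenize(query)
    if not tokens:
        return set()
    hits = [index.get(t, set()) for t in tokens]
    # A document matches only if it contains every query token.
    return set.intersection(*hits)


docs = {
    "a.pdf": "Fulltext search in JabRef",
    "b.pdf": "Search backends: PostgreSQL",
}
index = build_index(docs)
whole_word_hits = search(index, "SEARCH")   # matches both documents
substring_hits = search(index, "fullte")    # matches nothing: not a whole token
```

Because lookups are per token, a "contains" query such as `fullte` cannot be answered from this kind of index without extra machinery, which is the difficulty described above.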

This will fix JabRef#235.

Handling the following variants of the same word is as hard in Postgres as it is in Lucene:

Düsseldorf
Duesseldorf
D\"{u}sseldorf
Dusseldorf
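One backend-independent way to approach these variants is to fold every spelling to a common key before indexing and before querying. The sketch below is an assumption about how that could look, not JabRef's implementation: `fold` is a hypothetical helper, it covers only the LaTeX umlaut form shown above, and folding `ae`/`oe`/`ue` is lossy (it would also turn "blue" into "blu").

```python
import re
import unicodedata

# LaTeX umlaut forms such as \"{u} or \"u (assumed subset; BibTeX has many more).
_LATEX_UMLAUT = re.compile(r'\\"\{?([aouAOU])\}?')
# German transliterations ae/oe/ue. Folding them is lossy by design.
_TRANSLITERATION = re.compile(r"([aou])e")


def fold(word: str) -> str:
    # 1. Drop the LaTeX umlaut command, keeping the base letter.
    word = _LATEX_UMLAUT.sub(r"\1", word)
    # 2. Decompose characters and drop combining marks (ü becomes u).
    decomposed = unicodedata.normalize("NFKD", word)
    word = "".join(c for c in decomposed if not unicodedata.combining(c))
    # 3. Lowercase, then fold ae/oe/ue to the bare vowel.
    return _TRANSLITERATION.sub(r"\1", word.lower())


variants = ["Düsseldorf", "Duesseldorf", r'D\"{u}sseldorf', "Dusseldorf"]
keys = {fold(v) for v in variants}  # all four collapse to a single key
```

The same function would have to run on both the indexed text and the query, otherwise the keys would not line up.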

koppor added this to the 6.0-beta milestone Dec 2, 2024

koppor mentioned this issue Dec 2, 2024

Add initial search documentation JabRef/jabref-koppor#702

Draft

subhramit added the search label Dec 25, 2024
