You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@jena.apache.org by Osma Suominen <os...@helsinki.fi> on 2016/11/11 11:02:52 UTC

Is anyone using jena-text with Solr?

Hi,

I've been looking at doing some refactoring of jena-text code while 
working on the Lucene upgrade (JENA-1250). The way query strings are 
constructed in TextQueryPF and then just passed as potentially long and 
complex strings to TextIndexLucene/TextIndexSolr is a bit awkward for 
the new implementation of language-specific indexing that I did for 
JENA-1250, which has to choose the targeted field names dynamically. The 
old implementation didn't have this issue but it relied on a feature in 
the Lucene 4 API that was dropped in Lucene 5 because it was considered 
dangerous (for good reason).

I'm a bit stuck with the refactoring as I'd need to do some non-trivial 
changes in the TextIndex interface that is implemented by both 
TextIndexLucene and TextIndexSolr. I'm hesitant to change TextIndexSolr 
because I don't want to break it further.

TextIndexSolr has suffered some bitrot and I'm not sure if it can be 
used anymore. I remember trying to set up a Solr index for use with 
jena-text but couldn't get it to work. There is very little 
documentation about the Solr side of jena-text and no working unit tests 
(there is a test class TestDatasetWithEmbeddedSolrTextIndex but it is 
disabled with the comment "Embedded solr not supported").

So I'm left wondering whether the Solr support in jena-text serves a 
useful purpose or not. Is anyone using it? I think it should either be 
improved (starting from the documentation) or dropped, because currently 
it's just a burden on the other code within jena-text.

Opinions?

-Osma

-- 
Osma Suominen
D.Sc. (Tech), Information Systems Specialist
National Library of Finland
P.O. Box 26 (Kaikukatu 4)
00014 HELSINGIN YLIOPISTO
Tel. +358 50 3199529
osma.suominen@helsinki.fi
http://www.nationallibrary.fi