Posted to issues@lucene.apache.org by GitBox <gi...@apache.org> on 2019/10/28 20:06:22 UTC

[GitHub] [lucene-solr] cbuescher opened a new pull request #981: LUCENE-9030: Fix different Solr- and WordnetSynonymParser behaviour

URL: https://github.com/apache/lucene-solr/pull/981
 
 
   This fixes an issue where sets of equivalent synonyms in the Wordnet format
   are parsed and added to the SynonymMap in a way that causes the original
   input token to be typed as "SYNONYM" instead of "word". In addition, the
   original token doesn't appear first in the token stream output, as it does
   for equivalent Solr-formatted synonym files.
   
   Currently the WordnetSynonymParser adds all combinations of input/output
   pairs of a synset entry to the synonym map, while the SolrSynonymParser
   excludes those where the input and output term are the same. This change
   makes WordnetSynonymParser behave the same way and adds tests showing that
   the two formats now produce the same token order and types.
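
   The difference between the two pairing strategies can be illustrated with a
   minimal, self-contained sketch. It does not use the Lucene API: the class
   SynsetPairs and the method pairs are hypothetical names made up for this
   example, and comparing entries by position in the synset is an assumption
   about how "input and output term are the same" is decided.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class SynsetPairs {

    // Hypothetical helper, not part of Lucene: lists the (input, output) pairs
    // that would be registered for one set of equivalent synonyms.
    // includeSelf = true  mimics the old Wordnet behaviour (all combinations).
    // includeSelf = false mimics the Solr behaviour (no term mapped to itself).
    static List<String[]> pairs(List<String> synset, boolean includeSelf) {
        List<String[]> result = new ArrayList<>();
        for (int i = 0; i < synset.size(); i++) {
            for (int j = 0; j < synset.size(); j++) {
                if (i == j && !includeSelf) {
                    continue; // skip the pair that maps an entry to itself
                }
                result.add(new String[] {synset.get(i), synset.get(j)});
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> synset = Arrays.asList("woods", "wood", "forest");

        // All combinations, including woods -> woods, wood -> wood, ...
        System.out.println(pairs(synset, true).size());  // 9

        // Self-mappings excluded, as described for the Solr format.
        System.out.println(pairs(synset, false).size()); // 6
    }
}
```

   With the self-mappings excluded, the original term is no longer registered
   as an output for itself, which matches the case described above where the
   input token is reported as a synonym rather than as a plain word.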
   
