Posted to dev@lucene.apache.org by "Alan Woodward (JIRA)" <ji...@apache.org> on 2019/04/01 16:10:01 UTC

[jira] [Updated] (LUCENE-8730) Ensure WordDelimiterGraphFilter always emits its original token first

     [ https://issues.apache.org/jira/browse/LUCENE-8730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alan Woodward updated LUCENE-8730:
----------------------------------
    Attachment: LUCENE-8730.patch

> Ensure WordDelimiterGraphFilter always emits its original token first
> ---------------------------------------------------------------------
>
>                 Key: LUCENE-8730
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8730
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Alan Woodward
>            Assignee: Alan Woodward
>            Priority: Major
>         Attachments: LUCENE-8730.patch, LUCENE-8730.patch
>
>
> WordDelimiterFilter and WordDelimiterGraphFilter behave almost identically apart from how they set position length; the only other difference is that WDGF can sometimes emit its original token as the second output token rather than the first.  We should change WDGF to conform to the behaviour of the older filter, which will make it much easier to remove WDF entirely and cut over tests that use it incidentally.
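
To make the ordering requirement concrete, here is a minimal sketch in plain Java. It does not use Lucene and is not the code in the attached patch: the class name, the method `originalFirst`, and the sample token list are all hypothetical, and the model ignores position increments, position lengths and offsets, which a real TokenFilter would also have to keep consistent. It only shows the invariant being asked for, namely that the original token ends up first and the remaining tokens keep their relative order.

```java
import java.util.ArrayList;
import java.util.List;

public class OriginalFirstSketch {

    /**
     * Returns a copy of the emitted terms with the original term moved to
     * the front. Hypothetical helper for illustration only; a real filter
     * would work on token attributes, not on bare strings.
     */
    static List<String> originalFirst(List<String> emitted, String original) {
        List<String> out = new ArrayList<>(emitted);
        int idx = out.indexOf(original);
        if (idx > 0) {
            // Original is present but not first: move it to position 0.
            out.remove(idx);
            out.add(0, original);
        }
        // idx == 0 (already first) or idx == -1 (not emitted): unchanged.
        return out;
    }

    public static void main(String[] args) {
        // Made-up ordering in which the original token comes second;
        // this is not captured WordDelimiterGraphFilter output.
        List<String> emitted = List.of("wi", "wi-fi", "fi");
        System.out.println(originalFirst(emitted, "wi-fi")); // prints [wi-fi, wi, fi]
    }
}
```

If the original token is already first, or was not emitted at all, the list comes back unchanged, so a check like this could in principle be applied to the output of either filter when comparing them in tests.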


