You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2015/11/30 22:55:10 UTC

[jira] [Created] (LUCENE-6914) DecimalDigitFilter skips characters in some cases (supplemental?)

Hoss Man created LUCENE-6914:
--------------------------------

             Summary: DecimalDigitFilter skips characters in some cases (supplemental?)
                 Key: LUCENE-6914
                 URL: https://issues.apache.org/jira/browse/LUCENE-6914
             Project: Lucene - Core
          Issue Type: Bug
    Affects Versions: 5.4
            Reporter: Hoss Man


Found this while writing up the solr ref guide for DecimalDigitFilter. 

With input like "𝟙𝟡𝟠𝟜" ("Double Struck" 1984) the filter produces "1𝟡8𝟜" (1, double struck 9, 8, double struck 4)  add some non-decimal characters in between the digits (ie: "𝟙x𝟡x𝟠x𝟜") and you get the expected output ("1x9x8x4").  This doesn't affect all decimal characters though, as evident by the existing test cases.

Perhaps this is an off by one bug in the "if the original was supplementary, shrink the string" code path?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org