You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Dawid Weiss (Commented) (JIRA)" <ji...@apache.org> on 2012/03/01 23:08:01 UTC

[jira] [Commented] (SOLR-3185) PatternReplaceCharFilterFactory can't replace with ampersands in index

    [ https://issues.apache.org/jira/browse/SOLR-3185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13220421#comment-13220421 ] 

Dawid Weiss commented on SOLR-3185:
-----------------------------------

Are there any other filters in the chain? Because PatternReplaceCharFilterFactory itself doesn't replace any html entities so it'd be weird. Also, can you quote the XML verbatim? If you have this:

{noformat}
<charFilter class="solr.PatternReplaceCharFilterFactory" 
                    pattern="(^\w)\s[&amp;]\s(\w)" 
                    replacement="$1&amp;amp;$2" />
{noformat}
then indeed the replaced value will be:
{noformat}
$1&amp;$2
{noformat}
                
> PatternReplaceCharFilterFactory can't replace with ampersands in index
> ----------------------------------------------------------------------
>
>                 Key: SOLR-3185
>                 URL: https://issues.apache.org/jira/browse/SOLR-3185
>             Project: Solr
>          Issue Type: Bug
>          Components: Schema and Analysis
>    Affects Versions: 3.5
>            Reporter: Mike Spencer
>            Priority: Minor
>              Labels: PatternReplaceCharFilter, regex
>
> Using solr.PatternReplaceCharFilterFactory to replace 'A & B' (no quotes) with 'A&B' (no spaces) will result in 'A&amp;amp;B' being indexed. Query analysis will give the expected result of 'A&B'. I examined the index with both standalone Luke and the schema browser field and the index value is incorrect in both tools.
> This is the affected charFilter:
> <charFilter class="solr.PatternReplaceCharFilterFactory"
>                     pattern="(^\w)\s[&amp;]\s(\w)"
>                     replacement="$1&amp;amp;$2" />

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org