Posted to dev@lucene.apache.org by "Steven Rowe (JIRA)" <ji...@apache.org> on 2010/07/23 19:43:52 UTC

[jira] Created: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file
----------------------------------------------------------------

                 Key: SOLR-2013
                 URL: https://issues.apache.org/jira/browse/SOLR-2013
             Project: Solr
          Issue Type: New Feature
    Affects Versions: 3.1, Next
            Reporter: Steven Rowe
            Priority: Minor
             Fix For: 3.1, Next


Attached is a mapping file to provide the equivalent of ASCIIFoldingFilter through the MappingCharFilterFactory.

I'm not sure where this should go in the source tree.
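
For readers who have not seen a character mapping file, the idea can be sketched in a few lines of Python. This is only an illustration, not Solr's implementation: the rule syntax ("source" => "target", one rule per line, '#' for comments) is assumed here, the sample rules are hypothetical rather than copied from the attachment, and the names load_rules and fold are made up for this example. The sketch handles single-character sources and \uXXXX escapes only.

```python
import re

# Assumed rule syntax: "source" => "target", one rule per line.
RULE = re.compile(r'^"(.*)"\s*=>\s*"(.*)"\s*$')

def unescape(text):
    """Decode \\uXXXX escapes only; other escapes are out of scope for this sketch."""
    return re.sub(r'\\u([0-9A-Fa-f]{4})', lambda m: chr(int(m.group(1), 16)), text)

def load_rules(lines):
    """Build a source -> target dictionary, skipping blank and comment lines."""
    rules = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith('#'):
            continue
        match = RULE.match(line)
        if match:
            rules[unescape(match.group(1))] = unescape(match.group(2))
    return rules

def fold(text, rules):
    """Replace each mapped character; leave every other character untouched."""
    return ''.join(rules.get(ch, ch) for ch in text)

# Hypothetical excerpt in the assumed format (raw strings keep the backslashes).
SAMPLE = [
    '# illustrative rules only',
    r'"\u00C0" => "A"',
    r'"\u00E9" => "e"',
    r'"\u00C6" => "AE"',
]

rules = load_rules(SAMPLE)
print(fold('\u00C0 la carte, caf\u00E9, \u00C6sop', rules))  # A la carte, cafe, AEsop
```

Because the rules live in a text file rather than in code, a deployment could in principle add, remove, or change individual foldings without recompiling anything.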


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894991#action_12894991 ] 

Robert Muir commented on SOLR-2013:
-----------------------------------

Hi Tom: I was referring to mapping-ISOLatin1Accent.txt in the example solr/conf.

By the way, there is also a newer alternative to ASCIIFoldingFilter (but for all of Unicode), if you use the ICU contrib.

http://svn.apache.org/viewvc/lucene/dev/branches/branch_3x/lucene/contrib/icu/src/java/org/apache/lucene/analysis/icu/ICUFoldingFilter.java?view=markup


[jira] Updated: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Steven Rowe (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Rowe updated SOLR-2013:
------------------------------

    Attachment: mapping-FoldToASCII.txt

Mapping file attached.

The Perl script used to convert the mappings in ASCIIFoldingFilter.java into the mapping file format required by MappingCharFilterFactory is included in a comment at the bottom of the file.
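
The script itself is not reproduced in this thread. As a rough illustration only, a converter of this kind might look like the Python sketch below. It is not the Perl script from the attachment: the shape of the Java input (runs of case labels followed by output[outputPos++] assignments and a break), the mapping-file syntax, and the names convert and mapping_escape are all assumptions made for this example.

```python
import re

# Assumed input shape: case '\uXXXX': ... output[outputPos++] = 'c'; ... break;
CASE = re.compile(r"case\s+'\\u([0-9A-Fa-f]{4})'\s*:")
OUTPUT = re.compile(r"output\[outputPos\+\+\]\s*=\s*'(\\?.)'\s*;")

def mapping_escape(java_char):
    """Translate the body of a Java char literal into mapping-file escaping."""
    if java_char == "\\'":
        return "'"      # an apostrophe needs no escape inside double quotes
    if java_char == '"':
        return '\\"'    # a double quote must be escaped inside double quotes
    return java_char    # anything else, including an escaped backslash, is kept

def convert(java_lines):
    """Emit one '"\\uXXXX" => "target"' rule per case label."""
    rules = []
    pending = []  # code points seen since the last break
    target = []   # output characters collected for the current group
    for line in java_lines:
        case = CASE.search(line)
        if case:
            pending.append(case.group(1).upper())
            continue
        out = OUTPUT.search(line)
        if out:
            target.append(mapping_escape(out.group(1)))
            continue
        if 'break;' in line and pending:
            folded = ''.join(target)
            rules.extend('"\\u%s" => "%s"' % (cp, folded) for cp in pending)
            pending, target = [], []
    return rules

# Hypothetical input in the assumed shape.
SAMPLE_JAVA = [
    r"case '\u00C0': // [LATIN CAPITAL LETTER A WITH GRAVE]",
    r"case '\u00C1': // [LATIN CAPITAL LETTER A WITH ACUTE]",
    "  output[outputPos++] = 'A';",
    "  break;",
    r"case '\u00C6': // [LATIN CAPITAL LETTER AE]",
    "  output[outputPos++] = 'A';",
    "  output[outputPos++] = 'E';",
    "  break;",
]

for rule in convert(SAMPLE_JAVA):
    print(rule)
```

Several case labels that share one output collapse into several rules with the same target, which is why the generated file has one line per source character.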


[jira] Updated: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Steven Rowe (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Rowe updated SOLR-2013:
------------------------------

    Attachment: mapping-FoldToASCII.txt

Fixed a mistake in the Perl conversion script and in the resulting mapping for FULLWIDTH REVERSE SOLIDUS (U+FF3C): it now maps to a single escaped backslash, rather than two of them.
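
The difference between one and two escaped backslashes is easy to miss, so here is a small Python sketch of it. The unescape function below is written for this example and only assumes that the mapping parser decodes backslash escapes in the usual way (\\, \", \n, \t, \r, \b, \f, \uXXXX); it is not taken from Solr's source.

```python
def unescape(text):
    """Decode backslash escapes the way this sketch assumes the parser does."""
    simple = {'\\': '\\', '"': '"', 'n': '\n', 't': '\t',
              'r': '\r', 'b': '\b', 'f': '\f'}
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch != '\\':
            out.append(ch)
            i += 1
            continue
        if i + 1 >= len(text):
            raise ValueError('dangling backslash')
        nxt = text[i + 1]
        if nxt == 'u':
            hex_digits = text[i + 2:i + 6]
            if len(hex_digits) != 4:
                raise ValueError('truncated \\u escape')
            out.append(chr(int(hex_digits, 16)))
            i += 6
        elif nxt in simple:
            out.append(simple[nxt])
            i += 2
        else:
            raise ValueError('unknown escape: \\' + nxt)
    return ''.join(out)

# Raw strings keep the backslashes exactly as they would appear in the file.
source = unescape(r'\uFF3C')        # FULLWIDTH REVERSE SOLIDUS
fixed_target = unescape(r'\\')      # one escaped backslash
broken_target = unescape(r'\\\\')   # two escaped backslashes

print(len(fixed_target), len(broken_target))  # 1 2
```

Under these assumptions, a rule whose target holds two escaped backslashes would fold the fullwidth character to two backslash characters, whereas a single escaped backslash yields the intended one-character replacement.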


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Tom Burton-West (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894973#action_12894973 ] 

Tom Burton-West commented on SOLR-2013:
---------------------------------------

Steven and Robert,

Thanks for contributing this, Steven. It is a really good idea. A map file seems much more flexible than the hard-coded case statements.

Robert, in your comment above you mention that Solr includes a mapping based on the deprecated ISOLatin1AccentFilter.
Could you please point me to where I can find that mapping file in SVN?

Otherwise, I'll just adapt the Perl code here and run it against the ISOLatin1AccentFilter code.

We haven't switched to the newer ASCIIFoldingFilter yet, and we need to emulate the ISOLatin1AccentFilter in some custom non-Java code until we make the switch and re-index all 6 million volumes.

Tom Burton-West


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Steven Rowe (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894999#action_12894999 ] 

Steven Rowe commented on SOLR-2013:
-----------------------------------

> I was referring to mapping-ISOLatin1Accent.txt in the example solr/conf

Here's a link to the version on the 3.x branch:

http://svn.apache.org/viewvc/lucene/dev/branches/branch_3x/solr/example/solr/conf/mapping-ISOLatin1Accent.txt?revision=940784&view=markup


[jira] Resolved: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi resolved SOLR-2013.
----------------------------------

    Resolution: Fixed

trunk: Committed revision 991191.
branch_3x: Committed revision 991196.


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Steven Rowe (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12904668#action_12904668 ] 

Steven Rowe commented on SOLR-2013:
-----------------------------------

Thanks Koji.


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12891686#action_12891686 ] 

Robert Muir commented on SOLR-2013:
-----------------------------------

This seems like a really good idea. Currently it looks like Solr includes a mapping based on the deprecated ISOLatin1AccentFilter, which we really should have removed in trunk already:
{noformat}
This class is included for use with existing
indexes and will be removed in a future release (possibly Lucene 4.0)
{noformat}


[jira] Assigned: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi reassigned SOLR-2013:
------------------------------------

    Assignee: Koji Sekiguchi

I'm going to commit the attached file (with the Perl script) to the example conf directory of trunk and 3.x.


[jira] Commented: (SOLR-2013) ASCIIFoldingFilter => MappingCharFilterFactory as a mapping file

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-2013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12904092#action_12904092 ] 

Koji Sekiguchi commented on SOLR-2013:
--------------------------------------

I think this is ready to go. Any objections?
