You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "ludovic Boutros (JIRA)" <ji...@apache.org> on 2012/05/28 11:12:22 UTC

[jira] [Created] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

ludovic Boutros created SOLR-3494:
-------------------------------------

             Summary: The hunspell filter should support compressed Hunspell dictionaries
                 Key: SOLR-3494
                 URL: https://issues.apache.org/jira/browse/SOLR-3494
             Project: Solr
          Issue Type: Improvement
          Components: Schema and Analysis
    Affects Versions: 3.5, 3.6
            Reporter: ludovic Boutros


OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Resolved] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Male resolved LUCENE-4079.
--------------------------------

       Resolution: Fixed
    Fix Version/s: 5.0

Thanks Ludovic!
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1, 5.0
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Simon Willnauer (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284378#comment-13284378 ] 

Simon Willnauer commented on LUCENE-4079:
-----------------------------------------

Ludovic, I moved this to lucene since its a lucene issue.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Moved] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Simon Willnauer (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer moved SOLR-3494 to LUCENE-4079:
-----------------------------------------------

          Component/s:     (was: Schema and Analysis)
                       modules/analysis
        Fix Version/s:     (was: 3.6.1)
                           (was: 4.0)
                       3.6.1
                       4.0
        Lucene Fields: New,Patch Available
    Affects Version/s:     (was: 3.6)
                           (was: 3.5)
                           (was: 4.0)
                       4.0
                       3.5
                       3.6
                  Key: LUCENE-4079  (was: SOLR-3494)
              Project: Lucene - Java  (was: Solr)
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.6, 3.5, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284386#comment-13284386 ] 

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

patch renamed.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079.patch
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284736#comment-13284736 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Thanks Ludovic, I'll take a look.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Simon Willnauer (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer updated SOLR-3494:
----------------------------------

    Affects Version/s: 4.0
        Fix Version/s: 3.6.1
                       4.0
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284385#comment-13284385 ] 

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

oups, yes you are right, thank you Simon.

                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079-3.6.x.patch
                LUCENE-4079-trunk.patch

Ok, I have merged some typo corrections from the trunk to the branch 3.6.
I have applied the patch to the trunk, and run the tests.

Do you need something else ? :)
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch, LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment: LUCENE-4079.patch

The patch with a test unit.

I don't know if a compressed dictionary could use both naming types (alias and direct rule name).
In the c++ code it seems that it is not possible so I did not test it in the java code.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284389#comment-13284389 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Ludovic, do you think you could include some tests to verify this behavior?
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284383#comment-13284383 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Thanks Simon
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13285379#comment-13285379 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Committed to trunk in r1344095, 4x in r1344094.

Will backport to 3.6.1
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment:     (was: SOLR-3494.patch)
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13285460#comment-13285460 ] 

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

Thanks Chris for taking this patch in account so fast !
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1, 5.0
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284605#comment-13284605 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Ludovic,

Sorry to keep asking for changes, but do you think you could make your patch against trunk?
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment:     (was: LUCENE-4079.patch)
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13284650#comment-13284650 ] 

ludovic Boutros commented on LUCENE-4079:
-----------------------------------------

No problem, I will try to make the patch against trunk today.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment:     (was: SOLR-3494.patch)
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13285387#comment-13285387 ] 

Chris Male commented on LUCENE-4079:
------------------------------------

Backported in r1344101.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079-3.6.x.patch, LUCENE-4079-trunk.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment: SOLR-3494.patch

I manage to load french compressed dictionaries with this little patch.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-3494) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated SOLR-3494:
----------------------------------

    Attachment: SOLR-3494.patch

little changes.
                
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: SOLR-3494
>                 URL: https://issues.apache.org/jira/browse/SOLR-3494
>             Project: Solr
>          Issue Type: Improvement
>          Components: Schema and Analysis
>    Affects Versions: 3.5, 3.6
>            Reporter: ludovic Boutros
>         Attachments: SOLR-3494.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "ludovic Boutros (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ludovic Boutros updated LUCENE-4079:
------------------------------------

    Attachment:     (was: LUCENE-4079.patch)
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Assigned] (LUCENE-4079) The hunspell filter should support compressed Hunspell dictionaries

Posted by "Chris Male (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Male reassigned LUCENE-4079:
----------------------------------

    Assignee: Chris Male
    
> The hunspell filter should support compressed Hunspell dictionaries
> -------------------------------------------------------------------
>
>                 Key: LUCENE-4079
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4079
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: modules/analysis
>    Affects Versions: 3.5, 3.6, 4.0
>            Reporter: ludovic Boutros
>            Assignee: Chris Male
>             Fix For: 4.0, 3.6.1
>
>         Attachments: LUCENE-4079.patch
>
>
> OpenOffice dictionaries are often compressed via some aliases on the beginning of the affixe file. The french one for instance.
> Currently the hunspell filter does not read the aliases.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org