Posted to dev@lucene.apache.org by Ryado <gi...@git.apache.org> on 2018/09/13 12:39:45 UTC

[GitHub] lucene-solr pull request #449: [LUCENE-8462] New Arabic snowball stemmer and...

GitHub user Ryado opened a pull request:

    https://github.com/apache/lucene-solr/pull/449

    [LUCENE-8462] New Arabic snowball stemmer and test dataset

    This change adds an Arabic Snowball stemmer based on
    https://github.com/snowballstem/snowball/blob/master/algorithms/arabic.sbl

    as well as an Arabic test dataset in `TestSnowballVocabData.zip` based on
    https://github.com/ibnmalik/golden-corpus-arabic/blob/develop/core/words.txt

    It also updates the `ant patch-snowball` target to be compatible with
    the Java classes generated by the latest Snowball version (tree:
    1964ce688cbeca505263c8f77e16ed923296ce7a). The `ant patch-snowball` target
    remains backward compatible with the version of the Snowball stemmers
    used in Lucene 7.x and skips classes that are already patched.
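
The "ignores already patched classes" behaviour amounts to making the patch step idempotent. The following is only a minimal sketch of that idea in plain Java, not the actual Ant target: the class name `IdempotentPatch`, the marker annotation, and the single rewrite it applies are assumptions made for illustration, and the real `ant patch-snowball` target may apply different rewrites.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative only: a patch step that is safe to run more than once.
final class IdempotentPatch {
    // Hypothetical marker that identifies an already patched source file.
    private static final String MARKER = "@SuppressWarnings(\"unused\")";

    // Matches the top-level class declaration at the start of a line.
    private static final Pattern CLASS_DECL =
        Pattern.compile("^public class", Pattern.MULTILINE);

    /** Returns the patched source, or the input unchanged if it is already patched. */
    static String patch(String source) {
        if (source.contains(MARKER)) {
            return source; // already patched: leave the file untouched
        }
        String replacement = Matcher.quoteReplacement(MARKER + "\npublic class");
        return CLASS_DECL.matcher(source).replaceFirst(replacement);
    }

    public static void main(String[] args) {
        // Stand-in for a freshly generated stemmer class (name is illustrative).
        String generated = "package demo;\n\npublic class ArabicStemmer {}\n";
        String once = patch(generated);
        String twice = patch(once);
        // A second run must not change the result.
        System.out.println(once.equals(twice));
    }
}
```

Checking for a marker before rewriting is what lets the same step run over a mix of freshly generated and previously patched classes.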

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/Ryado/lucene-solr arabicsnowball

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/lucene-solr/pull/449.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #449
    
----
commit 89571cff737b4d4501eee2ea0bc84240835f6467
Author: Ryadh Dahimene <da...@...>
Date:   2018-09-13T12:26:34Z

    Added an Arabic Snowball stemmer and test dataset

    This change adds an Arabic Snowball stemmer based on snowballstem.org
    as well as an Arabic test dataset in `TestSnowballVocabData.zip`.
    It also updates the `ant patch-snowball` target to be compatible with
    the Java classes generated by the latest Snowball version (tree:
    1964ce688cbeca505263c8f77e16ed923296ce7a). The `ant patch-snowball` target
    remains backward compatible with the version of the Snowball stemmers
    used in Lucene 7.x and skips classes that are already patched.

----