Posted to dev@lucene.apache.org by Ryado <gi...@git.apache.org> on 2018/09/13 12:39:45 UTC
[GitHub] lucene-solr pull request #449: [LUCENE-8462] New Arabic snowball stemmer and...
GitHub user Ryado opened a pull request:
https://github.com/apache/lucene-solr/pull/449
[LUCENE-8462] New Arabic snowball stemmer and test dataset
This change adds an Arabic snowball stemmer based on
https://github.com/snowballstem/snowball/blob/master/algorithms/arabic.sbl
as well as an Arabic test dataset in `TestSnowballVocabData.zip` based on
https://github.com/ibnmalik/golden-corpus-arabic/blob/develop/core/words.txt
It also updates the `ant patch-snowball` target to be compatible with
the Java classes generated by the latest snowball version (tree:
1964ce688cbeca505263c8f77e16ed923296ce7a). The `ant patch-snowball` target
remains backward compatible with the snowball stemmers used in
Lucene 7.x and skips classes that are already patched.
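
To illustrate what leaving already patched classes alone means in practice, here is a minimal Java sketch of an idempotent text patch. It is not the `ant patch-snowball` target itself, whose actual patterns are not shown in this message. The class name `IdempotentPatchSketch`, the method `patchOnce`, and the strings `lookup()` and `guarded.lookup()` are all hypothetical and chosen only for illustration.

```java
// Hypothetical sketch: NOT the real `ant patch-snowball` logic.
// It only shows why a patch step should detect already patched sources.
final class IdempotentPatchSketch {

    /**
     * Replaces oldText with newText in source, unless source already
     * contains newText, in which case source is returned unchanged.
     */
    static String patchOnce(String source, String oldText, String newText) {
        if (source.contains(newText)) {
            return source; // already patched: leave the class untouched
        }
        return source.replace(oldText, newText);
    }

    public static void main(String[] args) {
        // Stand-in for a line of a generated stemmer class (assumed content).
        String generated = "Object m = lookup();";

        String once = patchOnce(generated, "lookup()", "guarded.lookup()");
        String twice = patchOnce(once, "lookup()", "guarded.lookup()");

        System.out.println(once);
        System.out.println(once.equals(twice));
    }
}
```

Without the `contains` check, a second run would rewrite the line again, because the replacement text itself contains `lookup()`. With the check, the same step can be run over a mix of freshly generated and previously patched classes.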
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Ryado/lucene-solr arabicsnowball
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/lucene-solr/pull/449.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #449
----
commit 89571cff737b4d4501eee2ea0bc84240835f6467
Author: Ryadh Dahimene <da...@...>
Date: 2018-09-13T12:26:34Z
Added an arabic snowball stemmer and test dataset
This change adds an Arabic snowball stemmer based on snowballstem.org
as well as an arabic test dataset in `TestSnowballVocabData.zip`
It also updates the `ant patch-snowball` target to be compatible with
the java classes generated by the last snowball version (tree:
1964ce688cbeca505263c8f77e16ed923296ce7a). The `ant patch-snowball` target
is retro-compatible with the version of snowball stemmers used in
lucene 7.x and ignores already patched classes.
----
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org