You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by GitBox <gi...@apache.org> on 2021/01/26 11:07:33 UTC

[GitHub] [lucene-solr] donnerpeter commented on a change in pull request #2243: LUCENE-9698: Hunspell: reuse char[] when possible when stripping affix

donnerpeter commented on a change in pull request #2243:
URL: https://github.com/apache/lucene-solr/pull/2243#discussion_r564429159



##########
File path: lucene/analysis/common/src/java/org/apache/lucene/analysis/hunspell/Stemmer.java
##########
@@ -403,7 +408,12 @@ private CharsRef newStem(char[] buffer, int length, IntsRef forms, int formID) {
     return stems;
   }
 
-  private char[] stripAffix(char[] word, int length, int affixLen, int affix, boolean isPrefix) {
+  /**
+   * @return null if affix conditions isn't met; the same char[] if the affix has no strip data and

Review comment:
       "stripping" has a bit different meaning in Hunspell. Googling for "deaffixation" yields some results, some even seem to be by a native speaker (https://books.google.de/books?id=4A44AAAAIAAJ&pg=PA24&lpg=PA24&dq=deaffixation&source=bl&ots=bjtSTczny5&sig=ACfU3U1ujhDJpEFTR3BD_MiHYqgMJKbHMQ&hl=en&sa=X&ved=2ahUKEwjNw_uQurnuAhWvx4UKHXJoCQ0Q6AEwAnoECAUQAg#v=onepage&q=deaffixation&f=false), but rather few, indeed. How about "affix removal"?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org