You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Federico Grilli (JIRA)" <ji...@apache.org> on 2006/09/30 23:00:22 UTC

[jira] Created: (LUCENE-677) Italian Analyzer

Italian Analyzer
----------------

                 Key: LUCENE-677
                 URL: http://issues.apache.org/jira/browse/LUCENE-677
             Project: Lucene - Java
          Issue Type: Improvement
          Components: Analysis
            Reporter: Federico Grilli
            Priority: Minor


Hello everybody,

I have written an Italian analyzer based on the Porter's stemming
algorithm as found at http://www.snowball.tartarus.org. I would like
to contribute it to the Lucene sandbox. The classes come with thorough
unit tests.

Regards,

Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Updated: (LUCENE-677) Italian Analyzer

Posted by "Federico Grilli (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-677?page=all ]

Federico Grilli updated LUCENE-677:
-----------------------------------

    Attachment: ItalianAnalyzer.patch

> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: http://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Commented: (LUCENE-677) Italian Analyzer

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-677?page=comments#action_12439015 ] 
            
Otis Gospodnetic commented on LUCENE-677:
-----------------------------------------

Federico, could you please explain how your Italian analyzer is different from the Snowball one?
>From your description, I would think they use the same stemming rules.
Thanks!

> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: http://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Resolved: (LUCENE-677) Italian Analyzer

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved LUCENE-677.
------------------------------------

    Resolution: Won't Fix

Can use the Snowball Italian

> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: https://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Commented: (LUCENE-677) Italian Analyzer

Posted by "Federico Grilli (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-677?page=comments#action_12438935 ] 
            
Federico Grilli commented on LUCENE-677:
----------------------------------------

One note on the patch size. Of course the java files themselves are very small, the bulk is a diffs.txt file containing more than 30000 Italian words and their corresponding stemmed forms used for testing.

> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: http://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Commented: (LUCENE-677) Italian Analyzer

Posted by "Federico Grilli (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-677?page=comments#action_12439640 ] 
            
Federico Grilli commented on LUCENE-677:
----------------------------------------

I guess you already got my answer through the lucene-dev mailing list. In any case, the answer is yes, same stemming rules but different implementation. I have seen
that also other languages (i.e. French, German)  have contributed
implementations of the Porter's algorithm in addition to the snowball
one, so I thought mine too could be of some interest.

Cheers,

Federico

> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: http://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


[jira] Commented: (LUCENE-677) Italian Analyzer

Posted by "Federico Grilli (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-677?page=comments#action_12442239 ] 
            
Federico Grilli commented on LUCENE-677:
----------------------------------------


   [[ Old comment, sent by email on Sun, 1 Oct 2006 23:53:00 +0200 ]]

Yes, same stemming rules but different implementation. I have seen
that also other languages (i.e. French, German)  have contributed
implementations of the Porter's algorithm in addition to the snowball
one, so I thought mine too could be of some interest.

Regards,

Federico




-- 
Federico Grilli
--------------------
"Copyleft, all rights reversed" http://www.gnu.org/copyleft


> Italian Analyzer
> ----------------
>
>                 Key: LUCENE-677
>                 URL: http://issues.apache.org/jira/browse/LUCENE-677
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Federico Grilli
>            Priority: Minor
>         Attachments: ItalianAnalyzer.patch
>
>
> Hello everybody,
> I have written an Italian analyzer based on the Porter's stemming
> algorithm as found at http://www.snowball.tartarus.org. I would like
> to contribute it to the Lucene sandbox. The classes come with thorough
> unit tests.
> Regards,
> Federico Grilli

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org