Posted to dev@lucene.apache.org by "Emre Bayram (JIRA)" <ji...@apache.org> on 2006/04/28 00:10:37 UTC

[jira] Created: (LUCENE-559) Turkish Analyzer for Lucene

Turkish Analyzer for Lucene
---------------------------

         Key: LUCENE-559
         URL: http://issues.apache.org/jira/browse/LUCENE-559
     Project: Lucene - Java
        Type: Improvement

  Components: Analysis  
    Reporter: Emre Bayram


I have developed an Analyzer for Turkish, thanks to the German and Brazilian language analyzers.
This Turkish Analyzer supports the ISO-8859-9 (Turkish) character set and has a good set of stop words. I hope it helps Turkish developers who use Lucene. (I searched for many hours for a Turkish analyzer for Lucene but could not find one, so I wrote this and am submitting it here.)
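
To illustrate the ISO-8859-9 point above, here is a minimal sketch. It is not taken from the attached analyzer; it uses only the Java standard library, and the class and method names (TurkishTextSketch, decode, lower) are hypothetical. It decodes ISO-8859-9 bytes into a String and lowercases with a Turkish locale, which is what keeps the dotted and dotless I apart.

```java
import java.nio.charset.Charset;
import java.util.Locale;

// Hypothetical helper for illustration only; not part of the attached TurkishAnalyzer.
public class TurkishTextSketch {

    // ISO-8859-9 (Latin-5) is the Turkish variant of Latin-1.
    private static final Charset LATIN5 = Charset.forName("ISO-8859-9");
    private static final Locale TURKISH = Locale.forLanguageTag("tr-TR");

    // Decode raw ISO-8859-9 bytes into a Java (UTF-16) String.
    public static String decode(byte[] raw) {
        return new String(raw, LATIN5);
    }

    // Lowercase using Turkish rules: 'I' becomes dotless U+0131,
    // and dotted capital U+0130 becomes plain 'i'.
    public static String lower(String text) {
        return text.toLowerCase(TURKISH);
    }

    public static void main(String[] args) {
        // 0xDD is the dotted capital I (U+0130) in ISO-8859-9.
        byte[] raw = {(byte) 0xDD, 'S', 'T', 'A', 'N', 'B', 'U', 'L'};
        System.out.println(lower(decode(raw)));
        // Plain ASCII 'I' lowercases to the dotless form under the Turkish locale.
        System.out.println(lower("ILIK"));
    }
}
```

An analyzer that lowercases with the default locale instead would map 'I' to 'i', so terms containing the dotless letter might not match at search time.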

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Re: [jira] Resolved: (LUCENE-559) Turkish Analyzer for Lucene

Posted by Shai Erera <se...@gmail.com>.
Why not use the SnowballAnalyzer for Turkish? Snowball recently added a
Turkish stemmer.

On Jan 10, 2008 8:51 PM, Grant Ingersoll (JIRA) <ji...@apache.org> wrote:

>
>     [
> https://issues.apache.org/jira/browse/LUCENE-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>
> Grant Ingersoll resolved LUCENE-559.
> ------------------------------------
>
>    Resolution: Incomplete
>
> Needs unit tests and a patch would be nice.


-- 
Regards,

Shai Erera

[jira] Resolved: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-559?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved LUCENE-559.
------------------------------------

    Resolution: Incomplete

Needs unit tests and a patch would be nice.

> Turkish Analyzer for Lucene
> ---------------------------
>
>                 Key: LUCENE-559
>                 URL: https://issues.apache.org/jira/browse/LUCENE-559
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Analysis
>            Reporter: Emre Bayram
>            Priority: Minor
>         Attachments: IndexFiles.java, SearchFiles.java, TurkishAnalyzer.java, TurkishAnalyzer.java, TurkishStemFilter.java, TurkishStemFilter.java, TurkishStemmer.java, TurkishStemmer.java

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Emre Bayram (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-559?page=all ]

Emre Bayram updated LUCENE-559:
-------------------------------

    Attachment: TurkishAnalyzer.java
                TurkishStemFilter.java
                TurkishStemmer.java

With uni-codes.



[jira] Commented: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Daniel Naber (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-559?page=comments#action_12416407 ] 

Daniel Naber commented on LUCENE-559:
-------------------------------------

Thanks for your contribution. Could you write some unit tests for your classes, similar to the existing tests for other languages?




[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Emre Bayram (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-559?page=all ]

Emre Bayram updated LUCENE-559:
-------------------------------

    Attachment: TurkishAnalyzer.java
                TurkishStemFilter.java
                TurkishStemmer.java



[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Emre Bayram (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-559?page=all ]

Emre Bayram updated LUCENE-559:
-------------------------------

    Attachment: IndexFiles.java



[jira] Updated: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Emre Bayram (JIRA)" <ji...@apache.org>.
     [ http://issues.apache.org/jira/browse/LUCENE-559?page=all ]

Emre Bayram updated LUCENE-559:
-------------------------------

    Attachment: SearchFiles.java



[jira] Commented: (LUCENE-559) Turkish Analyzer for Lucene

Posted by "Daniel Naber (JIRA)" <ji...@apache.org>.
    [ http://issues.apache.org/jira/browse/LUCENE-559?page=comments#action_12416646 ] 

Daniel Naber commented on LUCENE-559:
-------------------------------------

By testcase I meant classes that are JUnit tests (i.e. "... extends TestCase"), as you can see in the examples in the src/test directory. Could you provide those?

