Posted to dev@lucene.apache.org by "karin (JIRA)" <ji...@apache.org> on 2007/03/28 12:19:32 UTC

[jira] Created: (LUCENE-852) spellchecker: make hard-coded values configurable

spellchecker: make hard-coded values configurable
-------------------------------------------------

                 Key: LUCENE-852
                 URL: https://issues.apache.org/jira/browse/LUCENE-852
             Project: Lucene - Java
          Issue Type: Improvement
            Reporter: karin


the class org.apache.lucene.search.spell.SpellChecker uses the following hard-coded values in its method
indexDictionary:
        writer.setMergeFactor(300);
        writer.setMaxBufferedDocs(150);
this poses problems when the spellcheck index is created on systems with certain limits, e.g. in Unix
environments where the ulimit settings are restricted for the user (http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428).

there are several ways to circumvent this:
1. add another indexDictionary method with additional parameters:
    public void indexDictionary (Dictionary dict, int mergeFactor, int maxBufferedDocs) throws IOException
    
2. add setter methods for mergeFactor and maxBufferedDocs 
    (see code in http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428 )

3. Make SpellChecker subclassing easier as suggested by Chris Hostetter 
   (see reply  http://www.gossamer-threads.com/lists/lucene/java-dev/47463#47463)
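
A minimal sketch of how options 1 and 2 could fit together. This is not the real SpellChecker code or the attached patch: StubWriter is a hypothetical stand-in for IndexWriter that only records its settings, ConfigurableSpellIndexer and its field names are made up for illustration, and the dictionary is simplified to an Iterable<String>. The sketch also drops the "throws IOException" clause because nothing in it performs I/O.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for IndexWriter; it only records what it is given. */
class StubWriter {
    int mergeFactor;
    int maxBufferedDocs;
    final List<String> docs = new ArrayList<>();

    void setMergeFactor(int mergeFactor) { this.mergeFactor = mergeFactor; }
    void setMaxBufferedDocs(int maxBufferedDocs) { this.maxBufferedDocs = maxBufferedDocs; }
    void addDocument(String word) { docs.add(word); }
}

/** Hypothetical sketch of a spell-index builder with configurable writer settings. */
class ConfigurableSpellIndexer {
    // Defaults mirror the values currently hard-coded in indexDictionary.
    static final int DEFAULT_MERGE_FACTOR = 300;
    static final int DEFAULT_MAX_BUFFERED_DOCS = 150;

    private int mergeFactor = DEFAULT_MERGE_FACTOR;
    private int maxBufferedDocs = DEFAULT_MAX_BUFFERED_DOCS;

    // Option 2: setters that change the values used by the one-argument method.
    void setMergeFactor(int mergeFactor) { this.mergeFactor = mergeFactor; }
    void setMaxBufferedDocs(int maxBufferedDocs) { this.maxBufferedDocs = maxBufferedDocs; }

    // Existing-style entry point: delegates with the currently configured values.
    StubWriter indexDictionary(Iterable<String> dict) {
        return indexDictionary(dict, mergeFactor, maxBufferedDocs);
    }

    // Option 1: overload that takes the writer settings explicitly.
    StubWriter indexDictionary(Iterable<String> dict, int mergeFactor, int maxBufferedDocs) {
        StubWriter writer = new StubWriter();
        writer.setMergeFactor(mergeFactor);
        writer.setMaxBufferedDocs(maxBufferedDocs);
        for (String word : dict) {
            writer.addDocument(word);
        }
        return writer; // returned only so the sketch can be inspected
    }
}
```

With this shape, existing callers of the one-argument method keep the current defaults, while a user on a system with a restrictive ulimit can pass smaller values to the overload or set them once through the setters.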

thanx,
karin


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org

[jira] Updated: (LUCENE-852) spellchecker: make hard-coded values configurable

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/LUCENE-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Otis Gospodnetic updated LUCENE-852:
------------------------------------

    Attachment: LUCENE-852.patch

> spellchecker: make hard-coded values configurable
> -------------------------------------------------
>
>                 Key: LUCENE-852
>                 URL: https://issues.apache.org/jira/browse/LUCENE-852
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: karin
>            Assignee: Otis Gospodnetic
>            Priority: Minor
>         Attachments: LUCENE-852.patch, LUCENE-852.patch


[jira] Resolved: (LUCENE-852) spellchecker: make hard-coded values configurable

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/LUCENE-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Otis Gospodnetic resolved LUCENE-852.
-------------------------------------

       Resolution: Fixed
    Lucene Fields: [New, Patch Available]  (was: [Patch Available, New])

Thanks for the patch, Otis.

Committed revision 659021.


> spellchecker: make hard-coded values configurable
> -------------------------------------------------
>
>                 Key: LUCENE-852
>                 URL: https://issues.apache.org/jira/browse/LUCENE-852
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: karin
>            Assignee: Otis Gospodnetic
>            Priority: Minor
>         Attachments: LUCENE-852.patch, LUCENE-852.patch


[jira] Updated: (LUCENE-852) spellchecker: make hard-coded values configurable

Posted by "karin (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/LUCENE-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

karin updated LUCENE-852:
-------------------------

      Description: 
the class org.apache.lucene.search.spell.SpellChecker uses the following hard-coded values in its method
indexDictionary:
        writer.setMergeFactor(300);
        writer.setMaxBufferedDocs(150);
this poses problems when the spellcheck index is created on systems with certain limits, e.g. in Unix
environments where the ulimit settings are restricted for the user (http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428).

there are several ways to circumvent this:
1. add another indexDictionary method with additional parameters:
    public void indexDictionary (Dictionary dict, int mergeFactor, int maxBufferedDocs) throws IOException
    
2. add setter methods for mergeFactor and maxBufferedDocs 
    (see code in http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428 )

3. Make SpellChecker subclassing easier as suggested by Chris Hostetter 
   (see reply  http://www.gossamer-threads.com/lists/lucene/java-dev/47463#47463)

thanx,
karin


  was:
the class org.apache.lucene.search.spell.SpellChecker uses the following hard-coded values in its method
indexDictionary:
        writer.setMergeFactor(300);
        writer.setMaxBufferedDocs(150);
this poses problems when the spellcheck index is created on systems with certain limits, i.e. in unix
environment where the ulimit setting are restricted for the user (http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428).

there are several ways to circumvent this:
1. add another indexDictionary method with additional parameters:
    public void indexDictionary (Dictionary dict, int mergeFactor, int maxBufferedDocs) throws IOException
    
2. add setter methods for mergeFactor and maxBufferedDocs 
    (see code in http://www.gossamer-threads.com/lists/lucene/java-dev/47428#47428 )

3. Make SpellChecker subclassing easier as suggested by Chris Hostetter 
   (see reply  http://www.gossamer-threads.com/lists/lucene/java-dev/47463#47463)

thanx,
karin


    Lucene Fields: [New, Patch Available]  (was: [Patch Available, New])

> spellchecker: make hard-coded values configurable
> -------------------------------------------------
>
>                 Key: LUCENE-852
>                 URL: https://issues.apache.org/jira/browse/LUCENE-852
>             Project: Lucene - Java
>          Issue Type: Improvement
>            Reporter: karin


[jira] Updated: (LUCENE-852) spellchecker: make hard-coded values configurable

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/LUCENE-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Otis Gospodnetic updated LUCENE-852:
------------------------------------

    Attachment: LUCENE-852.patch

Here is one way of specifying IndexWriter parameters.
I'll commit next week.


> spellchecker: make hard-coded values configurable
> -------------------------------------------------
>
>                 Key: LUCENE-852
>                 URL: https://issues.apache.org/jira/browse/LUCENE-852
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: karin
>            Assignee: Otis Gospodnetic
>            Priority: Minor
>         Attachments: LUCENE-852.patch


[jira] Updated: (LUCENE-852) spellchecker: make hard-coded values configurable

Posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/LUCENE-852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Otis Gospodnetic updated LUCENE-852:
------------------------------------

      Component/s: contrib/*
         Priority: Minor  (was: Major)
    Lucene Fields: [New, Patch Available]  (was: [Patch Available, New])
         Assignee: Otis Gospodnetic

> spellchecker: make hard-coded values configurable
> -------------------------------------------------
>
>                 Key: LUCENE-852
>                 URL: https://issues.apache.org/jira/browse/LUCENE-852
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>            Reporter: karin
>            Assignee: Otis Gospodnetic
>            Priority: Minor
