You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "P B (JIRA)" <ji...@apache.org> on 2010/07/05 09:36:49 UTC

[jira] Created: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

add HyphenationCompoundWordTokenFilterFactory class
---------------------------------------------------

                 Key: SOLR-1984
                 URL: https://issues.apache.org/jira/browse/SOLR-1984
             Project: Solr
          Issue Type: New Feature
            Reporter: P B
            Priority: Critical
         Attachments: HyphenationCompoundWordTokenFilterFactory.java

Please can you include my contribution into Solr night builds.

I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Assigned: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir reassigned SOLR-1984:
---------------------------------

    Assignee: Robert Muir

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated SOLR-1984:
------------------------------

    Attachment: SOLR-1984.patch

Thank you very much for contributing this, its true there is no factory for this feature.

I updated your code with a few tweaks:
* allow null dictionary. This allows the use of just the hyphenation grammar (LUCENE-1287)
* allow encoding to be specified (but default to UTF-8). Some of the grammar distributions from offo dont use UTF-8 encoding.
* set onlyLongestMatch default to 'false'. this is just to be consistent with the TokenFilter itself, which defaults to false.
* added the Apache-licensed danish grammar to test-files, along with a small dictionary and some test cases.

if no one objects, i'll commit in a bit.


> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java, SOLR-1984.patch
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

Posted by "P B (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

P B updated SOLR-1984:
----------------------

    Attachment: HyphenationCompoundWordTokenFilterFactory.java

source code

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: P B
>            Priority: Critical
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

Posted by "Uwe Schindler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler updated SOLR-1984:
--------------------------------

    Fix Version/s: 3.1
                   4.0
         Priority: Minor  (was: Critical)
      Component/s: Schema and Analysis

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Resolved: (SOLR-1984) add HyphenationCompoundWordTokenFilterFactory class

Posted by "Robert Muir (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-1984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir resolved SOLR-1984.
-------------------------------

    Resolution: Fixed

Committed revision 962555, 962559 (3x)

> add HyphenationCompoundWordTokenFilterFactory class
> ---------------------------------------------------
>
>                 Key: SOLR-1984
>                 URL: https://issues.apache.org/jira/browse/SOLR-1984
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: P B
>            Assignee: Robert Muir
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: HyphenationCompoundWordTokenFilterFactory.java, SOLR-1984.patch
>
>
> Please can you include my contribution into Solr night builds.
> I can not compile on Linux server, I have tested only on Windows. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org