You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Erick Erickson (Created) (JIRA)" <ji...@apache.org> on 2012/02/06 14:29:59 UTC

[jira] [Created] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Document WordDelimiterFilterFactory "types" parameter.
------------------------------------------------------

                 Key: SOLR-3102
                 URL: https://issues.apache.org/jira/browse/SOLR-3102
             Project: Solr
          Issue Type: Improvement
            Reporter: Erick Erickson
            Assignee: Erick Erickson
            Priority: Trivial


SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Robert Muir (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201298#comment-13201298 ] 

Robert Muir commented on SOLR-3102:
-----------------------------------

Look again, its always been there:
{noformat}
In some cases you might want to adjust how WordDelimiterFilter splits on a per-character basis. To do this, you can supply a configuration file with the "types" attribute that specifies custom character categories. An example file is in subversion *here*. This is especially useful to add "hashtag or currency" searches. 
{noformat}

                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Updated] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Erick Erickson (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson updated SOLR-3102:
---------------------------------

    Attachment: SOLR-3102.patch

Trivial patch updating javadocs to include "types" parameter
                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Erick Erickson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201291#comment-13201291 ] 

Erick Erickson commented on SOLR-3102:
--------------------------------------

Not on this page: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters, until now.
                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Robert Muir (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201277#comment-13201277 ] 

Robert Muir commented on SOLR-3102:
-----------------------------------

This is documented on the wiki, along with an example file (also happens to be the test): http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory

Is there a better place it should be?
                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Erick Erickson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201311#comment-13201311 ] 

Erick Erickson commented on SOLR-3102:
--------------------------------------

You mean I actually have to *read the entire thing?* Why! The noive a dat guy....

But I like the addition I put in, after all I wrote it <G>....

                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Resolved] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Erick Erickson (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson resolved SOLR-3102.
----------------------------------

       Resolution: Fixed
    Fix Version/s: 4.0
                   3.6

Committed trivial javadoc change.
3.6 - r: 1241009
4.0 - r: 1241010
                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] [Commented] (SOLR-3102) Document WordDelimiterFilterFactory "types" parameter.

Posted by "Robert Muir (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/SOLR-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13201313#comment-13201313 ] 

Robert Muir commented on SOLR-3102:
-----------------------------------

I agree, your improvements are great :)

I was just defending myself!
                
> Document WordDelimiterFilterFactory "types" parameter.
> ------------------------------------------------------
>
>                 Key: SOLR-3102
>                 URL: https://issues.apache.org/jira/browse/SOLR-3102
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Erick Erickson
>            Assignee: Erick Erickson
>            Priority: Trivial
>              Labels: Javadocs
>             Fix For: 3.6, 4.0
>
>         Attachments: SOLR-3102.patch
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> SOLR-2059 added the ability to customize the mapping of specific characters to types (e.g. # could considered an ALPHA character if desired). But there's no documentation showing that this is an option. The Javadoc for the factory and the Wiki should have this added.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org