You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Koji Sekiguchi (JIRA)" <ji...@apache.org> on 2010/08/16 11:38:17 UTC

[jira] Created: (LUCENE-2603) FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data

FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data
-------------------------------------------------------------------------------------------------------------

                 Key: LUCENE-2603
                 URL: https://issues.apache.org/jira/browse/LUCENE-2603
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/highlighter
    Affects Versions: 3.0.2, 2.9.3
            Reporter: Koji Sekiguchi
            Priority: Trivial
             Fix For: 3.1, 4.0


If the following multiValued names are in authors field:

* Michael McCandless
* Erik Hatcher
* Otis Gospodnetić

Since FragmentsBuilder concatenates multiValued data with a space in BaseFragmentsBuilder.getFragmentSource():

{code}
while( buffer.length() < endOffset && index[0] < values.length ){
  if( index[0] > 0 && values[index[0]].isTokenized() && values[index[0]].stringValue().length() > 0 )
    buffer.append( ' ' );
  buffer.append( values[index[0]++].stringValue() );
}
{code}

an entire field snippet (using LUCENE-2464) will be "Michael McCandless Erik Hatcher Otis Gospodnetić". There is a requirement an arbitrary char (e.g. '/') can be set so that client can separate the snippet easily. i.e. "Michael McCandless/Erik Hatcher/Otis Gospodnetić"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Resolved: (LUCENE-2603) FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi resolved LUCENE-2603.
------------------------------------

      Assignee: Koji Sekiguchi
    Resolution: Fixed

trunk: Committed revision 986173.
branch_3x: Committed revision 986175.


> FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-2603
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2603
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/highlighter
>    Affects Versions: 2.9.3, 3.0.2
>            Reporter: Koji Sekiguchi
>            Assignee: Koji Sekiguchi
>            Priority: Trivial
>             Fix For: 3.1, 4.0
>
>         Attachments: LUCENE-2603.patch, LUCENE-2603.patch
>
>
> If the following multiValued names are in authors field:
> * Michael McCandless
> * Erik Hatcher
> * Otis Gospodnetić
> Since FragmentsBuilder concatenates multiValued data with a space in BaseFragmentsBuilder.getFragmentSource():
> {code}
> while( buffer.length() < endOffset && index[0] < values.length ){
>   if( index[0] > 0 && values[index[0]].isTokenized() && values[index[0]].stringValue().length() > 0 )
>     buffer.append( ' ' );
>   buffer.append( values[index[0]++].stringValue() );
> }
> {code}
> an entire field snippet (using LUCENE-2464) will be "Michael McCandless Erik Hatcher Otis Gospodnetić". There is a requirement an arbitrary char (e.g. '/') can be set so that client can separate the snippet easily. i.e. "Michael McCandless/Erik Hatcher/Otis Gospodnetić"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (LUCENE-2603) FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi updated LUCENE-2603:
-----------------------------------

    Attachment: LUCENE-2603.patch

> FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-2603
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2603
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/highlighter
>    Affects Versions: 2.9.3, 3.0.2
>            Reporter: Koji Sekiguchi
>            Priority: Trivial
>             Fix For: 3.1, 4.0
>
>         Attachments: LUCENE-2603.patch
>
>
> If the following multiValued names are in authors field:
> * Michael McCandless
> * Erik Hatcher
> * Otis Gospodnetić
> Since FragmentsBuilder concatenates multiValued data with a space in BaseFragmentsBuilder.getFragmentSource():
> {code}
> while( buffer.length() < endOffset && index[0] < values.length ){
>   if( index[0] > 0 && values[index[0]].isTokenized() && values[index[0]].stringValue().length() > 0 )
>     buffer.append( ' ' );
>   buffer.append( values[index[0]++].stringValue() );
> }
> {code}
> an entire field snippet (using LUCENE-2464) will be "Michael McCandless Erik Hatcher Otis Gospodnetić". There is a requirement an arbitrary char (e.g. '/') can be set so that client can separate the snippet easily. i.e. "Michael McCandless/Erik Hatcher/Otis Gospodnetić"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org


[jira] Updated: (LUCENE-2603) FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data

Posted by "Koji Sekiguchi (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/LUCENE-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Koji Sekiguchi updated LUCENE-2603:
-----------------------------------

    Attachment: LUCENE-2603.patch

Updated patch attached. I'll commit shortly.

> FastVectorHighlighter: add a method to set an arbitrary char that is used when concatenating multiValued data
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-2603
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2603
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/highlighter
>    Affects Versions: 2.9.3, 3.0.2
>            Reporter: Koji Sekiguchi
>            Priority: Trivial
>             Fix For: 3.1, 4.0
>
>         Attachments: LUCENE-2603.patch, LUCENE-2603.patch
>
>
> If the following multiValued names are in authors field:
> * Michael McCandless
> * Erik Hatcher
> * Otis Gospodnetić
> Since FragmentsBuilder concatenates multiValued data with a space in BaseFragmentsBuilder.getFragmentSource():
> {code}
> while( buffer.length() < endOffset && index[0] < values.length ){
>   if( index[0] > 0 && values[index[0]].isTokenized() && values[index[0]].stringValue().length() > 0 )
>     buffer.append( ' ' );
>   buffer.append( values[index[0]++].stringValue() );
> }
> {code}
> an entire field snippet (using LUCENE-2464) will be "Michael McCandless Erik Hatcher Otis Gospodnetić". There is a requirement an arbitrary char (e.g. '/') can be set so that client can separate the snippet easily. i.e. "Michael McCandless/Erik Hatcher/Otis Gospodnetić"

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org