Posted to dev@lucene.apache.org by "Simon Rosenthal (JIRA)" <ji...@apache.org> on 2013/11/14 16:55:23 UTC

[jira] [Commented] (SOLR-4722) Highlighter which generates a list of query term position(s) for each item in a list of documents, or returns null if highlighting is disabled.

    [ https://issues.apache.org/jira/browse/SOLR-4722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13822537#comment-13822537 ] 

Simon Rosenthal commented on SOLR-4722:
---------------------------------------

Great patch!

I'd like to use the code as the basis for a component that simply returns the term positions for each query term, without requiring highlighting to be enabled and without returning term offsets. This is for a text mining project where we'll be running queries in batch mode and storing this information externally.
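
To make the desired output concrete, here is a rough sketch in plain Java of the shape I have in mind: a map from each query term to the token positions where it occurs in a document. It does not use any Lucene or Solr API. The class name TermPositionsSketch and the method termPositions are made up for illustration, and whitespace splitting plus lowercasing is only a stand-in for a real analyzer.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class TermPositionsSketch {

    /**
     * Returns, for each query term, the zero-based token positions at which
     * it occurs in the text. Whitespace splitting and lowercasing stand in
     * for a real analyzer (an assumption made for this sketch only).
     */
    public static Map<String, List<Integer>> termPositions(String text, List<String> queryTerms) {
        // LinkedHashMap keeps the query terms in the order they were given.
        Map<String, List<Integer>> positions = new LinkedHashMap<>();
        for (String term : queryTerms) {
            positions.put(term.toLowerCase(Locale.ROOT), new ArrayList<Integer>());
        }

        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return positions; // no tokens, so every term has an empty position list
        }

        String[] tokens = trimmed.toLowerCase(Locale.ROOT).split("\\s+");
        for (int pos = 0; pos < tokens.length; pos++) {
            List<Integer> hits = positions.get(tokens[pos]);
            if (hits != null) {
                hits.add(pos); // record the position only for terms in the query
            }
        }
        return positions;
    }

    public static void main(String[] args) {
        String doc = "the quick brown fox jumps over the lazy dog";
        System.out.println(termPositions(doc, Arrays.asList("the", "fox", "cat")));
    }
}
```

For the sample sentence this prints {the=[0, 6], fox=[3], cat=[]}. A real component would presumably read positions from the index rather than re-tokenizing the stored text.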

Can you think of any gotchas I might encounter?

> Highlighter which generates a list of query term position(s) for each item in a list of documents, or returns null if highlighting is disabled.
> -----------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4722
>                 URL: https://issues.apache.org/jira/browse/SOLR-4722
>             Project: Solr
>          Issue Type: New Feature
>          Components: highlighter
>    Affects Versions: 4.3, 5.0
>            Reporter: Tricia Jenkins
>            Priority: Minor
>         Attachments: SOLR-4722.patch, solr-positionshighlighter.jar
>
>
> As an alternative to returning snippets, this highlighter provides the (term) position for query matches.  One use case for this is to reconcile the term position from the Solr index with 'word' coordinates provided by an OCR process.  In this way we are able to 'highlight' an image, like a page from a book or an article from a newspaper, in the locations that match the user's query.
> This is based on the FastVectorHighlighter and requires that termVectors, termOffsets and termPositions be stored.
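
For reference, the term vector requirement in the description above corresponds to a field definition in schema.xml along the following lines. The field name ocr_text and the type text_general are placeholders chosen for illustration, not taken from the patch; termVectors, termPositions and termOffsets are the standard Solr field attributes.

```xml
<!-- Hypothetical field: the name and type are placeholders for illustration. -->
<field name="ocr_text" type="text_general" indexed="true" stored="true"
       termVectors="true" termPositions="true" termOffsets="true"/>
```

Documents indexed before these attributes are enabled would need to be reindexed before their term vectors are available.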



--
This message was sent by Atlassian JIRA
(v6.1#6144)
