Posted to dev@lucene.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/10/19 00:27:28 UTC

[jira] Updated: (SOLR-1536) Support for TokenFilters that may modify input documents

     [ https://issues.apache.org/jira/browse/SOLR-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrzej Bialecki  updated SOLR-1536:
------------------------------------

    Attachment: altering.patch

Patch updated to trunk.

> Support for TokenFilters that may modify input documents
> --------------------------------------------------------
>
>                 Key: SOLR-1536
>                 URL: https://issues.apache.org/jira/browse/SOLR-1536
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>    Affects Versions: 1.5
>            Reporter: Andrzej Bialecki 
>         Attachments: altering.patch, altering.patch, altering.patch
>
>
> In some scenarios it is useful to create or modify fields in the input document based on the analysis of other fields in the same document. This need arises, for example, when indexing multilingual documents or when performing NLP tasks such as named-entity recognition (NER). Currently this is not possible.
> This issue provides an implementation of this functionality that consists of the following parts:
> * DocumentAlteringFilterFactory - abstract superclass indicating that TokenFilters created by this factory may modify fields in a SolrInputDocument.
> * TypeAsFieldFilterFactory - example implementation that illustrates this concept, with a JUnit test.
> * DocumentBuilder modifications to support this functionality.
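
The idea described above, deriving new document fields from the analysis of an existing field, can be sketched in plain Java without any Solr or Lucene dependencies. This is not the code in altering.patch: the class TypeAsFieldSketch, the TypedToken holder, the map-based document, the "type_" field prefix, and the rule that tokens of type "word" are skipped are all hypothetical stand-ins, and the behaviour is only guessed from the name TypeAsFieldFilterFactory.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class TypeAsFieldSketch {

    /** Hypothetical stand-in for an analyzed token: its text plus a type label (e.g. from NER). */
    static final class TypedToken {
        final String term;
        final String type;

        TypedToken(String term, String type) {
            this.term = term;
            this.type = type;
        }
    }

    /** Hypothetical stand-in for an input document: field name -> list of values. */
    static Map<String, List<String>> newDocument() {
        return new LinkedHashMap<>();
    }

    /**
     * Adds one field per token type to the document, named prefix + type,
     * holding the terms that carried that type. Tokens with no type or with
     * the assumed default type "word" leave the document unchanged.
     */
    static void addTypesAsFields(Map<String, List<String>> doc,
                                 List<TypedToken> tokens,
                                 String prefix) {
        for (TypedToken token : tokens) {
            if (token.type == null || token.type.equals("word")) {
                continue;
            }
            doc.computeIfAbsent(prefix + token.type, k -> new ArrayList<>())
               .add(token.term);
        }
    }

    public static void main(String[] args) {
        Map<String, List<String>> doc = newDocument();
        doc.computeIfAbsent("body", k -> new ArrayList<>()).add("Alice flew to Paris");

        // In a real analysis chain these tokens would come from the filter's input stream.
        List<TypedToken> tokens = Arrays.asList(
                new TypedToken("Alice", "person"),
                new TypedToken("flew", "word"),
                new TypedToken("to", "word"),
                new TypedToken("Paris", "location"));

        addTypesAsFields(doc, tokens, "type_");
        System.out.println(doc);
    }
}
```

In the sketch the document is altered as a side effect of consuming tokens, which is presumably why the issue also lists DocumentBuilder modifications: the component that builds the index document has to pick up fields added during analysis.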
