You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by HaYa aziz <ha...@hotmail.com> on 2012/03/13 12:13:53 UTC
split content field into two fields
Dear all ,,
we know that DOMContentUtils.java (in parse-html plug-in) extract the text from node and save it in sb (StringBuffer), then in will be saved in content field in solr.
Use Case: I want to split the content into 2 fields,,
content 1 : all the text
content 2: special text
in order to add fields i solr I build a class "addfield.java" that extend IndexingFilter:
doc.add("content2","value");
and it works well, now how can I send the variable "content2" in DOMContentUtils.java to addfield.java ????
Re: split content field into two fields
Posted by Lewis John Mcgibbney <le...@gmail.com>.
I remember a while ago this came up.
Please check out the user archives as it will definitely be in there.
2012/3/13 HaYa aziz <ha...@hotmail.com>
>
>
> Dear all ,,
>
> we know that DOMContentUtils.java (in parse-html plug-in) extract the text
> from node and save it in sb (StringBuffer), then in will be saved in
> content field in solr.
>
> Use Case: I want to split the content into 2 fields,,
> content 1 : all the text
> content 2: special text
>
> in order to add fields i solr I build a class "addfield.java" that extend
> IndexingFilter:
> doc.add("content2","value");
>
> and it works well, now how can I send the variable "content2" in
> DOMContentUtils.java to addfield.java ????
>
>
--
*Lewis*