Posted to user@nutch.apache.org by HaYa aziz <ha...@hotmail.com> on 2012/03/13 12:13:53 UTC

split content field into two fields


Dear all,

We know that DOMContentUtils.java (in the parse-html plug-in) extracts the text from each node and saves it in sb (a StringBuffer), which is then saved in the content field in Solr.

Use case: I want to split the content into two fields:
content 1: all the text
content 2: special text

To add a field in Solr, I built a class "addfield.java" that extends IndexingFilter:
      doc.add("content2","value");

This works well. Now, how can I pass the variable "content2" from DOMContentUtils.java to addfield.java?


Re: split content field into two fields

Posted by Lewis John Mcgibbney <le...@gmail.com>.
I remember this coming up a while ago.
Please check the user mailing list archives, as it will definitely be in there.
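
For readers landing on this thread: one pattern often used for this kind of hand-off is to store the extra value in the page's parse metadata during parsing and read it back in the indexing filter. The sketch below only illustrates that pattern with stand-in classes; ParseMeta, IndexDoc, parseStep and indexStep are hypothetical names, not Nutch classes or methods. In Nutch 1.x the corresponding calls are believed to be parse.getData().getParseMeta().add(...) on the parse side and parse.getData().getMeta(...) inside the IndexingFilter, but please check that against the Javadoc for your version.

```java
import java.util.HashMap;
import java.util.Map;

public class ParseMetaSketch {

    /** Hypothetical stand-in for the metadata attached to a parsed page. */
    public static final class ParseMeta {
        private final Map<String, String> values = new HashMap<>();
        public void add(String name, String value) { values.put(name, value); }
        public String get(String name) { return values.get(name); }
    }

    /** Hypothetical stand-in for the document that is sent to Solr. */
    public static final class IndexDoc {
        private final Map<String, String> fields = new HashMap<>();
        public void add(String name, String value) { fields.put(name, value); }
        public String getField(String name) { return fields.get(name); }
    }

    /** Parse side: keep the special text under its own key instead of only appending it to sb. */
    public static void parseStep(String specialText, ParseMeta meta) {
        meta.add("content2", specialText);
    }

    /** Indexing side: copy the value into the document, skipping pages that have none. */
    public static void indexStep(ParseMeta meta, IndexDoc doc) {
        String special = meta.get("content2");
        if (special != null) {
            doc.add("content2", special);
        }
    }

    public static void main(String[] args) {
        ParseMeta meta = new ParseMeta();
        parseStep("special text", meta);

        IndexDoc doc = new IndexDoc();
        indexStep(meta, doc);
        System.out.println(doc.getField("content2"));
    }
}
```

The point of the design is that the parser and the indexing filter never call each other directly; they only share the metadata that travels with the parsed page, so the "content2" key is the whole contract between them.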

2012/3/13 HaYa aziz <ha...@hotmail.com>

>
>
> Dear all,
>
> We know that DOMContentUtils.java (in the parse-html plug-in) extracts the
> text from each node and saves it in sb (a StringBuffer), which is then saved
> in the content field in Solr.
>
> Use case: I want to split the content into two fields:
> content 1: all the text
> content 2: special text
>
> To add a field in Solr, I built a class "addfield.java" that extends
> IndexingFilter:
>      doc.add("content2","value");
>
> This works well. Now, how can I pass the variable "content2" from
> DOMContentUtils.java to addfield.java?
>
>




-- 
*Lewis*