You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Naveen Gupta <nk...@gmail.com> on 2011/06/02 13:08:36 UTC

tika and solr 3,1 integration error

Hi

I am trying to integrate solr 3.1 and tika (which comes default with the
version)

and using curl command trying to index few of the documents, i am getting
this error. the error is attr_meta field is unknown. i checked the
solrconfig, it looks perfect to me.

can you please tell me what i am missing.

I copied all the jars from contrib/extraction/lib to solr/lib folder that is
there in same place where conf is there ....


I am using the same request handler which is coming with default

<requestHandler name="/update/extract"
                  startup="lazy"
                  class="solr.extraction.
>
> ExtractingRequestHandler" >
>     <lst name="defaults">
>       <!-- All the main content goes into "text"... if you need to return
>            the extracted text or do highlighting, use a stored field. -->
>       <str name="fmap.content">text</str>
>       <str name="lowernames">true</str>
>       <str name="uprefix">ignored_</str>
>
>       <!-- capture link hrefs but ignore div attributes -->
>       <str name="captureAttr">true</str>
>       <str name="fmap.a">links</str>
>       <str name="fmap.div">ignored_</str>
>     </lst>
>   </requestHandler>
>
>
>
>
>
> * curl "
> http://dev.grexit.com:8080/solr1/update/extract?literal.id=who.pdf&uprefix=attr_&attr_&fmap.content=attr_content&commit=true"
> -F "myfile=@/root/apache-solr-3.1.0/docs/who.pdf"*
>
>
> <html><head><title>Apache Tomcat/6.0.18 - Error report</title><style><!--H1
> {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:22px;}
> H2
> {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:16px;}
> H3
> {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;font-size:14px;}
> BODY
> {font-family:Tahoma,Arial,sans-serif;color:black;background-color:white;} B
> {font-family:Tahoma,Arial,sans-serif;color:white;background-color:#525D76;}
> P
> {font-family:Tahoma,Arial,sans-serif;background:white;color:black;font-size:12px;}A
> {color : black;}A.name {color : black;}HR {color : #525D76;}--></style>
> </head><body><h1>HTTP Status 400 - ERROR:unknown field 'attr_meta'</h1><HR
> size="1" noshade="noshade"><p><b>type</b> Status report</p><p><b>message</b>
> <u>ERROR:unknown field 'attr_meta'</u></p><p><b>description</b> <u>The
> request sent by the client was syntactically incorrect (ERROR:unknown field
> 'attr_meta').</u></p><HR size="1" noshade="noshade"><h3>Apache
> Tomcat/6.0.18</h3></body></html>root@weforpeople:/usr/share/solr1/lib#
>
>
> Please note
>
> i integrated apacha tika 0.9 with apache-solr-1.4 locally on windows
> machine and using solr cell
>
> calling the program works fine without any changes in configuration.
>
> Thanks
> Naveen
>

>