Posted to solr-user@lucene.apache.org by Nutan Shinde <nu...@gmail.com> on 2013/09/04 13:55:40 UTC
Re: Indexing pdf files - question.
My solrconfig.xml is:
<requestHandler name="/update/extract" class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- map the extracted content to the desc field of my table, defined in schema.xml below -->
    <str name="fmap.content">desc</str>
    <str name="lowernames">true</str>
    <str name="uprefix">attr_</str>
    <str name="captureAttr">true</str>
  </lst>
</requestHandler>
<lib dir="../../extract" regex=".*\.jar" />
schema.xml:
<fields>
  <field name="doc_id" type="integer" indexed="true" stored="true" multiValued="false"/>
  <field name="name" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="path" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="desc" type="text_split" indexed="true" stored="true" multiValued="false"/>
</fields>
<types>
  <fieldType name="string" class="solr.StrField" />
  <fieldType name="integer" class="solr.IntField" />
  <fieldType name="text" class="solr.TextField" />
  <fieldType name="text" class="solr.TextField" />
</types>
<dynamicField name="*_i" type="integer" indexed="true" stored="true"/>
<uniqueKey>doc_id</uniqueKey>
I created an extract directory, copied all the required .jar files and the
solr-cell jar into it, and gave its path in the lib tag in solrconfig.xml.
When I run this on Windows 7:
curl "http://localhost:8080/solr/update/extract?literal.doc_id=1&commit=true" -F myfile=@solr-word.pdf
I get "/solr/update/extract is not available", and sometimes an access
denied error.
I tried to resolve this by searching the web, but in vain: all the solutions
I found are for Linux, and I am working on Windows.
Please help me with a solution that applies to Windows.
I was following the Apache Solr 4 Cookbook.
Thanks a lot.
Re: Indexing pdf files - question.
Posted by Nutan Shinde <nu...@gmail.com>.
The error is resolved. The solution: the <dynamicField /> element must be
inside the <fields> tag.
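For readers hitting the same error, the fix described above can be sketched as the following schema.xml fragment. This is only an illustration built from the fields in the original post, assuming the Solr 4.x schema layout used there; it is not the poster's actual final file. The only change is that the <dynamicField> declaration moves from after </types> to inside <fields>.

```xml
<!-- schema.xml fragment (sketch): <dynamicField> now sits inside <fields> -->
<fields>
  <field name="doc_id" type="integer" indexed="true" stored="true" multiValued="false"/>
  <field name="name" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="path" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="desc" type="text_split" indexed="true" stored="true" multiValued="false"/>
  <!-- In the original post this line came after </types>, outside <fields> -->
  <dynamicField name="*_i" type="integer" indexed="true" stored="true"/>
</fields>
<uniqueKey>doc_id</uniqueKey>
```

Note that the <types> block as posted lists "text" twice and does not define "text_split", which the desc field refers to. One of the two entries may have been meant as "text_split"; the thread does not say, so the fragment leaves the field definitions as posted.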
On Sun, Sep 8, 2013 at 3:31 AM, Furkan KAMACI <fu...@gmail.com> wrote:
> Could you show us the logs you get when you start your web container?
Re: Indexing pdf files - question.
Posted by Furkan KAMACI <fu...@gmail.com>.
Could you show us the logs you get when you start your web container?