Posted to solr-user@lucene.apache.org by Nutan Shinde <nu...@gmail.com> on 2013/09/04 13:55:40 UTC

Re: Indexing pdf files - question.

My solrconfig.xml is:

 

<requestHandler name="/update/extract" class="solr.extraction.ExtractingRequestHandler">
  <lst name="defaults">
    <!-- to map this field of my table, which is defined as shown below in schema.xml -->
    <str name="fmap.content">desc</str>
    <str name="lowernames">true</str>
    <str name="uprefix">attr_</str>
    <str name="captureAttr">true</str>
  </lst>
</requestHandler>

<lib dir="../../extract" regex=".*\.jar" />

 

schema.xml:

<fields>
  <field name="doc_id" type="integer" indexed="true" stored="true" multiValued="false"/>
  <field name="name" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="path" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="desc" type="text_split" indexed="true" stored="true" multiValued="false"/>
</fields>

<types>
  <fieldType name="string" class="solr.StrField" />
  <fieldType name="integer" class="solr.IntField" />
  <fieldType name="text" class="solr.TextField" />
  <fieldType name="text" class="solr.TextField" />
</types>

<dynamicField name="*_i" type="integer" indexed="true" stored="true"/>

<uniqueKey>doc_id</uniqueKey>

 

I created an extract directory, copied all the required .jar files and the
solr-cell jar into it, and gave its path in the lib tag in solrconfig.xml.

 

When I try this on Windows 7:

curl "http://localhost:8080/solr/update/extract?literal.doc_id=1&commit=true" -F myfile=@solr-word.pdf

 

I get "/solr/update/extract is not available", and sometimes I get an
"access denied" error.

I tried to resolve this by searching the net, but in vain, as all the
solutions I found are for Linux and I am working on Windows.

Please help me with a solution that applies to Windows.

I referred to the Apache Solr 4 Cookbook.

Thanks a lot.


Re: Indexing pdf files - question.

Posted by Nutan Shinde <nu...@gmail.com>.
The error is resolved. The solution was that the <dynamicField /> element
must be inside the <fields> tag.
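
A minimal sketch of what that fix might look like in schema.xml, assuming the Solr 4.x layout in which <fields> wraps both <field> and <dynamicField> elements. It reuses the field names from the original post. The text_split type used by desc is not defined in the posted snippet, so this sketch assumes it is declared elsewhere in <types>.

```xml
<fields>
  <field name="doc_id" type="integer" indexed="true" stored="true" multiValued="false"/>
  <field name="name" type="text" indexed="true" stored="true" multiValued="false"/>
  <field name="path" type="text" indexed="true" stored="true" multiValued="false"/>
  <!-- assumes a text_split fieldType is declared in <types> -->
  <field name="desc" type="text_split" indexed="true" stored="true" multiValued="false"/>

  <!-- moved inside <fields>, per the resolution above -->
  <dynamicField name="*_i" type="integer" indexed="true" stored="true"/>
</fields>

<uniqueKey>doc_id</uniqueKey>
```

Solr has to be restarted (or the core reloaded) for a schema change to take effect.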


On Sun, Sep 8, 2013 at 3:31 AM, Furkan KAMACI <fu...@gmail.com> wrote:

> Could you show us logs you get when you start your web container?

Re: Indexing pdf files - question.

Posted by Furkan KAMACI <fu...@gmail.com>.
Could you show us logs you get when you start your web container?

