You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Erlend Garåsen <e....@usit.uio.no> on 2011/01/27 11:06:29 UTC

Tika config in ExtractingRequestHandler

The wiki page for the ExtractingRequestHandler says that I can add the 
following configuration:
<str name="tika.config">/my/path/to/tika.config</str>

I have tried to google for an example of such a Tika config file, but 
haven't found anything.

Erlend

-- 
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050

Re: Tika config in ExtractingRequestHandler

Posted by Lance Norskog <go...@gmail.com>.
The tika.config file is obsolete. I don't know what replaces it.

On 1/27/11, Erlend Garåsen <e....@usit.uio.no> wrote:
>
> If this configuration file is the same as the tika-mimetypes.xml file
> inside Nutch' conf file, I have an example.
>
> I was trying to implement language detection for Solr and thought I had
> to invoke some Tika functionality by this configuration file in order to
> do so, but found out that I could rewrite some of the
> ExtractingRequestHandler classes instead.
>
> Erlend
>
> On 27.01.11 16.12, Adam Estrada wrote:
>> I believe that as along as Tika is included in a folder that is
>> referenced by solrconfig.xml you should be good. Solr will
>> automatically throw mime types to Tika for parsing. Can anyone else
>> add to this?
>>
>> Thanks,
>> Adam
>>
>> On Thu, Jan 27, 2011 at 5:06 AM, Erlend Garåsen<e....@usit.uio.no>
>> wrote:
>>>
>>> The wiki page for the ExtractingRequestHandler says that I can add the
>>> following configuration:
>>> <str name="tika.config">/my/path/to/tika.config</str>
>>>
>>> I have tried to google for an example of such a Tika config file, but
>>> haven't found anything.
>>>
>>> Erlend
>>>
>>> --
>>> Erlend Garåsen
>>> Center for Information Technology Services
>>> University of Oslo
>>> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
>>> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP:
>>> 31050
>>>
>
>
> --
> Erlend Garåsen
> Center for Information Technology Services
> University of Oslo
> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
>


-- 
Lance Norskog
goksron@gmail.com

Re: Tika config in ExtractingRequestHandler

Posted by Erlend Garåsen <e....@usit.uio.no>.
If this configuration file is the same as the tika-mimetypes.xml file 
inside Nutch' conf file, I have an example.

I was trying to implement language detection for Solr and thought I had 
to invoke some Tika functionality by this configuration file in order to 
do so, but found out that I could rewrite some of the 
ExtractingRequestHandler classes instead.

Erlend

On 27.01.11 16.12, Adam Estrada wrote:
> I believe that as along as Tika is included in a folder that is
> referenced by solrconfig.xml you should be good. Solr will
> automatically throw mime types to Tika for parsing. Can anyone else
> add to this?
>
> Thanks,
> Adam
>
> On Thu, Jan 27, 2011 at 5:06 AM, Erlend Garåsen<e....@usit.uio.no>  wrote:
>>
>> The wiki page for the ExtractingRequestHandler says that I can add the
>> following configuration:
>> <str name="tika.config">/my/path/to/tika.config</str>
>>
>> I have tried to google for an example of such a Tika config file, but
>> haven't found anything.
>>
>> Erlend
>>
>> --
>> Erlend Garåsen
>> Center for Information Technology Services
>> University of Oslo
>> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
>> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
>>


-- 
Erlend Garåsen
Center for Information Technology Services
University of Oslo
P.O. Box 1086 Blindern, N-0317 OSLO, Norway
Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050

Re: Tika config in ExtractingRequestHandler

Posted by Adam Estrada <es...@gmail.com>.
I believe that as along as Tika is included in a folder that is
referenced by solrconfig.xml you should be good. Solr will
automatically throw mime types to Tika for parsing. Can anyone else
add to this?

Thanks,
Adam

On Thu, Jan 27, 2011 at 5:06 AM, Erlend Garåsen <e....@usit.uio.no> wrote:
>
> The wiki page for the ExtractingRequestHandler says that I can add the
> following configuration:
> <str name="tika.config">/my/path/to/tika.config</str>
>
> I have tried to google for an example of such a Tika config file, but
> haven't found anything.
>
> Erlend
>
> --
> Erlend Garåsen
> Center for Information Technology Services
> University of Oslo
> P.O. Box 1086 Blindern, N-0317 OSLO, Norway
> Ph: (+47) 22840193, Fax: (+47) 22852970, Mobile: (+47) 91380968, VIP: 31050
>