You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by hadi <md...@gmail.com> on 2011/09/09 23:34:36 UTC

index video and image format with nutch 1.3?

when i want to index video file with nutch 1.3 i get the following error : 

*Error parsing: file:///D:/film.avi: failed(2,0): Can't retrieve Tika parser
for
   mime-type video/x-msvideo*
(also it is the same error for images file)

and in hadoop log the detail error is:

*parse.ParserFactory - ParserFactory:Plugin:
org.apache.nutch.parse.feed.FeedParser mapped to contentType video/x-msvideo
via parse-plugins.xml, but its plugin.xml file does not claim to support
contentType: video/x-msvideo*

i metioned that i add the following config in parse-plugins.xml:

*<mimeType name="video/x-msvideo">
        <plugin id="parse-tika" />
        <plugin id="feed" />
</mimeType>*

also add the folowing config in nutch-site.xml

*<property>
  <name>plugin.includes</name>
 
<value>nutch-extensionpoints|protocol-file|protocol-http|urlfilter-regex|parse-(html|tika|pdf|zip|avi)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>  
</property>*

but the it doesn't work and get the same tika error,please help me

--
View this message in context: http://lucene.472066.n3.nabble.com/index-video-and-image-format-with-nutch-1-3-tp3324172p3324172.html
Sent from the Apache Tika - Development mailing list archive at Nabble.com.

Re: index video and image format with nutch 1.3?

Posted by Julien Nioche <li...@gmail.com>.
This is not a Tika issue. Ask this on the Nutch user list instead.

On 9 September 2011 22:34, hadi <md...@gmail.com> wrote:

> when i want to index video file with nutch 1.3 i get the following error :
>
> *Error parsing: file:///D:/film.avi: failed(2,0): Can't retrieve Tika
> parser
> for
>   mime-type video/x-msvideo*
> (also it is the same error for images file)
>
> and in hadoop log the detail error is:
>
> *parse.ParserFactory - ParserFactory:Plugin:
> org.apache.nutch.parse.feed.FeedParser mapped to contentType
> video/x-msvideo
> via parse-plugins.xml, but its plugin.xml file does not claim to support
> contentType: video/x-msvideo*
>
> i metioned that i add the following config in parse-plugins.xml:
>
> *<mimeType name="video/x-msvideo">
>        <plugin id="parse-tika" />
>        <plugin id="feed" />
> </mimeType>*
>
> also add the folowing config in nutch-site.xml
>
> *<property>
>  <name>plugin.includes</name>
>
>
> <value>nutch-extensionpoints|protocol-file|protocol-http|urlfilter-regex|parse-(html|tika|pdf|zip|avi)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
> </property>*
>
> but the it doesn't work and get the same tika error,please help me
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/index-video-and-image-format-with-nutch-1-3-tp3324172p3324172.html
> Sent from the Apache Tika - Development mailing list archive at Nabble.com.
>



-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Re: index video and image format with nutch 1.3?

Posted by hadi <md...@gmail.com>.
but i think it is related to tila parser,because the error related to
it,however where is the nutch section in this site?

--
View this message in context: http://lucene.472066.n3.nabble.com/index-video-and-image-format-with-nutch-1-3-tp3324172p3325835.html
Sent from the Apache Tika - Development mailing list archive at Nabble.com.