Posted to user@nutch.apache.org by Alp Timurhan Çevik <at...@turkguven.com> on 2015/08/10 02:09:01 UTC

tika problem

While trying to use 2.4.x with Tika 1.8 (actually, in order to use
Tesseract for OCR), Tika could not parse application/pdf files. The
mapping looks correct: in the plugin-xml, * is routed to Tika, yet the
log states that Tika cannot handle application/pdf.

Any ideas?

Cheers,
Alp