You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by al...@aim.com on 2007/12/21 20:54:51 UTC

pdf parsing

I parsed a few sites with pdf files. Then added one more site to urls file. Now, nutch does not parse pdf's at all.

Any ideas what is wrong.

Thanks.
Alex.

________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com

Fwd: pdf parsing

Posted by al...@aim.com.
Hello,


When I try to parse audio files, nutch gives error "No textual content available".

What might be wrong?




Thanks.
Alex.

________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - 
http://webmail.aim.com



 


________________________________________________________________________
More new features than ever.  Check out the new AIM(R) Mail ! - http://webmail.aim.com