Posted to user@nutch.apache.org by Cam Bazz <ca...@gmail.com> on 2011/07/17 21:36:33 UTC

custom encoding - or encoding detection does not work

Hello,

How is encoding detection done, and at what stage: fetching or parsing?

I ask because some pages that I am working with are full of errors, such as
declaring the encoding in the meta tag incorrectly, declaring it twice, etc.

Would it be possible to tell Nutch to use a specific encoding for a given site?

Best Regards,
C.B.