Posted to user@nutch.apache.org by Jair Piedrahita Vargas <JA...@bancolombia.com.co> on 2009/09/05 00:03:51 UTC

Authentication

I'm trying to crawl an intranet, but the crawler cannot access some pages, I think because of authentication problems.
Which configuration option do I have to change to get a better result?

Thanks in advance

Regards,

Jair Piedrahíta Vargas


Re: Authentication

Posted by "David M. Cole" <dm...@colegroup.com>.
At 5:03 PM -0500 9/4/09, Jair Piedrahita Vargas wrote:
>Which configuration option do I have to change to get a better result?

Have you visited

http://wiki.apache.org/nutch/HttpAuthenticationSchemes

?

\dmc

-- 
*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+
    David M. Cole                                            dmc@colegroup.com
    Editor & Publisher, NewsInc. <http://newsinc.net>        V: (650) 557-2993
    Consultant: The Cole Group <http://colegroup.com/>       F: (650) 475-8479
*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+*+