You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Zaheed Haque <za...@gmail.com> on 2005/09/21 09:57:26 UTC
Fwd: your hostname
Hi:
I follow the agent mailing list and the following e.mail might be of
interest. I think tutorial should fix the problem. A simple solution
could also be a "sample file" containing values that are specific to
user installation lifted up to nutch-site.xml (i.e crawl-filter,
MY.DOMAIN.COM example). Offcourse this should be done prior to running
crawler (i.e. read documentation) but the fact of the matter is after
installation you just want to run the crawler and get busy :-)
Cheers
Zaheed
nutch-default.xml
<property>
<name>http.agent.email</name>
<value>nutch-agent@lucene.apache.org</value>
<description>An email address to advertise in the HTTP 'From' request
header and User-Agent header.</description>
</property>
---------- Forwarded message ----------
From: Edgar Müller <ed...@strassenmalerei.com>
Date: Sep 21, 2005 4:49 AM
Subject: your hostname
To: nutch-agent@lucene.apache.org
hello,
could you please tell me your hostname?
someone is crawling my site with your name. it's host is
turingc.cs.washington.edu
is that your host?
thanks allot, edgar müller
_________________________________________________________
Haben Sie schon mal eine Seite über Strassenmalerei besucht?
www.strassenmalerei.com
Informationsseite über Strassenmalerei. Was ist wenn es regnet? Wie
lange braucht man für ein Strassenbild?
Stellen Sie Ihre Fragen im Forum. www.strassenmaler-info.de
Alle Bilder des traditionsreichen Festivals der Strassenmaler in
Geldern als Onlinegalerie. www.rettet-die-bilder.de
Projekt mit Alexander Wild. Das Webverzeichnis ist noch ganz neu und
wartet auf Einträge. www.onwork.de
Edgar Müller
Schulstrasse 9
56132 Becheln
fon: 0049-(0)-2603 931367
mobil: 0049-(0)-172 6982874