You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@jspwiki.apache.org by Robert FORBES <rf...@highlinecorp.com> on 2009/12/05 01:35:08 UTC

Bocking Google Bot access

I don't want Google to index my pages, but I cannot find any means to block it.  As a last ditch effort, I have put the JSPWiki installation behind a firewall, but this is defeating the purpose for which it is intended.

I am using Tomcat right now as the app server, and I have a robots.txt file in the WebApps folder, the folder below that (where Wiki.jsp is), the WEB_INF folder, and anywhere else I can think of.  The robots file looks like this:

User-agent: *
Disallow: /

But it is not working.  Am I missing something obvious ? (likely)

Robert


Re: Bocking Google Bot access

Posted by Harry Metske <ha...@gmail.com>.
did you place robots.txt in the root of the web site hierarchy ?
Meaning the url for the robots.txt must be something like
http://mywonderfulsite.somewhere.org/robots.txt, and not ttp://
mywonderfulsite.somewhere.org/JSPWiki/robots.txt
Did you look at you access.log to see if robots.txt is asked for ?

/Harry

2009/12/5 Robert FORBES <rf...@highlinecorp.com>

> I don't want Google to index my pages, but I cannot find any means to block
> it.  As a last ditch effort, I have put the JSPWiki installation behind a
> firewall, but this is defeating the purpose for which it is intended.
>
> I am using Tomcat right now as the app server, and I have a robots.txt file
> in the WebApps folder, the folder below that (where Wiki.jsp is), the
> WEB_INF folder, and anywhere else I can think of.  The robots file looks
> like this:
>
> User-agent: *
> Disallow: /
>
> But it is not working.  Am I missing something obvious ? (likely)
>
> Robert
>
>