Posted to users@httpd.apache.org by Aaron <mi...@microchp.org> on 2002/03/07 20:44:16 UTC

Re: Possible rehash of a question, but looking for a different solution

OK, I figured it out. Here is the solution for anyone else who needs it.

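# Log format that writes each hit as an access-control line
LogFormat "deny from %a" spiders

# Flag any request whose URI matches one of the trap directory names
SetEnvIfNoCase Request_URI bogus_hidden_dir1_here spiderlog
SetEnvIfNoCase Request_URI bogus_hidden_dir2_here spiderlog

# Append flagged requests to .htaccess so the offending address gets denied
CustomLog /your/document_root/.htaccess spiders env=spiderlog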

Simple, crude, but it works.  Most spiders/mirroring proggies won't know ahead
of time which directories are traps.
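
As a rough sketch of the effect: each request that matches one of the trap
names appends one line to the .htaccess file. The addresses below are made-up
documentation addresses, and the <Directory> block is my assumption about the
surrounding configuration rather than part of the setup above. As far as I
know, the server only honors "deny from" lines in .htaccess when AllowOverride
includes Limit for that directory, and on Apache 2.4 these directives need
mod_access_compat.

```apache
# In httpd.conf (assumed, not part of the original setup): let .htaccess
# files under the document root use Order/Allow/Deny. Path is a placeholder.
<Directory "/your/document_root">
    AllowOverride Limit
</Directory>

# What /your/document_root/.htaccess might contain after two trap hits.
# Example addresses only; one line is appended per matching request.
deny from 192.0.2.10
deny from 198.51.100.23
```

A client that hits a trap more than once just produces duplicate lines, which
should be harmless.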

Aaron


On Thu, 07 Mar 2002 08:46:55 -0800
 "Aaron" <mi...@microchp.org> wrote:
> I have seen many ways using CGI scripts and User-Agent variables to block
> spiders from an Apache web site.
> 
> I am looking for something a little more complex that can be done in the
> httpd.conf file (so that it is not dependent on anything external) which
> will detect when a user has accessed a spam-bot/spider page, and will then
> set a variable which will follow them throughout their session.
> 
> I tried playing around with Request_URI, but that is only per-request.
> 
> The reason that I do not rely upon User-Agent is that most mirroring
> programs now allow people to simply mimic a normal web browser.
> 
> Does anyone know of a way within any Apache install, whether it be on
> Linux, BSD, Win2k, AIX or whatever, that I can trap users who hit a page
> that would not otherwise be accessed by normal browsing?
> 
> I apologize if this has already been beaten to death, though I did not see
> anything quite to this detail in the archives.
> 
> Thanks,
> 
> Aaron
> 
> ---------------------------------------------------------------------
> The official User-To-User support forum of the Apache HTTP Server Project.
> See <URL:http://httpd.apache.org/userslist.html> for more info.
> To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
> For additional commands, e-mail: users-help@httpd.apache.org
> 

