You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@httpd.apache.org by Aaron <mi...@microchp.org> on 2002/03/07 20:44:16 UTC
Re: Possible rehash of a question, but looking for a different
solution
Ok, I figured it out for anyone else who needs this.
LogFormat "deny from %a" spiders
SetEnvIfNoCase Request_URI bogus_hidden_dir1_here spiderlog
SetEnvIfNoCase Request_URI bogus_hidden_dir2_here spiderlog
CustomLog /your/document_root/.htaccess spiders env=spiderlog
Simple, crude, but works. Most spiders/mirroring proggies wont know ahead of
time what not to look for.
Aaron
On Thu, 07 Mar 2002 08:46:55 -0800
"Aaron" <mi...@microchp.org> wrote:
> I have seen many ways using cgi scripts and User-Agent variables to block
> spiders from an Apache web site.
>
> I am looking for something a little more complex that can be done in the
> httpd.conf file (so that it is not dependant on anything external) which
> will
> detect when a user has accessed a spam-bot/spider page, and will then set a
> variable which will follow them throught their sessions.
>
> I tried playing around with Request_URI but that is only 'per-request'.
>
> The reason that I do not rely upon User-Agent, is that most mirroring
> programs
> now allow people to simply mimic a normal web browser.
>
> Does anyone know of a way within 'Any' apache install, wether it be on
> Linux,
> BSD, Win2k, AIX or whatever, that I can trap users which hit a page that
> would
> not otherwise be accessed by normal browsing?
>
> I apogize if this has already been beaten to death, though I did not see
> anything quite to this detail in the archives.
>
> Thanks,
>
> Aaron
>
> ---------------------------------------------------------------------
> The official User-To-User support forum of the Apache HTTP Server Project.
> See <URL:http://httpd.apache.org/userslist.html> for more info.
> To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
> For additional commands, e-mail: users-help@httpd.apache.org
>
---------------------------------------------------------------------
The official User-To-User support forum of the Apache HTTP Server Project.
See <URL:http://httpd.apache.org/userslist.html> for more info.
To unsubscribe, e-mail: users-unsubscribe@httpd.apache.org
For additional commands, e-mail: users-help@httpd.apache.org