You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@tomcat.apache.org by Brian Braun <br...@gmail.com> on 2011/05/20 17:03:18 UTC
Regular expression for the Crawler Session Manager Valve?
Hi Mark Thomas and everybody else,
I just discovered the new valve, the *Crawler Session Manager Valve*. It
deals with the search engine bots, making them use just one session among
their requests (one session for each bot). I see that it includes a
default regular expression for detecting the bots, which I guess is not
intended to detect every bot available.
Has anybody created a more complete regular expression for that? It would be
great if such a reg ex existed and was published for the whole world.
Brian
Re: Regular expression for the Crawler Session Manager Valve?
Posted by Christopher Schultz <ch...@christopherschultz.net>.
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Brian,
On 5/20/2011 11:03 AM, Brian Braun wrote:
> I just discovered the new valve, the *Crawler Session Manager Valve*. It
> deals with the search engine bots, making them use just one session among
> their requests (one session for each bot). I see that it includes a
> default regular expression for detecting the bots, which I guess is not
> intended to detect every bot available.
Doesn't look like it.
> Has anybody created a more complete regular expression for that? It would be
> great if such a reg ex existed and was published for the whole world.
You could write your own... starting with the information available here:
http://www.robotstxt.org/db.html
- -chris
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAk3WidsACgkQ9CaO5/Lv0PDIWgCfTt099z9nWvnaABWLOJFH0Emh
tu0An257Alw/6R/lUGcvtRKumrPfy1EA
=xBnD
-----END PGP SIGNATURE-----
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@tomcat.apache.org
For additional commands, e-mail: users-help@tomcat.apache.org