Posted to users@httpd.apache.org by Jonathan Hayward <ch...@gmail.com> on 2010/01/04 04:35:26 UTC

[users@httpd] Whole HTML pages in error log (1.3.x)

Our system has rapidly growing error logs filling up the disk. It appears
that much or all of the traffic to http://JonathansCorner.com/ is logged:

[Sun Jan 03 21:08:25 2010] [warn] Cannot get media type from 'text.html'
[Sun Jan 03 21:08:25 2010] [error] [client ##.##.##.##] <!doctype html public "-//w3c//dtd html 4.0 transitional//en">
[Sun Jan 03 21:08:25 2010] [error] [client ##.##.##.##] <html>
[Sun Jan 03 21:08:25 2010] [error] [client ##.##.##.##] <head>
[Sun Jan 03 21:08:25 2010] [error] [client ##.##.##.##] <link rel="icon" href="/favicon.ico" type="image/x-icon">
[Sun Jan 03 21:08:25 2010] [error] [client ##.##.##.##] --rest of webpage snipped--

Why would Apache 1.3.x be pulling a media-type of text.html? The files that
would be pulled are straight, uninterpreted HTML pages--no PHP or
server-side parsing involved, and the string "text.html" does not appear in
httpd.conf.
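
[Editor's sketch] A media type is normally written as type/subtype, so 'text.html' (with a dot instead of a slash) cannot be split into the two parts. One place to look, assuming the malformed value comes from an AddType, ForceType, or DefaultType directive in a .htaccess or included file (the thread does not confirm this), is every config file other than httpd.conf. The helper below is hypothetical and is not Apache's own parser; it only flags directive arguments that lack the type/subtype shape.

```python
import re

# type "/" subtype, built from characters commonly allowed in media type tokens
MEDIA_TYPE = re.compile(r"^[A-Za-z0-9!#$&^_.+-]+/[A-Za-z0-9!#$&^_.+-]+$")

# Directives whose first argument is a media type (assumed list, not exhaustive)
DIRECTIVES = {"addtype", "forcetype", "defaulttype"}


def find_bad_media_types(config_text):
    """Return (line_number, directive, value) for media types not shaped like type/subtype."""
    bad = []
    for lineno, line in enumerate(config_text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue  # skip blank lines and comment lines
        parts = stripped.split()
        if len(parts) >= 2 and parts[0].lower() in DIRECTIVES:
            value = parts[1].strip('"')
            if value.lower() != "none" and not MEDIA_TYPE.match(value):
                bad.append((lineno, parts[0], value))
    return bad


# Hypothetical .htaccess contents, for illustration only
sample = """\
# example file
AddType text.html .html
AddType image/x-icon .ico
"""
print(find_bad_media_types(sample))
```

Running the same function over each .htaccess file under the document root would show whether a typo of this kind exists outside httpd.conf.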

What can or should I be doing so that the pages are served up text/html and
not logged to the error log in typical requests?

-- 
→ Jonathan Hayward, christos.jonathan.hayward@gmail.com
→ An Orthodox Christian author: theology, literature, et cetera.
→ My award-winning collection is available for free reading online:
☩ I invite you to visit my main site at http://JonathansCorner.com/

Re: [users@httpd] Whole HTML pages in error log (1.3.x)

Posted by Jonathan Hayward <ch...@gmail.com>.
On Mon, Jan 4, 2010 at 5:54 AM, Daniel Reinhardt
<cr...@cryptodan.net> wrote:
> Have a read here: http://www.ietf.org/rfc/rfc2854.txt

Thank you; I've gone through and corrected the bad DOCTYPE. I'm still
getting errors, though. I can't reproduce them by identifying the
logged page and loading it in Firefox, but the error log now shows:

[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en">
[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] <head>
[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] <link rel="icon" href="/favicon.ico" type="image/x-icon" />
[Sun Jan 24 19:23:12 2010] [error] [client ##.##.##.##] --rest of webpage snipped--

And the webpage (I checked) validates as XHTML 1.0 Transitional.

Are there ways some user-agent could make a request that would get a
webpage to be logged? Content negotiation?
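
[Editor's sketch] Since the dumps cannot be reproduced from a browser, one way to narrow down which requests trigger them is to pull the timestamp and client address of each dump out of the error log and compare them with the access log entries for the same second. The sketch below assumes the error log line layout shown above; the function name and the sample client address 192.0.2.10 are made up for illustration.

```python
import re
from collections import Counter

# Matches lines shaped like: [timestamp] [level] [client address] message
LOG_LINE = re.compile(
    r"^\[(?P<time>[^\]]+)\] \[(?P<level>\w+)\] \[client (?P<client>[^\]]+)\] (?P<msg>.*)$"
)


def html_dump_starts(log_lines):
    """Count, per (timestamp, client), the error entries that begin an HTML document."""
    hits = Counter()
    for line in log_lines:
        m = LOG_LINE.match(line)
        if not m or m.group("level") != "error":
            continue  # lines without a [client ...] field or with another level are ignored
        if m.group("msg").lower().startswith("<!doctype"):
            hits[(m.group("time"), m.group("client"))] += 1
    return hits


# Hypothetical excerpt, for illustration only
lines = [
    "[Sun Jan 24 19:23:12 2010] [error] [client 192.0.2.10] <!DOCTYPE html PUBLIC",
    "[Sun Jan 24 19:23:12 2010] [error] [client 192.0.2.10] <head>",
    "[Sun Jan 24 19:23:12 2010] [warn] Cannot get media type from 'text.html'",
]
for (when, client), count in html_dump_starts(lines).items():
    print(when, client, count)
```

Each (timestamp, client) pair can then be looked up in the access log to see the request line and User-Agent involved, which may show whether a particular kind of request is responsible.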

Thanks,



Re: [users@httpd] Whole HTML pages in error log (1.3.x)

Posted by Daniel Reinhardt <cr...@cryptodan.net>.

Have a read here: http://www.ietf.org/rfc/rfc2854.txt