You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Richard Braman <rb...@bramantax.com> on 2006/03/26 06:53:48 UTC

Increase in java.io.EOFException in HttpResponse.readLine(HttpResponse.java:423)

I have noticed an increase in these types of exceptions, something I
didn't see so much in Nutch .7

I am using trunk.

060325 234735 parsing file:/T:/nutch-trunk/conf/hadoop-default.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/nutch-default.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/nutch-site.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/hadoop-site.xml
060325 234735 Fetcher: starting
060325 234735 Fetcher: segment: taxcrawl\segments\20060325234543
060325 234735 parsing file:/T:/nutch-trunk/conf/hadoop-default.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/nutch-default.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/nutch-site.xml
060325 234735 parsing file:/T:/nutch-trunk/conf/hadoop-site.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/nutch-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/nutch-site.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-site.xml
060325 234739 Running job: job_fgk859
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234739 parsing
\nutch\filesystem\mapreduce\local\job_fgk859.xml\localRunner
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-site.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234739 parsing
\nutch\filesystem\mapreduce\local\job_fgk859.xml\localRunner
060325 234739 parsing file:/T:/nutch-trunk/conf/mapred-default.xml
060325 234739 parsing file:/T:/nutch-trunk/conf/hadoop-site.xml
060325 234740 Fetcher: threads: 10
060325 234740 Plugins: looking in: T:\nutch-trunk\build\plugins
060325 234740  map 0%  reduce 0%
060325 234740 Plugin Auto-activation mode: [true]
060325 234740 Registered Plugins:
060325 234740 	CyberNeko HTML Parser (lib-nekohtml)
060325 234740 	Site Query Filter (query-site)
060325 234740 	Html Parse Plug-in (parse-html)
060325 234740 	Regex URL Filter Framework (lib-regex-filter)
060325 234740 	Pdf Parse Plug-in (parse-pdf)
060325 234740 	Basic Indexing Filter (index-basic)
060325 234740 	Text Parse Plug-in (parse-text)
060325 234740 	JavaScript Parser (parse-js)
060325 234740 	Regex URL Filter (urlfilter-regex)
060325 234740 	Basic Query Filter (query-basic)
060325 234740 	HTTP Framework (lib-http)
060325 234740 	URL Query Filter (query-url)
060325 234740 	Http Protocol Plug-in (protocol-http)
060325 234740 	the nutch core extension points (nutch-extensionpoints)
060325 234740 Registered Extension-Points:
060325 234740 	Nutch Protocol (org.apache.nutch.protocol.Protocol)
060325 234740 	Nutch URL Filter (org.apache.nutch.net.URLFilter)
060325 234740 	HTML Parse Filter
(org.apache.nutch.parse.HtmlParseFilter)
060325 234740 	Nutch Online Search Results Clustering Plugin
(org.apache.nutch.clustering.OnlineClusterer)
060325 234740 	Nutch Indexing Filter
(org.apache.nutch.indexer.IndexingFilter)
060325 234740 	Nutch Content Parser (org.apache.nutch.parse.Parser)
060325 234740 	Ontology Model Loader
(org.apache.nutch.ontology.Ontology)
060325 234740 	Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
060325 234740 	Nutch Query Filter
(org.apache.nutch.searcher.QueryFilter)
060325 234740 found resource crawl-urlfilter.txt at
file:/T:/nutch-trunk/conf/crawl-urlfilter.txt
060325 234740 found resource parse-plugins.xml at
file:/T:/nutch-trunk/conf/parse-plugins.xml
060325 234740
taxcrawl\segments\20060325234543\crawl_generate\part-00000:0+2568622
060325 234740 fetching http://www.sctax.org/NR/System/WBC
060325 234740 fetching http://www.tax.state.ak.us/
060325 234740 fetching http://www.sctax.org/
060325 234740 fetching http://tax.utah.gov/forms
060325 234740 fetching http://www.myflorida.com/dor/leaving/text/css
060325 234740 fetching http://www.revenue.alabama.gov/eft/eftfaq.html
060325 234740 fetching http://taxes.state.mn.us/gclogon
060325 234740 fetching
http://tax.ohio.gov/dynamicforms/dynamicredirect.asp
060325 234740 fetching
http://www.tennessee.gov/revenue/taxassetauctions/skipnav
060325 234740 fetching http://revenue.ky.gov/calendar.htm
060325 234740 http.proxy.host = null
060325 234740 http.proxy.port = 8080
060325 234740 http.timeout = 10000
060325 234740 http.content.limit = -1
060325 234740 http.agent = NutchCVS/0.8-dev (Nutch;
http://lucene.apache.org/nutch/bot.html; nutch-agent@lucene.apache.org)
060325 234740 fetcher.server.delay = 5000
060325 234740 http.max.delays = 1000
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234741 fetch of http://www.myflorida.com/dor/leaving/text/css
failed with: java.io.EOFException
060325 234741 fetching
http://www.myflorida.com/dor/forms/download/</form>\n
060325 234741 fetching
http://www.michigan.gov/treasury/0,1607,7-121-1751_2194---,00.html
060325 234741 fetching
http://www.mstc.state.ms.us/taxareas/individ/efiling/nactpinfo/recspec05
.pdf
060325 234741 fetching http://www.tax.virginia.gov/Ind/refund.cfm
060325 234741 0 pages, 1 errors, 0.0 pages/s, 0 kb/s, 
060325 234741 fetching http://www.sctax.org/Search+DOR/
060325 234742 fetching
http://www.michigan.gov/treasury/0,1607,7-121-3245---F,00.html
060325 234743 0 pages, 1 errors, 0.0 pages/s, 0 kb/s, 
060325 234743 Using URL normalizer:
org.apache.nutch.net.BasicUrlNormalizer
060325 234743 Using URL normalizer:
org.apache.nutch.net.BasicUrlNormalizer
060325 234743 ParserFactory:Plugin:
org.apache.nutch.parse.text.TextParser mapped to contentType
application/pdf via parse-plugins.xml, but its plugin.xml file does not
claim to support contentType: application/pdf
060325 234743 Using Signature impl: org.apache.nutch.crawl.MD5Signature
060325 234743 fetch of
http://www.mstc.state.ms.us/taxareas/individ/efiling/nactpinfo/recspec05
.pdf failed with: java.lang.NoClassDefFoundError:
org/apache/log4j/Layout
060325 234743 fetching
http://www.myflorida.com/dor/eservices/apps/enroll/text/css
060325 234743 fetching
http://www.mstc.state.ms.us/taxareas/misc/gasrule.htm
060325 234743 fetching
http://www.myflorida.com/dor/childsupport/text/css
060325 234743 fetching http://www.dor.mo.gov/tax/taxcredit
060325 234745 3 pages, 2 errors, 0.6 pages/s, 107 kb/s, 
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234746 fetch of
http://www.myflorida.com/dor/forms/download/</form>\n failed with:
java.io.EOFException
060325 234746 3 pages, 2 errors, 0.5 pages/s, 90 kb/s, 
060325 234746 fetching
http://www.michigan.gov/treasury/0,1607,7-121-1751_31038---,00.html
060325 234746 fetching
http://www.state.nj.us/treasury/revenue/interncrd.htm
060325 234747 fetching
http://www.myflorida.com/dor/forms/download/</div>\n
060325 234748 4 pages, 3 errors, 0.5 pages/s, 70 kb/s, 
060325 234748 fetching
http://tax.ohio.gov/RoboHTML/Frequently%20Asked%20Questions/!SSL!/Blue_V
evet/FAQ.htm
060325 234748 fetching
http://ftb.ca.gov/individuals/Bills/voluntary/rtcCodes.html
060325 234748 fetching
http://www.policylibrary.tax.virginia.gov/OTP/Policy.nsf/Welcome?OpenPag
e
060325 234749 7 pages, 3 errors, 0.8 pages/s, 90 kb/s, 
060325 234749 fetching http://www.sctax.org/shell/resources/eemail.pdf
060325 234749 fetching http://www.dor.state.wi.us/forms/misc/a-222.pdf
060325 234750 fetch of http://www.dor.state.wi.us/forms/misc/a-222.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234750 fetching
http://www.myflorida.com/dor/eservices/faqs/payment.html
060325 234750 8 pages, 4 errors, 0.8 pages/s, 87 kb/s, 
060325 234750 fetching
http://search.myflorida.com/dor/query.html?col=&ht=0&qp=&qs=&qc=&pw=100%
25&la=en&charset=iso-8859-1&si=0&fs=&qt=&ex=&rq=0&oq=&qm=1&ql=a&st=1&nh=
10&lk=1&rf=0
060325 234751 fetching
http://www.myflorida.com/dor/eservices/faqs/software.html
060325 234751 9 pages, 4 errors, 0.8 pages/s, 114 kb/s, 
060325 234751 fetching
http://www.ustaxcourt.gov/UstcInOp/asp/HistoricOptions.asp
060325 234752 fetching
http://search.myflorida.com/dor/query.html?col=&ht=0&qp=&qs=&qc=&pw=100%
25&la=en&charset=iso-8859-1&si=0&fs=&qt=&ex=&rq=0&oq=&qm=0&ql=&st=1&nh=1
0&lk=1&rf=0
060325 234753 11 pages, 4 errors, 0.8 pages/s, 112 kb/s, 
060325 234753 fetching http://tax.utah.gov/forms/index.html
060325 234754 12 pages, 4 errors, 0.9 pages/s, 119 kb/s, 
060325 234754 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=taxprofessionals
060325 234756 13 pages, 4 errors, 0.9 pages/s, 135 kb/s, 
060325 234756 fetching
http://www.revenue.state.co.us/racing_dir/wrap.asp
060325 234756 fetching
http://www.mstc.state.ms.us/edi/ediwithholding.htm
060325 234756 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=180&Q=245626
060325 234757 16 pages, 4 errors, 0.9 pages/s, 145 kb/s, 
060325 234757 fetching
http://www.mstc.state.ms.us/novellsearch/01-00UsingWebSearchform.htm
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234757 fetch of
http://www.myflorida.com/dor/childsupport/text/css failed with:
java.io.EOFException
060325 234757 fetching
http://www.rev.state.la.us/sections/aboutus/telephone.asp
060325 234757 fetching
http://www.revenue.alabama.gov/salestax/index.html
060325 234757 fetching
http://cfo.dc.gov/otr/cwp/view,A,1329,Q,634392.asp
060325 234758 18 pages, 5 errors, 1.0 pages/s, 145 kb/s, 
060325 234758 fetching
http://www.mstc.state.ms.us/taxareas/individ/efiling/nactpinfo/testpack0
5.pdf
060325 234759 fetching
http://www.michigan.gov/treasury/0,1607,7-121-1751_2197---,00.html
060325 234800 20 pages, 5 errors, 1.0 pages/s, 147 kb/s, 
060325 234800 fetching
http://www.revenue.state.co.us/liquor_dir/?sitesearch=www.revenue.state.
co.us&domains=www.revenue.state.co.us&q=
060325 234800 fetch of
http://www.revenue.state.co.us/liquor_dir/?sitesearch=www.revenue.state.
co.us&domains=www.revenue.state.co.us&q= failed with: Http code=403,
url=http://www.revenue.state.co.us/liquor_dir/?sitesearch=www.revenue.st
ate.co.us&domains=www.revenue.state.co.us&q=
060325 234800 fetching
http://www.revenue.state.pa.us/revenue/cwp/browse.asp?A=13&Q=34632
060325 234801 21 pages, 6 errors, 1.0 pages/s, 153 kb/s, 
060325 234801 fetching
http://www.michigan.gov/treasury/0,1607,7-121-1751_8013---,00.html
060325 234802 fetching http://www.mstc.state.ms.us/bankruptcy/Transcript
Instructions.pdf
060325 234802 22 pages, 6 errors, 1.0 pages/s, 148 kb/s, 
060325 234802 fetching
http://www.taxes.state.mn.us/taxes/use_of_information/use_of_information
.shtml
060325 234803 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=severanceforms
060325 234804 25 pages, 6 errors, 1.0 pages/s, 152 kb/s, 
060325 234804 fetch of http://www.sctax.org/shell/resources/eemail.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234804 fetching
http://www.dor.state.nc.us/practitioner/corporate/bulletins/2005-2006/II
I-C.pdf
060325 234805 fetch of
http://www.dor.state.nc.us/practitioner/corporate/bulletins/2005-2006/II
I-C.pdf failed with: java.lang.NoClassDefFoundError:
org/apache/log4j/Layout
060325 234805 fetching
http://www.access.gpo.gov/uscode/title26/subtitlei_chapter98_subchaptera
_.html
060325 234805 25 pages, 8 errors, 1.0 pages/s, 146 kb/s, 
060325 234805 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=individuals
060325 234805 fetching http://google.dor.state.ma.us/search
060325 234805 fetching http://www.dor.mo.gov/tax
060325 234806 27 pages, 8 errors, 1.0 pages/s, 151 kb/s, 
060325 234806 fetching
http://www.mt.gov/revenue/formsandresources/administrativerules/currentd
eptrules/chapters.asp
060325 234807 fetching
http://www.state.nj.us/treasury/revenue/crdscrpt.htm
060325 234807 fetching
http://www.revenue.state.pa.us/revenue/cwp/eMailPage.asp?a=180&t=Revenue
%3A+Public+Law+107%2D147&URL=http%3A%2F%2Fwww%2Erevenue%2Estate%2Epa%2Eu
s%2Frevenue%2Fcwp%2Fview%2Easp%3FA%3D180%26Q%3D186399
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234807 fetch of
http://www.myflorida.com/dor/forms/download/</div>\n failed with:
java.io.EOFException
060325 234807 29 pages, 8 errors, 1.1 pages/s, 158 kb/s, 
060325 234807 fetching
http://www.sctax.org/Forms+and+Instructions/Current+Years+Forms+and+Inst
ructions/
060325 234808 fetching
http://www.mstc.state.ms.us/taxareas/withhold/prioryears.htm
060325 234808 fetching
http://www.ct.gov/drs/cwp/view.asp?a=1433&Q=265872&PM=1&pp=12&n=1
060325 234809 31 pages, 9 errors, 1.1 pages/s, 158 kb/s, 
060325 234809 fetching
http://www.myflorida.com/dor/forms/download/</select>\n
060325 234809 fetching
http://www.ct.gov/drs/cwp/view.asp?a=1433&q=265872&pp=12&n=1
060325 234811 32 pages, 9 errors, 1.0 pages/s, 158 kb/s, 
060325 234811 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=salestaxforms
060325 234812 fetch of
http://www.mstc.state.ms.us/taxareas/individ/efiling/nactpinfo/testpack0
5.pdf failed with: java.lang.NoClassDefFoundError:
org/apache/log4j/Layout
060325 234812 32 pages, 9 errors, 1.0 pages/s, 153 kb/s, 
060325 234812 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=fyi_index_rev
060325 234812 fetching
http://www.revenue.state.pa.us/revenue/cwp/browse.asp?a=246&bc=0&c=35134
&revenuenavdltest=|690|&pp=0
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234813 fetch of
http://www.myflorida.com/dor/forms/download/</select>\n failed with:
java.io.EOFException
060325 234813 fetching http://www.myflorida.com/dor/taxes/text/css
060325 234813 fetching
http://www.myflorida.com/dor/forms/download/text/css
060325 234814 34 pages, 11 errors, 1.0 pages/s, 162 kb/s, 
060325 234815 34 pages, 11 errors, 1.0 pages/s, 157 kb/s, 
060325 234815 fetching http://www.tax.state.ny.us/wt/default.htm
060325 234816 fetching http://www.myflorida.com/dor/governments/text/css
060325 234817 36 pages, 11 errors, 1.0 pages/s, 160 kb/s, 
060325 234817 fetching
http://www.state.nj.us/treasury/revenue/enrolleft.htm
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234818 fetch of
http://www.myflorida.com/dor/forms/download/text/css failed with:
java.io.EOFException
060325 234818 37 pages, 12 errors, 1.0 pages/s, 156 kb/s, 
060325 234818 fetching
http://www.state.sd.us/drr2/reg/abstracters/abstractapp.pdf
060325 234818 fetching http://www.ustaxcourt.gov/UstcInOp/asp/Todays.asp
060325 234818 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=liquorexcise
060325 234820 39 pages, 12 errors, 1.0 pages/s, 163 kb/s, 
060325 234820 fetching http://revenue.ky.gov/laws/default.htm
060325 234820 fetch of
http://www.state.sd.us/drr2/reg/abstracters/abstractapp.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234820 fetching
http://www.revenue.state.ne.us/news_rel/jan_05/nr0105.htm
060325 234821 39 pages, 12 errors, 1.0 pages/s, 159 kb/s, 
060325 234821 fetching http://www.myflorida.com/dor/taxes/GT-800011.html
060325 234821 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=publiceducation
060325 234822 41 pages, 13 errors, 1.0 pages/s, 160 kb/s, 
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234823 fetch of http://www.myflorida.com/dor/taxes/text/css
failed with: java.io.EOFException
060325 234823 41 pages, 13 errors, 1.0 pages/s, 156 kb/s, 
060325 234823 fetching
http://tax.ohio.gov/divisions/communications/information_releases/index.
stm
060325 234823 fetching
http://www.access.gpo.gov/uscode/title26/subtitlei_chapter98_subchapterb
_.html
060325 234823 fetching
http://www.state.me.us/revenue/forms/property/propertytaxforms.htm
060325 234824 fetching http://www.dor.mo.gov/mvdl/motorv/complaints.htm
060325 234824 43 pages, 14 errors, 1.0 pages/s, 160 kb/s, 
060325 234824 fetching
http://www.myflorida.com/dor/eservices/faqs/internet.html
060325 234824 fetching http://ftb.ca.gov/law/disclosure/manual/1000.html
060325 234825 fetching
http://www.revenue.state.pa.us/revenue/cwp/browse.asp?A=13&Q=35001
060325 234825 fetching
http://www.mstc.state.ms.us/bankruptcy/taxtranscriptrequestform-draft.pd
f
060325 234825 46 pages, 14 errors, 1.0 pages/s, 174 kb/s, 
060325 234825 fetching
http://cfo.dc.gov/otr/lib/otr/information/pdf/bsa_2003_tax_changes.pdf
060325 234825 fetching http://www.state.nm.us/tax/forms/mvd/mvd10383.pdf
060325 234826 fetch of
http://cfo.dc.gov/otr/lib/otr/information/pdf/bsa_2003_tax_changes.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234826 fetching http://www.myflorida.com/dor/utils/redirect.jsp
060325 234826 fetch of
http://www.mstc.state.ms.us/bankruptcy/taxtranscriptrequestform-draft.pd
f failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234826 fetching http://www.state.nm.us/tax/pubs/nontax.pdf
060325 234827 48 pages, 16 errors, 1.0 pages/s, 171 kb/s, 
060325 234828 fetch of http://www.state.nm.us/tax/forms/mvd/mvd10383.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234828 48 pages, 16 errors, 1.0 pages/s, 167 kb/s, 
060325 234828 fetching
http://www.revenue.state.ne.us/news_rel/jan_05/ro012805.htm
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234828 fetch of http://www.myflorida.com/dor/utils/redirect.jsp
failed with: java.io.EOFException
060325 234828 fetching
http://www.revenue.state.pa.us/revenue/cwp/eMailPage.asp?a=246&t=Revenue
%3A+Statistical+Reports+%2D%2D+Property+Tax%2FRent+Rebate&URL=http%3A%2F
%2Fwww%2Erevenue%2Estate%2Epa%2Eus%2Frevenue%2Fcwp%2Fbrowse%2Easp%3Fa%3D
246%26bc%3D0%26c%3D35134%26revenueNavDLTEST%3D%7C690%7C%26tx%3D1
060325 234828 fetching
http://www.revenue.state.pa.us/revenue/cwp/eMailPage.asp?a=13&t=Revenue%
3A+Sales+%26+Use+Tax&URL=http%3A%2F%2Fwww%2Erevenue%2Estate%2Epa%2Eus%2F
revenue%2Fcwp%2Fbrowse%2Easp%3Fc%3D35424%26a%3D13%26BMDRN%3D2000%26BCOB%
3D0
060325 234829 49 pages, 18 errors, 1.0 pages/s, 164 kb/s, 
060325 234829 fetching http://www.state.hi.us/tax/pubs/trc_rpt_2003.pdf
060325 234830 50 pages, 18 errors, 1.0 pages/s, 166 kb/s, 
060325 234830 fetching http://www.in.gov/dor/news/2005/){this.value=
060325 234831 fetching
http://www.mstc.state.ms.us/taxareas/property/rules/advrules.htm
060325 234831 fetching
http://www.revenue.state.co.us/TPS_Dir/wrap.asp?incl=wagewithforms
060325 234832 52 pages, 18 errors, 1.0 pages/s, 162 kb/s, 
060325 234834 52 pages, 18 errors, 1.0 pages/s, 159 kb/s, 
060325 234834 fetching
http://www.state.nm.us/tax/forms/year03corporate.htm
060325 234834 fetch of http://www.state.nm.us/tax/pubs/nontax.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234834 fetching
http://www.revenue.state.pa.us/revenue/cwp/browse.asp?c=35424&a=13&BMDRN
=2000&BCOB=0&pp=0
060325 234835 53 pages, 19 errors, 1.0 pages/s, 156 kb/s, 
060325 234835 fetching http://www.dor.state.ma.us/rul_reg/oll_main.htm
060325 234836 54 pages, 19 errors, 1.0 pages/s, 160 kb/s, 
060325 234836 fetching
http://www.revenue.state.co.us/taxstatutesregs/wrap.asp?incl=mainpage/in
dexintro
060325 234836 fetching
http://www.state.sd.us/drr2/reg/bank/agenda040306.pdf
060325 234837 fetch of
http://www.state.sd.us/drr2/reg/bank/agenda040306.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234837 fetching http://www.dor.state.nc.us/faq/
060325 234837 56 pages, 20 errors, 1.0 pages/s, 162 kb/s, 
060325 234837 fetching
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/tax/property/pdf/TY200
6_Delay_Props_by_SSL_022806.pdf
060325 234837 fetch of
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/tax/property/pdf/TY200
6_Delay_Props_by_SSL_022806.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234837 fetching http://www.dor.state.nc.us/electronic/d400v.html
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234839 fetch of http://www.myflorida.com/dor/governments/text/css
failed with: java.io.EOFException
060325 234839 57 pages, 21 errors, 1.0 pages/s, 161 kb/s, 
060325 234839 fetching http://www.state.nm.us/tax/pubs/authstore.pdf
060325 234839 fetching http://www.state.nm.us/tax/ptd/delaofin.htm
060325 234839 fetch of http://www.state.hi.us/tax/pubs/trc_rpt_2003.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234839 fetching
http://www.revenue.state.ne.us/incentiv/statutes/829stats.htm
060325 234840 57 pages, 23 errors, 1.0 pages/s, 155 kb/s, 
060325 234840 fetching http://www.state.ia.us/tax/contact/mission.html
060325 234841 fetching
http://cfo.dc.gov/otr/cwp/view,A,1330,Q,594065,otrNav_gid,1679,otrNav,|3
3280|.asp
060325 234841 fetching
http://www.mstc.state.ms.us/taxareas/petrol/netinst0501.pdf
060325 234841 fetch of
http://www.mstc.state.ms.us/taxareas/petrol/netinst0501.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234841 60 pages, 23 errors, 1.0 pages/s, 170 kb/s, 
060325 234841 fetching
http://www.mstc.state.ms.us/taxareas/petrol/netinst0501.doc
060325 234842 fetching
http://www.taxes.state.mn.us/taxes/software_developers/index.shtml
060325 234842 fetching http://www.dor.state.wi.us/eserv/eft1.html
060325 234842 fetching http://www.state.nm.us/tax/ptd/deladdrs.htm
060325 234843 62 pages, 24 errors, 1.0 pages/s, 169 kb/s, 
060325 234843 fetching
http://www.marylandtaxes.com/publications/nr/current/main/calculator.asp
060325 234843 fetching
http://www.revenue.state.ne.us/incentiv/jobsact/econ_inc.htm
060325 234843 fetching
http://www.state.sd.us/drr2/seminar/border_issues/border_issues_seminars
.htm
060325 234844 64 pages, 24 errors, 1.0 pages/s, 172 kb/s, 
060325 234844 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter72_subchapterb
_.html
060325 234844 fetching http://www.tax.virginia.gov/Ind
060325 234844 fetching http://www.state.sd.us/drr2/reg/insurance/
060325 234845 66 pages, 24 errors, 1.0 pages/s, 171 kb/s, 
060325 234845 fetching
http://cfo.dc.gov/otr/cwp/view,a,1328,q,634154.asp
060325 234845 fetching http://www.dor.state.nc.us/taxes/rate.html
060325 234846 69 pages, 24 errors, 1.0 pages/s, 172 kb/s, 
060325 234847 ParserFactory: Plugin:
org.apache.nutch.parse.msword.MSWordParser mapped to contentType
application/msword via parse-plugins.xml, but not enabled via
plugin.includes in nutch-default.xml
060325 234847 fetching
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/tax/property/pdf/TY200
6_Delay_Props_by_Premise_Addr_022806.pdf
060325 234847 fetching http://www.state.sd.us/drr2/reg/securities/
060325 234848 71 pages, 24 errors, 1.1 pages/s, 177 kb/s, 
060325 234848 fetching
http://tax.ohio.gov/RoboHTML/Frequently%20Asked%20Questions/!SSL!/Blue_V
evet/whskin_frmset010.htm
060325 234848 fetching http://www.state.nm.us/tax/ptd/delinfo.htm
060325 234848 fetch of
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/tax/property/pdf/TY200
6_Delay_Props_by_Premise_Addr_022806.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234848 fetching
http://www.revenue.alabama.gov/motorvehicle/pdf/stfarm.pdf
060325 234848  map 1%  reduce 0%
060325 234849 fetch of
http://www.revenue.alabama.gov/motorvehicle/pdf/stfarm.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234849 fetching
http://www.michigan.gov/treasury/0,1607,7-121-28687---F,00.html
060325 234849 73 pages, 26 errors, 1.1 pages/s, 173 kb/s, 
060325 234849 fetching http://ftb.ca.gov/emailapps/formsXemail.asp
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234849 fetch of
http://www.myflorida.com/dor/eservices/apps/enroll/text/css failed with:
java.io.EOFException
060325 234849 fetching
http://www.state.sd.us/drr2/seminar/basictaxindex.htm
060325 234849 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=190&Q=210003
060325 234850 fetching http://dor.wa.gov/content/laws/eta/eta.aspx
060325 234850 fetching
http://www.state.sd.us/drr2/reg/appraisers/overview.htm
060325 234851 76 pages, 27 errors, 1.1 pages/s, 176 kb/s, 
060325 234851 fetching http://www.myflorida.com/dor/tips/text/css
060325 234851 fetch of http://www.state.nm.us/tax/pubs/authstore.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234851 fetching
http://www.ct.gov/drs/cwp/view.asp?a=1433&q=276548&drsPNavCtr=|40959|409
61|&pp=12&n=1
060325 234852 fetching
http://tax.ohio.gov/RoboHTML/Frequently%20Asked%20Questions/!SSL!/Blue_V
evet/whgdata/whnvf30.htm
060325 234852 78 pages, 28 errors, 1.1 pages/s, 177 kb/s, 
060325 234853 fetching
http://www.sctax.org/?wbc_purpose=Basic%23misc%23saluse%23alter%23saluse
%23saluse%23EPay
060325 234853 fetching
http://www.revenue.state.ne.us/tax/current/fill-in/f_27d.pdf
060325 234853 80 pages, 28 errors, 1.1 pages/s, 179 kb/s, 
060325 234853 fetching
http://cfo.dc.gov/otr/cwp/view,A,1330,Q,593687.asp
060325 234854 fetch of
http://www.revenue.state.ne.us/tax/current/fill-in/f_27d.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234854 fetching
http://www.revenue.state.pa.us/revenue/guestaccount/registration_form.as
p
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234855 fetch of http://www.myflorida.com/dor/tips/text/css failed
with: java.io.EOFException
060325 234855 81 pages, 29 errors, 1.1 pages/s, 177 kb/s, 
060325 234855 fetching http://www.tax.virginia.gov/business
060325 234855 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=190&Q=206958
060325 234855 fetching
http://www.access.gpo.gov/uscode/title26/subtitlek_chapter100_subchapter
a_.html
060325 234856 81 pages, 30 errors, 1.1 pages/s, 175 kb/s, 
060325 234856 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594065,otrNav_gid,1679,otrNav,|3
3280|33288|.asp
060325 234856 fetching
http://www.state.sd.us/drr2/reg/abstracters/AbstractADAApp.pdf
060325 234857 fetching
http://cfo.dc.gov/otr/cwp/view,A,1330,Q,594261,otrNav,|33280|.asp
060325 234857 85 pages, 30 errors, 1.1 pages/s, 176 kb/s, 
060325 234858 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter72_subchaptera
_.html
060325 234859 85 pages, 30 errors, 1.1 pages/s, 174 kb/s, 
060325 234859 fetching http://www.state.nm.us/tax/ptd/relalink.htm
060325 234900 fetching http://www.myflorida.com/dor/news/text/css
java.io.EOFException
	at
org.apache.nutch.protocol.http.HttpResponse.readLine(HttpResponse.java:4
23)
	at
org.apache.nutch.protocol.http.HttpResponse.parseStatusLine(HttpResponse
.java:315)
	at
org.apache.nutch.protocol.http.HttpResponse.<init>(HttpResponse.java:143
)
	at org.apache.nutch.protocol.http.Http.getResponse(Http.java:61)
	at
org.apache.nutch.protocol.http.api.HttpBase.getProtocolOutput(HttpBase.j
ava:176)
	at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:129)
060325 234900 fetch of http://www.myflorida.com/dor/news/text/css failed
with: java.io.EOFException
060325 234900 86 pages, 30 errors, 1.1 pages/s, 173 kb/s, 
060325 234900 fetching
http://www.sctax.org/News+Releases/?wbc_purpose=Basic%23misc%23saluse%23
alter%23saluse%23saluse%23EPay
060325 234901 86 pages, 31 errors, 1.1 pages/s, 171 kb/s, 
060325 234901 fetching
http://www.mstc.state.ms.us/taxareas/misc/Laws/gas/HTML%20Files/sections
.htm
060325 234901 fetching
http://dor.wa.gov/Content/doingbusiness/filing/efile/
060325 234901 fetch of
http://www.state.sd.us/drr2/reg/abstracters/AbstractADAApp.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234901 fetching
http://www.mstc.state.ms.us/info/employ/page3.html
060325 234901 fetching
http://www.tax.ri.gov/news/RI%20TAX%20NEWS-WINTER%202003_files/editdata.
mso
060325 234901 fetching
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/information/TPI_Agenda
_2006.pdf
060325 234902 90 pages, 32 errors, 1.1 pages/s, 169 kb/s, 
060325 234902 fetching http://www.dor.state.wi.us/forms/misc/a-222f.pdf
060325 234903 fetching http://dor.wa.gov/content/brd/
060325 234904 91 pages, 32 errors, 1.1 pages/s, 165 kb/s, 
060325 234904 fetching
http://cfo.dc.gov/otr/site/default.asp?element=top
060325 234905 92 pages, 32 errors, 1.1 pages/s, 166 kb/s, 
060325 234906 fetch of http://www.dor.state.wi.us/forms/misc/a-222f.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234906 fetching http://www.revenue.nh.gov/dcrs/index.htm
060325 234906 92 pages, 33 errors, 1.1 pages/s, 164 kb/s, 
060325 234906 fetching
http://www.mstc.state.ms.us/taxareas/misc/Laws/oil/HTML%20Files/sections
.htm
060325 234907 fetching
http://www.revenue.state.ne.us/gaming/statute/gambling.pdf
060325 234907 fetching
http://cfo.dc.gov/otr/site/nav_styled.asp?group=1679&openstr=|33280|3328
8|
060325 234908 95 pages, 33 errors, 1.1 pages/s, 166 kb/s, 
060325 234908 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter61_subchapterb
_.html
060325 234908 fetching
http://www.access.gpo.gov/uscode/title26/subtitlek_chapter100_subchapter
c_.html
060325 234908 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter70_subchapterb
_.html
060325 234908 fetching
http://www.state.sd.us/drr2/reg/appraisers/+document.lastModified+
060325 234909 98 pages, 33 errors, 1.1 pages/s, 167 kb/s, 
060325 234910 fetching
http://www.revenue.state.ne.us/tax/current/fill-in/f_1120n.pdf
060325 234910 fetch of
http://www.revenue.state.ne.us/gaming/statute/gambling.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234910 99 pages, 33 errors, 1.1 pages/s, 166 kb/s, 
060325 234910 fetching
http://tax.ohio.gov/RoboHTML/Frequently%20Asked%20Questions/!SSL!/Blue_V
evet/whfdhtml.htm
060325 234911 fetching http://www.rev.state.la.us/sections/individual/
060325 234911 100 pages, 34 errors, 1.1 pages/s, 164 kb/s, 
060325 234911 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter65_subchapterb
_.html
060325 234912 fetching http://www.tax.virginia.gov/ireg
060325 234912 fetching
http://www.mstc.state.ms.us/info/employ/page2.html
060325 234913 102 pages, 34 errors, 1.1 pages/s, 164 kb/s, 
060325 234913 fetching
http://www.state.nj.us/treasury/revenue/dcr/geninfo/fees_pd.html
060325 234914 fetching http://www.tax.state.ak.us/Forms/index.htm
060325 234914 103 pages, 34 errors, 1.1 pages/s, 162 kb/s, 
060325 234914 fetching http://www.tax.state.ak.us/FAQ/FAQ.HTM
060325 234915 fetching
http://tax.ohio.gov/RoboHTML/Frequently%20Asked%20Questions/!SSL!/Blue_V
evet/whtdhtml.htm
060325 234915 105 pages, 34 errors, 1.1 pages/s, 164 kb/s, 
060325 234916 fetching
http://dor.wa.gov/content/home/search/default.aspx?viewas=graphic
060325 234916 fetch of
http://www.revenue.state.ne.us/tax/current/fill-in/f_1120n.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234916 fetching
http://www.state.sd.us/drr2/reg/abstracters/AbstractADAProp.pdf
060325 234917 106 pages, 35 errors, 1.1 pages/s, 162 kb/s, 
060325 234917 fetching
http://www.access.gpo.gov/uscode/title26/subtitlek_chapter100_subchapter
b_.html
060325 234917 fetching http://www.tax.virginia.gov/ind
060325 234917 fetching
http://www.revenue.state.ne.us/news_rel/oct_05/fuel_waiver.htm
060325 234918 108 pages, 35 errors, 1.1 pages/s, 164 kb/s, 
060325 234918 fetching
http://www.revenue.state.ne.us/news_rel/aug_05/nr0805.htm
060325 234918 fetching
http://www.revenue.state.az.us/ADOR_Forms/90-99/91-0111f.pdf
060325 234919 110 pages, 35 errors, 1.1 pages/s, 161 kb/s, 
060325 234919 fetch of
http://www.revenue.state.az.us/ADOR_Forms/90-99/91-0111f.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234919 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594254.asp
060325 234920 fetching
http://www.mstc.state.ms.us/taxareas/petrol/booklet.pdf
060325 234920 fetching
http://www.mstc.state.ms.us/taxareas/misc/Laws/unfcig/HTML%20Files/secti
ons.htm
060325 234921 111 pages, 36 errors, 1.1 pages/s, 161 kb/s, 
060325 234921 fetching
http://www.state.sd.us/drr2/reg/appraisers/+document.location+
060325 234922 fetching
http://www.access.gpo.gov/uscode/title26/subtitlea_chapter3_subchaptera_
.html
060325 234922 113 pages, 36 errors, 1.1 pages/s, 158 kb/s, 
060325 234923 113 pages, 36 errors, 1.1 pages/s, 157 kb/s, 
060325 234923 fetching
http://www.dor.state.ma.us/rul_reg/tir/TIR98_8.htm
060325 234923 fetching http://www.tax.state.ak.us/PRICES/index.htm
060325 234923 fetching http://www.tax.state.ak.us/PRODUCTION/index.htm
060325 234925 114 pages, 36 errors, 1.1 pages/s, 155 kb/s, 
060325 234925 fetching
http://www.revenue.state.ne.us/news_rel/jul_05/nr0705.htm
060325 234925 fetch of
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/information/TPI_Agenda
_2006.pdf failed with: java.lang.NoClassDefFoundError:
org/apache/log4j/Layout
060325 234925 fetching
http://www.access.gpo.gov/uscode/title26/subtitlea_chapter6_subchapterb_
.html
060325 234926 114 pages, 37 errors, 1.1 pages/s, 153 kb/s, 
060325 234926 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter61_subchaptera
_.html
060325 234927 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter62_subchaptera
_.html
060325 234927 fetch of
http://www.mstc.state.ms.us/taxareas/petrol/booklet.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234927 115 pages, 37 errors, 1.1 pages/s, 153 kb/s, 
060325 234927 fetching
http://www.tennessee.gov/revenue/streamlined/streamlinedtraining1005_fil
es/frame.htm
060325 234928 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594093.asp
060325 234929 116 pages, 38 errors, 1.1 pages/s, 152 kb/s, 
060325 234929 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter70_subchaptera
_.html
060325 234929 fetch of
http://www.state.sd.us/drr2/reg/abstracters/AbstractADAProp.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234929 fetching
http://www.tax.state.ak.us/AnnualReports/reports.htm
060325 234930 117 pages, 39 errors, 1.1 pages/s, 150 kb/s, 
060325 234930 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter65_subchaptera
_.html
060325 234931 fetching http://www.ustaxcourt.gov/Docket.htm
060325 234931 fetching
http://www.access.gpo.gov/uscode/title26/subtitlea_chapter6_subchaptera_
.html
060325 234932 119 pages, 39 errors, 1.1 pages/s, 150 kb/s, 
060325 234932 fetching
http://www.mstc.state.ms.us/taxareas/misc/Laws/waste/HTML%20Files/sectio
ns.htm
060325 234933 120 pages, 39 errors, 1.1 pages/s, 149 kb/s, 
060325 234933 fetching
http://www.revenue.state.ne.us/news_rel/oct_04/nr1004.htm
060325 234934 120 pages, 39 errors, 1.1 pages/s, 147 kb/s, 
060325 234934 fetching
http://www.mstc.state.ms.us/taxareas/misc/Laws/tobacco/HTML%20Files/sect
ions.htm
060325 234935 121 pages, 39 errors, 1.1 pages/s, 145 kb/s, 
060325 234936 fetching
http://cfo.dc.gov/otr/cwp/view,A,1330,Q,634798.asp
060325 234936 fetching
http://www.tax.state.ny.us/pubs_and_bulls/publications/withholding_pubs.
htm
060325 234937 122 pages, 39 errors, 1.1 pages/s, 144 kb/s, 
060325 234937 fetching http://www.revenue.state.ne.us/fuels/f_84_tbl.pdf
060325 234937 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594107.asp
060325 234938 124 pages, 39 errors, 1.1 pages/s, 146 kb/s, 
060325 234938 fetching
http://www.state.sd.us/drr2/reg/securities/essaycontest/EntryFormSurvey.
doc
060325 234939 fetching
http://www.state.nm.us/tax/News/pr020201_files/oledata.mso
060325 234939 125 pages, 39 errors, 1.1 pages/s, 145 kb/s, 
060325 234939 fetching
http://www.revenue.state.pa.us/revenue/cwp/eMailPage.asp?a=3&t=Revenue%3
A+Requesting+an+Extension+of+Time+to+File+State+Income+Tax+Returns&URL=h
ttp%3A%2F%2Fwww%2Erevenue%2Estate%2Epa%2Eus%2Frevenue%2Fcwp%2Fview%2Easp
%3FA%3D3%26Q%3D257183
060325 234939 fetching
http://cfo.dc.gov/cfo/cwp/view,a,1327,q,590999.asp
060325 234939 fetching
http://cfo.dc.gov/otr/cwp/view,a,11,q,634686,otrNav_GID,1678.asp
060325 234940 128 pages, 39 errors, 1.1 pages/s, 145 kb/s, 
060325 234941 128 pages, 39 errors, 1.1 pages/s, 144 kb/s, 
060325 234942 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=3&Q=257183&pp=0
060325 234942 fetch of http://www.revenue.state.ne.us/fuels/f_84_tbl.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234942 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594261.asp
060325 234943 129 pages, 40 errors, 1.0 pages/s, 142 kb/s, 
060325 234944 fetching
http://www.mstc.state.ms.us/taxareas/misc/9996197.pdf
060325 234944 fetch of
http://www.mstc.state.ms.us/taxareas/misc/9996197.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234944 130 pages, 40 errors, 1.0 pages/s, 142 kb/s, 
060325 234944 fetching
http://dor.wa.gov/content/home/aboutus/ServiceCommitment.aspx
060325 234945 fetching
http://www.myflorida.com/dor/forms/2006/dr15dss.pdf
060325 234946 131 pages, 41 errors, 1.0 pages/s, 141 kb/s, 
060325 234946 fetching
http://cfo.dc.gov/otr/frames.asp?doc=/otr/lib/otr/tax/ucc/ucc5.pdf&group
=1679&open=|33280|33288|
060325 234946 fetch of
http://www.myflorida.com/dor/forms/2006/dr15dss.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234946 fetching
http://www.state.nm.us/tax/pubs/TaxreseStat/gross%20receipts%20by%20SIC_
files/editdata.mso
060325 234946 fetching
http://www.state.sd.us/drr2/reg/bank/licensees/State%20Banks%20for%20Int
ernet.rtf
060325 234947 132 pages, 42 errors, 1.0 pages/s, 143 kb/s, 
060325 234947 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,594268.asp
060325 234947 ParserFactory: Plugin:
org.apache.nutch.parse.rtf.RTFParseFactory mapped to contentType
text/rtf via parse-plugins.xml, but not enabled via plugin.includes in
nutch-default.xml
060325 234947 ParserFactory: Plugin:
org.apache.nutch.parse.msword.MSWordParser mapped to contentType
text/rtf via parse-plugins.xml, but not enabled via plugin.includes in
nutch-default.xml
060325 234947 fetching http://www.rev.state.la.us/sections/business/
060325 234948 134 pages, 42 errors, 1.1 pages/s, 144 kb/s, 
060325 234948 fetching
http://dor.wa.gov/content/home/newsroom/default.aspx?viewas=graphic
060325 234949 fetching
http://www.state.sd.us/drr2/reg/gaming/notice_of_meeting_031406.pdf
060325 234949 136 pages, 42 errors, 1.1 pages/s, 143 kb/s, 
060325 234951 136 pages, 42 errors, 1.0 pages/s, 141 kb/s, 
060325 234952 fetching
http://www.tax.state.ak.us/SourcesBook/SOURCES.htm
060325 234952 fetching
http://www.revenue.state.ne.us/news_rel/oct_05/nr1005.htm
060325 234952 fetching
http://www.state.nm.us/tax/News/pr020201_files/editdata.mso
060325 234952 fetching
http://www.revenue.state.ne.us/news_rel/oct_04/amnesty_rel2.htm
060325 234952 fetching
http://www.state.nm.us/tax/News/pr020201_files/filelist.xml
060325 234953 fetch of
http://www.state.sd.us/drr2/reg/gaming/notice_of_meeting_031406.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234953 139 pages, 42 errors, 1.1 pages/s, 145 kb/s, 
060325 234953 fetching
http://www.dor.state.nc.us/publications/abstract/Table1.pdf
060325 234953 fetch of
http://www.dor.state.nc.us/publications/abstract/Table1.pdf failed with:
java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 234953 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=3&Q=257183&pp=12&n
=1
060325 234954 fetching
http://www.access.gpo.gov/uscode/title26/subtitlef_chapter62_subchapterb
_.html
060325 234954 140 pages, 44 errors, 1.0 pages/s, 143 kb/s, 
060325 234955 fetching
http://www.state.sd.us/drr2/reg/securities/essaycontest/contestrules.htm
060325 234956 141 pages, 44 errors, 1.0 pages/s, 141 kb/s, 
060325 234957 141 pages, 44 errors, 1.0 pages/s, 140 kb/s, 
060325 234957 fetching http://www.state.nm.us/tax/forms/mvd/mvd10353.pdf
060325 234957 fetching http://www.in.gov/dor/taxforms/state.html
060325 234958 fetching
http://www.policylibrary.tax.virginia.gov/OTP/Policy.nsf/enable?OpenPage
060325 234958 fetching
http://www.state.nm.us/tax/pubs/TaxreseStat/gross%20receipts%20by%20SIC_
files/filelist.xml
060325 234958 fetching
http://www.policylibrary.tax.virginia.gov/OTP/Policy.nsf/Enable%20JavaSc
ript?OpenPage&BaseTarget=MainFrame&Click=
060325 234959 145 pages, 44 errors, 1.1 pages/s, 143 kb/s, 
060325 234959 fetching
http://www.mstc.state.ms.us/abc/forms/SpecialNotice_files/filelist.xml
060325 234959 fetching
http://cfo.dc.gov/otr/cwp/view,a,1330,q,593862.asp
060325 235000 146 pages, 44 errors, 1.1 pages/s, 142 kb/s, 
060325 235000 fetching
http://www.state.sd.us/drr2/forms/Banking/nonE0072V1-TrustCompanyApplica
tion.pdf
060325 235001 147 pages, 44 errors, 1.0 pages/s, 141 kb/s, 
060325 235002 147 pages, 44 errors, 1.0 pages/s, 140 kb/s, 
060325 235002 fetching
http://www.tax.virginia.gov/site.cfm?alias=SoftwareDevelopers
060325 235003 fetching
http://www.state.sd.us/drr2/reg/securities/essaycontest/prizes.htm
060325 235003 fetching
http://cfo.dc.gov/cfo/cwp/view,a,1327,q,591006.asp
060325 235004 149 pages, 44 errors, 1.0 pages/s, 138 kb/s, 
060325 235004 fetching http://www.revenue.state.co.us/includes/java.htm
060325 235005 fetching
http://www.revenue.state.pa.us/revenue/cwp/view.asp?A=3&Q=257183&tx=1
060325 235005 fetching
http://www.state.sd.us/drr2/publications/annrpt/2001%20annual.pdf
060325 235005 151 pages, 44 errors, 1.0 pages/s, 138 kb/s, 
060325 235005 fetching
http://www.access.gpo.gov/uscode/title26/subtitlea_chapter3_subchapterb_
.html
060325 235007 153 pages, 44 errors, 1.0 pages/s, 137 kb/s, 
060325 235008 fetching http://www.oregon.gov/DOR/salestax.shml
060325 235008 fetching
http://www.tax.state.ak.us/divisions/director_dickinson.htm
060325 235009 154 pages, 44 errors, 1.0 pages/s, 137 kb/s, 
060325 235009 fetching
http://ftb.ca.gov/professionals/taxnews/tn_06/03_04.html
060325 235009 fetch of http://www.state.nm.us/tax/forms/mvd/mvd10353.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 235009 fetching
http://www.revenue.state.ne.us/news_rel/jan_06/nr0106.htm
060325 235009 fetching
http://www.revenue.state.co.us/racing_dir/?sitesearch=www.revenue.state.
co.us&domains=www.revenue.state.co.us&q=
060325 235010 155 pages, 45 errors, 1.0 pages/s, 137 kb/s, 
060325 235010 fetching
http://www.state.nm.us/tax/pubs/TaxreseStat/03PITStatsLite.pdf
060325 235010 fetch of
http://www.revenue.state.co.us/racing_dir/?sitesearch=www.revenue.state.
co.us&domains=www.revenue.state.co.us&q= failed with: Http code=403,
url=http://www.revenue.state.co.us/racing_dir/?sitesearch=www.revenue.st
ate.co.us&domains=www.revenue.state.co.us&q=
060325 235010 fetching
http://www.revenue.state.az.us/ResearchStats/Proposedrules.htm
060325 235011 fetching http://www.tax.state.ak.us/contactus.htm
060325 235011 fetching
http://tax.utah.gov/cgi-bin/ksearch.cgi?terms=&p=1&help=1&display=10&sor
t=Scores
060325 235011 156 pages, 46 errors, 1.0 pages/s, 138 kb/s, 
060325 235011 fetching http://www.myflorida.com/dor/forms/2006/dr1.pdf
060325 235012 159 pages, 46 errors, 1.0 pages/s, 138 kb/s, 
060325 235014 159 pages, 46 errors, 1.0 pages/s, 137 kb/s, 
060325 235014 fetching http://www.in.gov/dor/news/2006/){this.value=
060325 235014 fetch of http://www.myflorida.com/dor/forms/2006/dr1.pdf
failed with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 235014 fetching
http://www.state.nm.us/tax/pubs/TaxreseStat/gross%20receipts%20by%20SIC_
files/oledata.mso
060325 235014 fetching
http://cfo.dc.gov/otr/cwp/view,A,1330,Q,593925.asp
060325 235014 fetching
http://www.state.sd.us/drr2/reg/abstracters/abstractcom.pdf
060325 235014 fetching
http://www.state.nm.us/tax/mvd/vintsc_files/editdata.mso
060325 235015 160 pages, 47 errors, 1.0 pages/s, 136 kb/s, 
060325 235015 fetch of
http://www.state.nm.us/tax/pubs/TaxreseStat/03PITStatsLite.pdf failed
with: java.lang.NoClassDefFoundError: org/apache/log4j/Layout
060325 235015 fetching http://www.mstc.state.ms.us/ordering/request.htm
060325 235016 fetching
http://www.state.nm.us/tax/mvd/vintsc_files/filelist.xml
060325 235016 fetching
http://www.state.sd.us/drr2/reg/bank/licensees/Trust%20Company%20Interne
t%20List.rtf
060325 235017 162 pages, 48 errors, 1.0 pages/s, 135 kb/s, 
060325 235019 162 pages, 48 errors, 1.0 pages/s, 134 kb/s, 
060325 235019 fetching
http://cfo.dc.gov/cfo/cwp/view,a,1327,q,591013.asp
060325 235020 163 pages, 48 errors, 1.0 pages/s, 133 kb/s, 
060325 235020 fetching
http://www.revenue.state.pa.us/revenue/cwp/browse.asp?A=180&Q=34803
060325 235021 fetching
http://ftb.ca.gov/individuals/hoh/PrYear/selftest/definitions.html