Posted to user@nutch.apache.org by nutch_newbie <ka...@hotmail.com> on 2008/06/12 16:19:42 UTC

Nutch- crawling?

I ran the crawler, and it seemed to run fine. At localhost:8080/nutch-0.8.1
the Nutch search page is displayed, but whenever I search for something,
the results always say "Hits 0-0 (out of about 0 total matching pages): ".
Here is the part of my crawl-urlfilter.txt that I modified:

# accept hosts in MY.DOMAIN.NAME
+^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
+^http://([a-z0-9]*\.)*www.en.wikipedia.org
+^http://([a-z0-9]*\.)*www.google.com
+^http://([a-z0-9]*\.)*www.search.yahoo.com/
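
One way to sanity-check rules like these is to run them against the URLs you expect to crawl. The sketch below is not Nutch code. It assumes each `+` line is a regular expression that is searched for in the URL and that a URL matching no accept rule is rejected. The sample URLs are hypothetical seeds, not taken from this post.

```python
import re

# Accept rules copied from the crawl-urlfilter.txt excerpt above
# (leading "+" removed). The dots are left unescaped, as in the post.
ACCEPT_RULES = [
    r"^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/",
    r"^http://([a-z0-9]*\.)*www.en.wikipedia.org",
    r"^http://([a-z0-9]*\.)*www.google.com",
    r"^http://([a-z0-9]*\.)*www.search.yahoo.com/",
]


def accepted(url, rules=ACCEPT_RULES):
    """Return True if any accept rule is found in the URL."""
    return any(re.search(rule, url) for rule in rules)


# Hypothetical seed URLs, for illustration only.
SAMPLE_URLS = [
    "http://en.wikipedia.org/wiki/Horse",
    "http://www.en.wikipedia.org/wiki/Horse",
    "http://www.google.com/",
    "http://search.yahoo.com/",
]

if __name__ == "__main__":
    for url in SAMPLE_URLS:
        print("accept" if accepted(url) else "reject", url)
```

Under these assumptions, a URL on `en.wikipedia.org` or `search.yahoo.com` is rejected, because the rules require a literal `www` in front of the host name. If the seed URLs do not pass the filter, nothing is fetched or indexed, which would be consistent with a search returning zero hits.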

What else am I supposed to do? I'm really confused and running short on
time. Any and all help would be greatly appreciated. Thanks in advance.

PS: My computer runs Linux (FC5), but the folders and config files are still
the same. I also tried restarting Tomcat, which didn't help.


-- 
View this message in context: http://www.nabble.com/Nutch--crawling--tp17801131p17801131.html
Sent from the Nutch - User mailing list archive at Nabble.com.


Re: Nutch- crawling?

Posted by nutch_newbie <ka...@hotmail.com>.
I restarted Tomcat. Here is the tail end of catalina.out:

Jun 12, 2008 10:52:27 AM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:52:28 AM org.apache.catalina.core.StandardService stop
INFO: Stopping service Catalina
Jun 12, 2008 10:52:29 AM org.apache.coyote.http11.Http11BaseProtocol destroy
INFO: Stopping Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:52:29 AM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: Failed shutdown of Apache Portable Runtime
Jun 12, 2008 10:52:33 AM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 10:52:33 AM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:52:33 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 489 ms
Jun 12, 2008 10:52:33 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 10:52:33 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 10:52:33 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 10:52:33 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 10:52:34 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:52:34 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 12, 2008 10:52:34 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/15  config=null
Jun 12, 2008 10:52:34 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 12, 2008 10:52:34 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 1072 ms
2008-06-12 10:52:58,788 INFO  Configuration - parsing
jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-12 10:52:58,845 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-12 10:52:58,870 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-12 10:52:58,872 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-12 10:52:58,879 INFO  PluginRepository - Plugins: looking in:
/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-12 10:52:58,974 INFO  PluginRepository - Plugin Auto-activation
mode: [true]
2008-06-12 10:52:58,974 INFO  PluginRepository - Registered Plugins:
2008-06-12 10:52:58,974 INFO  PluginRepository - 	the nutch core extension
points (nutch-extensionpoints)
2008-06-12 10:52:58,974 INFO  PluginRepository - 	Basic Query Filter
(query-basic)
2008-06-12 10:52:58,974 INFO  PluginRepository - 	Basic Indexing Filter
(index-basic)
2008-06-12 10:52:58,974 INFO  PluginRepository - 	Html Parse Plug-in
(parse-html)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Site Query Filter
(query-site)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Basic Summarizer Plug-in
(summary-basic)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Text Parse Plug-in
(parse-text)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Regex URL Filter
(urlfilter-regex)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Http Protocol Plug-in
(protocol-http)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	OPIC Scoring Plug-in
(scoring-opic)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	CyberNeko HTML Parser
(lib-nekohtml)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	JavaScript Parser
(parse-js)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	URL Query Filter
(query-url)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Regex URL Filter Framework
(lib-regex-filter)
2008-06-12 10:52:58,975 INFO  PluginRepository - Registered
Extension-Points:
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Summarizer
(org.apache.nutch.searcher.Summarizer)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Protocol
(org.apache.nutch.protocol.Protocol)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Analysis
(org.apache.nutch.analysis.NutchAnalyzer)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch URL Filter
(org.apache.nutch.net.URLFilter)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Indexing Filter
(org.apache.nutch.indexer.IndexingFilter)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Online Search
Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	HTML Parse Filter
(org.apache.nutch.parse.HtmlParseFilter)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Content Parser
(org.apache.nutch.parse.Parser)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Scoring
(org.apache.nutch.scoring.ScoringFilter)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Nutch Query Filter
(org.apache.nutch.searcher.QueryFilter)
2008-06-12 10:52:58,975 INFO  PluginRepository - 	Ontology Model Loader
(org.apache.nutch.ontology.Ontology)
2008-06-12 10:52:58,984 INFO  NutchBean - creating new bean
2008-06-12 10:52:58,992 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 10:52:59,037 INFO  Configuration - found resource
common-terms.utf8 at
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-12 10:52:59,043 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 10:52:59,055 INFO  SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2008-06-12 10:52:59,055 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 10:52:59,060 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 10:52:59,073 INFO  NutchBean - query: grass
2008-06-12 10:52:59,073 INFO  NutchBean - lang: en
2008-06-12 10:52:59,115 INFO  NutchBean - searching for 20 raw hits
2008-06-12 10:52:59,149 INFO  NutchBean - total hits: 0

:confused:
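
For a longer catalina.out, a small script can pair each query with its hit count instead of reading the file by eye. This is a minimal sketch that relies only on the `NutchBean - query:` and `NutchBean - total hits:` line formats visible in the log above. The function name is illustrative, and the log path is passed on the command line rather than assumed.

```python
import re
import sys

# Patterns for the two NutchBean line types shown in the log above.
# "query request from ..." lines do not match QUERY_RE because of the colon.
QUERY_RE = re.compile(r"NutchBean - query: ?(.*)$")
HITS_RE = re.compile(r"NutchBean - total hits: (\d+)")


def summarize(lines):
    """Pair each 'query:' line with the next 'total hits:' line."""
    results = []
    pending = None
    for line in lines:
        line = line.rstrip("\n")
        match = QUERY_RE.search(line)
        if match:
            pending = match.group(1).strip()
            continue
        match = HITS_RE.search(line)
        if match and pending is not None:
            results.append((pending, int(match.group(1))))
            pending = None
    return results


if __name__ == "__main__":
    # Usage: python summarize_hits.py /path/to/catalina.out
    for path in sys.argv[1:]:
        with open(path) as handle:
            for query, hits in summarize(handle):
                print(f"{hits:6d}  {query}")
```

If every query reports zero hits, as it does here, the search side is probably reading an empty or missing index rather than failing on particular terms.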


Jason Boss wrote:
> Stop Tomcat and then restart it.  paste the tail end of catalina.out
> after you search.  Also make sure you are starting tomcat in the
> directory where you have your crawl and that your config files are
> setup correct for a local or dfs search.
>
> Jason


Re: Nutch- crawling?

Posted by Jason Boss <jb...@gmail.com>.
Stop Tomcat and then restart it. Paste the tail end of catalina.out
after you search. Also make sure you are starting Tomcat in the
directory where you have your crawl, and that your config files are
set up correctly for a local or DFS search.

Jason
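
The catalina.out output in this thread reports `opening indexes in crawl/indexes`, which is a relative path, so the directory Tomcat is started from matters. The sketch below assumes the three subdirectories named in the log (`crawl/indexes`, `crawl/segments`, `crawl/linkdb`) are what the web application needs, and reports which of them are missing under a given starting directory. The function name is illustrative. In Nutch the crawl location is normally set by the `searcher.dir` property, which is worth verifying against your own nutch-default.xml and nutch-site.xml.

```python
import os

# Relative paths taken from the NutchBean log lines in this thread.
EXPECTED_DIRS = ["crawl/indexes", "crawl/segments", "crawl/linkdb"]


def missing_crawl_dirs(start_dir="."):
    """Return the expected crawl subdirectories that do not exist under start_dir."""
    return [
        rel for rel in EXPECTED_DIRS
        if not os.path.isdir(os.path.join(start_dir, rel))
    ]


if __name__ == "__main__":
    here = os.getcwd()
    missing = missing_crawl_dirs(here)
    if missing:
        print("Missing under", here + ":", ", ".join(missing))
    else:
        print("All crawl directories found under", here)
```

Run it from the directory in which Tomcat is started. Anything it lists as missing is a path the search application would not find when opened relative to that directory.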


On Thu, Jun 12, 2008 at 8:36 AM, nutch_newbie <ka...@hotmail.com> wrote:
>
>
> Which ones? there are 11 of them in the "logs" folder...
>
> here is one of them: catalina.out:
>
> "Jun 11, 2008 1:11:03 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 11, 2008 1:11:03 PM org.apache.coyote.http11.Http11BaseProtocol init
> SEVERE: Error initializing endpoint
> java.net.BindException: Address already in use:8080
>        at
> org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
>        at
> org.apache.coyote.http11.Http11BaseProtocol.init(Http11BaseProtocol.java:138)
>        at org.apache.catalina.connector.Connector.initialize(Connector.java:1016)
>        at
> org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
>        at
> org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
> Jun 11, 2008 1:11:03 PM org.apache.catalina.startup.Catalina load
> SEVERE: Catalina.start
> LifecycleException:  Protocol handler initialization failed:
> java.net.BindException: Address already in use:8080
>        at org.apache.catalina.connector.Connector.initialize(Connector.java:1018)
>        at
> org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
>        at
> org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
> Jun 11, 2008 1:11:03 PM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 491 ms
> Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 11, 2008 1:11:04 PM org.apache.coyote.http11.Http11BaseProtocol start
> SEVERE: Error starting endpoint
> java.net.BindException: Address already in use:8080
>        at
> org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
>        at
> org.apache.tomcat.util.net.PoolTcpEndpoint.startEndpoint(PoolTcpEndpoint.java:312)
>        at
> org.apache.coyote.http11.Http11BaseProtocol.start(Http11BaseProtocol.java:150)
>        at org.apache.coyote.http11.Http11Protocol.start(Http11Protocol.java:75)
>        at org.apache.catalina.connector.Connector.start(Connector.java:1089)
>        at org.apache.catalina.core.StandardService.start(StandardService.java:459)
>        at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 11, 2008 1:11:04 PM org.apache.catalina.startup.Catalina start
> SEVERE: Catalina.start:
> LifecycleException:  service.getName(): "Catalina";  Protocol handler start
> failed: java.net.BindException: Address already in use:8080
>        at org.apache.catalina.connector.Connector.start(Connector.java:1096)
>        at org.apache.catalina.core.StandardService.start(StandardService.java:459)
>        at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 11, 2008 1:11:04 PM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 822 ms
> Jun 11, 2008 1:11:04 PM org.apache.catalina.core.StandardServer await
> SEVERE: StandardServer.await: create[8005]:
> java.net.BindException: Address already in use
>        at java.net.PlainSocketImpl.socketBind(Native Method)
>        at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:359)
>        at java.net.ServerSocket.bind(ServerSocket.java:319)
>        at java.net.ServerSocket.<init>(ServerSocket.java:185)
>        at org.apache.catalina.core.StandardServer.await(StandardServer.java:372)
>        at org.apache.catalina.startup.Catalina.await(Catalina.java:615)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:575)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 11, 2008 1:11:04 PM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:11:04 PM org.apache.catalina.connector.Connector pause
> SEVERE: Protocol handler pause failed
> java.lang.NullPointerException
>        at org.apache.jk.server.JkMain.pause(JkMain.java:677)
>        at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
>        at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
>        at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
>        at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
>        at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
>        at
> org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
> Jun 11, 2008 1:21:40 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 11, 2008 1:21:40 PM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:21:40 PM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 506 ms
> Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 11, 2008 1:21:41 PM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:21:41 PM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 11, 2008 1:21:41 PM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/13  config=null
> Jun 11, 2008 1:21:41 PM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 11, 2008 1:21:41 PM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 957 ms
> Jun 11, 2008 1:30:33 PM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 11, 2008 1:41:00 PM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:41:01 PM org.apache.catalina.core.StandardService stop
> INFO: Stopping service Catalina
> Jun 11, 2008 1:41:01 PM org.apache.coyote.http11.Http11BaseProtocol destroy
> INFO: Stopping Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:41:01 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: Failed shutdown of Apache Portable Runtime
> Jun 11, 2008 1:41:08 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 11, 2008 1:41:08 PM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:41:08 PM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 484 ms
> Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 11, 2008 1:41:08 PM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 11, 2008 1:41:08 PM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:41:09 PM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 11, 2008 1:41:09 PM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/13  config=null
> Jun 11, 2008 1:41:09 PM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 11, 2008 1:41:09 PM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 1043 ms
> Jun 11, 2008 1:52:22 PM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:52:23 PM org.apache.catalina.core.StandardService stop
> INFO: Stopping service Catalina
> Jun 11, 2008 1:52:23 PM org.apache.coyote.http11.Http11BaseProtocol destroy
> INFO: Stopping Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:52:23 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: Failed shutdown of Apache Portable Runtime
> Jun 11, 2008 1:52:30 PM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 11, 2008 1:52:30 PM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:52:30 PM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 518 ms
> Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 11, 2008 1:52:30 PM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 11, 2008 1:52:31 PM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 1:52:31 PM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 11, 2008 1:52:31 PM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/12  config=null
> Jun 11, 2008 1:52:31 PM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 11, 2008 1:52:31 PM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 1018 ms
> 2008-06-11 13:56:49,919 INFO  Configuration - parsing
> jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
> 2008-06-11 13:56:49,928 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
> 2008-06-11 13:56:49,942 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
> 2008-06-11 13:56:49,944 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
> 2008-06-11 13:56:49,952 INFO  PluginRepository - Plugins: looking in:
> /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
> 2008-06-11 13:56:50,052 INFO  PluginRepository - Plugin Auto-activation
> mode: [true]
> 2008-06-11 13:56:50,052 INFO  PluginRepository - Registered Plugins:
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        the nutch core extension
> points (nutch-extensionpoints)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Basic Query Filter
> (query-basic)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Basic Indexing Filter
> (index-basic)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Html Parse Plug-in
> (parse-html)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Site Query Filter
> (query-site)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Basic Summarizer Plug-in
> (summary-basic)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        HTTP Framework (lib-http)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Text Parse Plug-in
> (parse-text)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Regex URL Filter
> (urlfilter-regex)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Http Protocol Plug-in
> (protocol-http)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        OPIC Scoring Plug-in
> (scoring-opic)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        CyberNeko HTML Parser
> (lib-nekohtml)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        JavaScript Parser
> (parse-js)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        URL Query Filter
> (query-url)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Regex URL Filter Framework
> (lib-regex-filter)
> 2008-06-11 13:56:50,052 INFO  PluginRepository - Registered
> Extension-Points:
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Nutch Summarizer
> (org.apache.nutch.searcher.Summarizer)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Nutch Analysis
> (org.apache.nutch.analysis.NutchAnalyzer)
> 2008-06-11 13:56:50,052 INFO  PluginRepository -        Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Nutch Indexing Filter
> (org.apache.nutch.indexer.IndexingFilter)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Nutch Online Search
> Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        HTML Parse Filter
> (org.apache.nutch.parse.HtmlParseFilter)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Nutch Content Parser
> (org.apache.nutch.parse.Parser)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Nutch Query Filter
> (org.apache.nutch.searcher.QueryFilter)
> 2008-06-11 13:56:50,053 INFO  PluginRepository -        Ontology Model Loader
> (org.apache.nutch.ontology.Ontology)
> 2008-06-11 13:56:50,061 INFO  NutchBean - creating new bean
> 2008-06-11 13:56:50,070 INFO  NutchBean - opening indexes in crawl/indexes
> 2008-06-11 13:56:50,122 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
> 2008-06-11 13:56:50,129 INFO  NutchBean - opening segments in crawl/segments
> 2008-06-11 13:56:50,144 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2008-06-11 13:56:50,144 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2008-06-11 13:56:50,151 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 13:56:50,165 INFO  NutchBean - query: horses
> 2008-06-11 13:56:50,165 INFO  NutchBean - lang: en
> 2008-06-11 13:56:50,196 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 13:56:50,231 INFO  NutchBean - total hits: 0
> 2008-06-11 13:56:57,489 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 13:56:57,490 INFO  NutchBean - query: wikipedia
> 2008-06-11 13:56:57,490 INFO  NutchBean - lang: en
> 2008-06-11 13:56:57,495 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 13:56:57,495 INFO  NutchBean - total hits: 0
> 2008-06-11 13:57:07,127 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 13:57:07,127 INFO  NutchBean - query: h
> 2008-06-11 13:57:07,127 INFO  NutchBean - lang: en
> 2008-06-11 13:57:07,129 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 13:57:07,129 INFO  NutchBean - total hits: 0
> 2008-06-11 13:57:17,289 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 13:57:17,289 INFO  NutchBean - query: nutch
> 2008-06-11 13:57:17,289 INFO  NutchBean - lang: en
> 2008-06-11 13:57:17,290 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 13:57:17,291 INFO  NutchBean - total hits: 0
> 2008-06-11 13:59:07,446 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 13:59:07,447 INFO  NutchBean - query: pest
> 2008-06-11 13:59:07,447 INFO  NutchBean - lang: en
> 2008-06-11 13:59:07,448 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 13:59:07,449 INFO  NutchBean - total hits: 0
> 2008-06-11 14:00:16,126 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 14:00:16,126 INFO  NutchBean - query: horses
> 2008-06-11 14:00:16,126 INFO  NutchBean - lang: en
> 2008-06-11 14:00:16,127 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 14:00:16,128 INFO  NutchBean - total hits: 0
> 2008-06-11 14:03:22,463 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 14:03:22,464 INFO  NutchBean - query: horse
> 2008-06-11 14:03:22,464 INFO  NutchBean - lang: en
> 2008-06-11 14:03:22,465 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 14:03:22,465 INFO  NutchBean - total hits: 0
> 2008-06-11 14:31:18,657 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 14:31:18,657 INFO  NutchBean - query: 78
> 2008-06-11 14:31:18,657 INFO  NutchBean - lang: pt
> 2008-06-11 14:31:18,658 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 14:31:18,659 INFO  NutchBean - total hits: 0
> 2008-06-11 14:31:24,065 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 14:31:24,066 INFO  NutchBean - query: 7
> 2008-06-11 14:31:24,066 INFO  NutchBean - lang: en
> 2008-06-11 14:31:24,066 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 14:31:24,067 INFO  NutchBean - total hits: 0
> 2008-06-11 18:14:33,501 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:14:33,501 INFO  NutchBean - query: horsse
> 2008-06-11 18:14:33,501 INFO  NutchBean - lang: en
> 2008-06-11 18:14:33,503 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:14:33,504 INFO  NutchBean - total hits: 0
> 2008-06-11 18:14:37,954 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:14:37,955 INFO  NutchBean - query: horse
> 2008-06-11 18:14:37,955 INFO  NutchBean - lang: en
> 2008-06-11 18:14:37,956 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:14:37,956 INFO  NutchBean - total hits: 0
> 2008-06-11 18:14:40,675 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:14:40,675 INFO  NutchBean - query: horse
> 2008-06-11 18:14:40,675 INFO  NutchBean - lang: en
> 2008-06-11 18:14:40,676 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:14:40,676 INFO  NutchBean - total hits: 0
> 2008-06-11 18:14:41,971 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:14:41,971 INFO  NutchBean - query: horse
> 2008-06-11 18:14:41,972 INFO  NutchBean - lang: en
> 2008-06-11 18:14:41,972 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:14:41,973 INFO  NutchBean - total hits: 0
> 2008-06-11 18:14:47,557 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:14:47,558 INFO  NutchBean - query: google
> 2008-06-11 18:14:47,558 INFO  NutchBean - lang: en
> 2008-06-11 18:14:47,559 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:14:47,559 INFO  NutchBean - total hits: 0
> 2008-06-11 18:45:29,142 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:45:29,143 INFO  NutchBean - query:
> 2008-06-11 18:45:29,143 INFO  NutchBean - lang: en
> 2008-06-11 18:45:29,144 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:45:29,148 INFO  NutchBean - total hits: 0
> 2008-06-11 18:47:12,968 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 18:47:12,968 INFO  NutchBean - query: horses
> 2008-06-11 18:47:12,969 INFO  NutchBean - lang: en
> 2008-06-11 18:47:12,969 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 18:47:12,970 INFO  NutchBean - total hits: 0
> 2008-06-11 21:08:27,848 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 21:08:27,848 INFO  NutchBean - query: check
> 2008-06-11 21:08:27,848 INFO  NutchBean - lang: en
> 2008-06-11 21:08:27,849 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 21:08:27,850 INFO  NutchBean - total hits: 0
> 2008-06-11 21:08:32,650 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 21:08:32,650 INFO  NutchBean - query: hits
> 2008-06-11 21:08:32,650 INFO  NutchBean - lang: en
> 2008-06-11 21:08:32,651 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 21:08:32,651 INFO  NutchBean - total hits: 0
> 2008-06-11 21:08:40,582 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-11 21:08:40,582 INFO  NutchBean - query: google
> 2008-06-11 21:08:40,582 INFO  NutchBean - lang: en
> 2008-06-11 21:08:40,583 INFO  NutchBean - searching for 20 raw hits
> 2008-06-11 21:08:40,584 INFO  NutchBean - total hits: 0
> Jun 11, 2008 9:14:01 PM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 11, 2008 9:14:01 PM org.apache.catalina.connector.Connector pause
> SEVERE: Protocol handler pause failed
> java.net.SocketException: Network is unreachable
>        at java.net.PlainSocketImpl.socketConnect(Native Method)
>        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>        at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>        at java.net.Socket.connect(Socket.java:519)
>        at java.net.Socket.connect(Socket.java:469)
>        at java.net.Socket.<init>(Socket.java:366)
>        at java.net.Socket.<init>(Socket.java:209)
>        at org.apache.jk.common.ChannelSocket.unLockSocket(ChannelSocket.java:473)
>        at org.apache.jk.common.ChannelSocket.pause(ChannelSocket.java:270)
>        at org.apache.jk.server.JkMain.pause(JkMain.java:679)
>        at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
>        at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
>        at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
>        at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
>        at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
>        at org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
> Jun 12, 2008 8:09:21 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 12, 2008 8:09:21 AM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:09:21 AM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 3557 ms
> Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 12, 2008 8:09:26 AM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 12, 2008 8:09:28 AM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:09:29 AM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 12, 2008 8:09:29 AM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/58  config=null
> Jun 12, 2008 8:09:29 AM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 12, 2008 8:09:29 AM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 7427 ms
> 2008-06-12 08:09:43,382 INFO  Configuration - parsing
> jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
> 2008-06-12 08:09:43,466 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
> 2008-06-12 08:09:43,493 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
> 2008-06-12 08:09:43,495 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
> 2008-06-12 08:09:43,515 INFO  PluginRepository - Plugins: looking in:
> /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
> 2008-06-12 08:09:43,782 INFO  PluginRepository - Plugin Auto-activation
> mode: [true]
> 2008-06-12 08:09:43,782 INFO  PluginRepository - Registered Plugins:
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        the nutch core extension
> points (nutch-extensionpoints)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Basic Query Filter
> (query-basic)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Basic Indexing Filter
> (index-basic)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Html Parse Plug-in
> (parse-html)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Site Query Filter
> (query-site)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Basic Summarizer Plug-in
> (summary-basic)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        HTTP Framework (lib-http)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Text Parse Plug-in
> (parse-text)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Regex URL Filter
> (urlfilter-regex)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Http Protocol Plug-in
> (protocol-http)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        OPIC Scoring Plug-in
> (scoring-opic)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        CyberNeko HTML Parser
> (lib-nekohtml)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        JavaScript Parser
> (parse-js)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        URL Query Filter
> (query-url)
> 2008-06-12 08:09:43,782 INFO  PluginRepository -        Regex URL Filter Framework
> (lib-regex-filter)
> 2008-06-12 08:09:43,782 INFO  PluginRepository - Registered
> Extension-Points:
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Summarizer
> (org.apache.nutch.searcher.Summarizer)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Analysis
> (org.apache.nutch.analysis.NutchAnalyzer)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Indexing Filter
> (org.apache.nutch.indexer.IndexingFilter)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Online Search
> Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        HTML Parse Filter
> (org.apache.nutch.parse.HtmlParseFilter)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Content Parser
> (org.apache.nutch.parse.Parser)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Nutch Query Filter
> (org.apache.nutch.searcher.QueryFilter)
> 2008-06-12 08:09:43,783 INFO  PluginRepository -        Ontology Model Loader
> (org.apache.nutch.ontology.Ontology)
> 2008-06-12 08:09:43,792 INFO  NutchBean - creating new bean
> 2008-06-12 08:09:43,809 INFO  NutchBean - opening indexes in crawl/indexes
> 2008-06-12 08:09:43,922 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
> 2008-06-12 08:09:43,928 INFO  NutchBean - opening segments in crawl/segments
> 2008-06-12 08:09:43,948 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2008-06-12 08:09:43,948 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2008-06-12 08:09:43,972 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:09:43,984 INFO  NutchBean - query: hellp
> 2008-06-12 08:09:43,984 INFO  NutchBean - lang: en
> 2008-06-12 08:09:44,044 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:09:44,108 INFO  NutchBean - total hits: 0
> 2008-06-12 08:09:49,223 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:09:49,224 INFO  NutchBean - query: horses
> 2008-06-12 08:09:49,224 INFO  NutchBean - lang: en
> 2008-06-12 08:09:49,225 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:09:49,225 INFO  NutchBean - total hits: 0
> Jun 12, 2008 8:26:02 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 12, 2008 8:26:02 AM org.apache.coyote.http11.Http11BaseProtocol init
> SEVERE: Error initializing endpoint
> java.net.BindException: Address already in use:8080
>        at org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
>        at org.apache.coyote.http11.Http11BaseProtocol.init(Http11BaseProtocol.java:138)
>        at org.apache.catalina.connector.Connector.initialize(Connector.java:1016)
>        at org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
>        at org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
> Jun 12, 2008 8:26:02 AM org.apache.catalina.startup.Catalina load
> SEVERE: Catalina.start
> LifecycleException:  Protocol handler initialization failed: java.net.BindException: Address already in use:8080
>        at org.apache.catalina.connector.Connector.initialize(Connector.java:1018)
>        at org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
>        at org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
>        at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
> Jun 12, 2008 8:26:02 AM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 480 ms
> Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 12, 2008 8:26:03 AM org.apache.coyote.http11.Http11BaseProtocol start
> SEVERE: Error starting endpoint
> java.net.BindException: Address already in use:8080
>        at org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
>        at org.apache.tomcat.util.net.PoolTcpEndpoint.startEndpoint(PoolTcpEndpoint.java:312)
>        at org.apache.coyote.http11.Http11BaseProtocol.start(Http11BaseProtocol.java:150)
>        at org.apache.coyote.http11.Http11Protocol.start(Http11Protocol.java:75)
>        at org.apache.catalina.connector.Connector.start(Connector.java:1089)
>        at org.apache.catalina.core.StandardService.start(StandardService.java:459)
>        at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.Catalina start
> SEVERE: Catalina.start:
> LifecycleException:  service.getName(): "Catalina";  Protocol handler start failed: java.net.BindException: Address already in use:8080
>        at org.apache.catalina.connector.Connector.start(Connector.java:1096)
>        at org.apache.catalina.core.StandardService.start(StandardService.java:459)
>        at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 893 ms
> Jun 12, 2008 8:26:03 AM org.apache.catalina.core.StandardServer await
> SEVERE: StandardServer.await: create[8005]:
> java.net.BindException: Address already in use
>        at java.net.PlainSocketImpl.socketBind(Native Method)
>        at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:359)
>        at java.net.ServerSocket.bind(ServerSocket.java:319)
>        at java.net.ServerSocket.<init>(ServerSocket.java:185)
>        at org.apache.catalina.core.StandardServer.await(StandardServer.java:372)
>        at org.apache.catalina.startup.Catalina.await(Catalina.java:615)
>        at org.apache.catalina.startup.Catalina.start(Catalina.java:575)
>        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>        at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>        at java.lang.reflect.Method.invoke(Method.java:597)
>        at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
>        at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
> Jun 12, 2008 8:26:03 AM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:26:03 AM org.apache.catalina.connector.Connector pause
> SEVERE: Protocol handler pause failed
> java.lang.NullPointerException
>        at org.apache.jk.server.JkMain.pause(JkMain.java:677)
>        at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
>        at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
>        at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
>        at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
>        at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
>        at org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
> 2008-06-12 08:26:32,086 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:26:32,089 INFO  NutchBean - query: horses
> 2008-06-12 08:26:32,089 INFO  NutchBean - lang: en
> 2008-06-12 08:26:32,090 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:26:32,091 INFO  NutchBean - total hits: 0
> 2008-06-12 08:29:09,089 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:29:09,089 INFO  NutchBean - query: horses
> 2008-06-12 08:29:09,089 INFO  NutchBean - lang: en
> 2008-06-12 08:29:09,090 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:29:09,091 INFO  NutchBean - total hits: 0
> Jun 12, 2008 8:36:45 AM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:36:46 AM org.apache.catalina.core.StandardService stop
> INFO: Stopping service Catalina
> Jun 12, 2008 8:36:47 AM org.apache.coyote.http11.Http11BaseProtocol destroy
> INFO: Stopping Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:36:47 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: Failed shutdown of Apache Portable Runtime
> Jun 12, 2008 8:36:50 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 12, 2008 8:36:50 AM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:36:50 AM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 414 ms
> Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 12, 2008 8:36:51 AM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 12, 2008 8:36:51 AM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 8:36:51 AM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 12, 2008 8:36:51 AM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/34  config=null
> Jun 12, 2008 8:36:51 AM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 12, 2008 8:36:51 AM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 968 ms
> 2008-06-12 08:37:04,238 INFO  Configuration - parsing
> jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
> 2008-06-12 08:37:04,296 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
> 2008-06-12 08:37:04,321 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
> 2008-06-12 08:37:04,322 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
> 2008-06-12 08:37:04,330 INFO  PluginRepository - Plugins: looking in:
> /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
> 2008-06-12 08:37:04,425 INFO  PluginRepository - Plugin Auto-activation
> mode: [true]
> 2008-06-12 08:37:04,425 INFO  PluginRepository - Registered Plugins:
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        the nutch core extension
> points (nutch-extensionpoints)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Basic Query Filter
> (query-basic)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Basic Indexing Filter
> (index-basic)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Html Parse Plug-in
> (parse-html)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Site Query Filter
> (query-site)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Basic Summarizer Plug-in
> (summary-basic)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        HTTP Framework (lib-http)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Text Parse Plug-in
> (parse-text)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Regex URL Filter
> (urlfilter-regex)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Http Protocol Plug-in
> (protocol-http)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        OPIC Scoring Plug-in
> (scoring-opic)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        CyberNeko HTML Parser
> (lib-nekohtml)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        JavaScript Parser
> (parse-js)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        URL Query Filter
> (query-url)
> 2008-06-12 08:37:04,425 INFO  PluginRepository -        Regex URL Filter Framework
> (lib-regex-filter)
> 2008-06-12 08:37:04,425 INFO  PluginRepository - Registered
> Extension-Points:
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Summarizer
> (org.apache.nutch.searcher.Summarizer)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Analysis
> (org.apache.nutch.analysis.NutchAnalyzer)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Indexing Filter
> (org.apache.nutch.indexer.IndexingFilter)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Online Search
> Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        HTML Parse Filter
> (org.apache.nutch.parse.HtmlParseFilter)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Content Parser
> (org.apache.nutch.parse.Parser)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Nutch Query Filter
> (org.apache.nutch.searcher.QueryFilter)
> 2008-06-12 08:37:04,426 INFO  PluginRepository -        Ontology Model Loader
> (org.apache.nutch.ontology.Ontology)
> 2008-06-12 08:37:04,435 INFO  NutchBean - creating new bean
> 2008-06-12 08:37:04,443 INFO  NutchBean - opening indexes in crawl/indexes
> 2008-06-12 08:37:04,485 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
> 2008-06-12 08:37:04,491 INFO  NutchBean - opening segments in crawl/segments
> 2008-06-12 08:37:04,504 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2008-06-12 08:37:04,504 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2008-06-12 08:37:04,510 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:37:04,522 INFO  NutchBean - query: horses
> 2008-06-12 08:37:04,522 INFO  NutchBean - lang: en
> 2008-06-12 08:37:04,564 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:37:04,607 INFO  NutchBean - total hits: 0
> 2008-06-12 08:37:15,225 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 08:37:15,226 INFO  NutchBean - query: pest
> 2008-06-12 08:37:15,226 INFO  NutchBean - lang: en
> 2008-06-12 08:37:15,229 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 08:37:15,230 INFO  NutchBean - total hits: 0
> Jun 12, 2008 9:04:11 AM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 9:04:12 AM org.apache.catalina.core.StandardService stop
> INFO: Stopping service Catalina
> Jun 12, 2008 9:04:12 AM org.apache.coyote.http11.Http11BaseProtocol destroy
> INFO: Stopping Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 9:04:12 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: Failed shutdown of Apache Portable Runtime
> Jun 12, 2008 9:04:19 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 12, 2008 9:04:20 AM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 9:04:20 AM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 481 ms
> Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 12, 2008 9:04:20 AM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 12, 2008 9:04:21 AM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 9:04:21 AM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 12, 2008 9:04:21 AM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/14  config=null
> Jun 12, 2008 9:04:21 AM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 12, 2008 9:04:21 AM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 1131 ms
> 2008-06-12 09:04:35,798 INFO  Configuration - parsing
> jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
> 2008-06-12 09:04:35,857 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
> 2008-06-12 09:04:35,882 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
> 2008-06-12 09:04:35,884 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
> 2008-06-12 09:04:35,892 INFO  PluginRepository - Plugins: looking in:
> /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
> 2008-06-12 09:04:36,021 INFO  PluginRepository - Plugin Auto-activation
> mode: [true]
> 2008-06-12 09:04:36,021 INFO  PluginRepository - Registered Plugins:
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        the nutch core extension
> points (nutch-extensionpoints)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Basic Query Filter
> (query-basic)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Basic Indexing Filter
> (index-basic)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Html Parse Plug-in
> (parse-html)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Site Query Filter
> (query-site)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Basic Summarizer Plug-in
> (summary-basic)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        HTTP Framework (lib-http)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Text Parse Plug-in
> (parse-text)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Regex URL Filter
> (urlfilter-regex)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        Http Protocol Plug-in
> (protocol-http)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        OPIC Scoring Plug-in
> (scoring-opic)
> 2008-06-12 09:04:36,021 INFO  PluginRepository -        CyberNeko HTML Parser
> (lib-nekohtml)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        JavaScript Parser
> (parse-js)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        URL Query Filter
> (query-url)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Regex URL Filter Framework
> (lib-regex-filter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository - Registered
> Extension-Points:
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Summarizer
> (org.apache.nutch.searcher.Summarizer)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Analysis
> (org.apache.nutch.analysis.NutchAnalyzer)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Indexing Filter
> (org.apache.nutch.indexer.IndexingFilter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Online Search
> Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        HTML Parse Filter
> (org.apache.nutch.parse.HtmlParseFilter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Content Parser
> (org.apache.nutch.parse.Parser)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Nutch Query Filter
> (org.apache.nutch.searcher.QueryFilter)
> 2008-06-12 09:04:36,022 INFO  PluginRepository -        Ontology Model Loader
> (org.apache.nutch.ontology.Ontology)
> 2008-06-12 09:04:36,035 INFO  NutchBean - creating new bean
> 2008-06-12 09:04:36,048 INFO  NutchBean - opening indexes in crawl/indexes
> 2008-06-12 09:04:36,090 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
> 2008-06-12 09:04:36,096 INFO  NutchBean - opening segments in crawl/segments
> 2008-06-12 09:04:36,110 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2008-06-12 09:04:36,110 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2008-06-12 09:04:36,115 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 09:04:36,127 INFO  NutchBean - query: horses
> 2008-06-12 09:04:36,127 INFO  NutchBean - lang: en
> 2008-06-12 09:04:36,171 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 09:04:36,202 INFO  NutchBean - total hits: 0
> 2008-06-12 09:16:45,571 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 09:16:45,571 INFO  NutchBean - query: horses
> 2008-06-12 09:16:45,571 INFO  NutchBean - lang: en
> 2008-06-12 09:16:45,572 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 09:16:45,573 INFO  NutchBean - total hits: 0
> 2008-06-12 09:16:48,412 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 09:16:48,412 INFO  NutchBean - query: horses
> 2008-06-12 09:16:48,412 INFO  NutchBean - lang: en
> 2008-06-12 09:16:48,413 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 09:16:48,413 INFO  NutchBean - total hits: 0
> Jun 12, 2008 9:36:23 AM org.apache.coyote.http11.Http11BaseProtocol pause
> INFO: Pausing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 9:36:23 AM org.apache.catalina.connector.Connector pause
> SEVERE: Protocol handler pause failed
> java.net.SocketException: Network is unreachable
>        at java.net.PlainSocketImpl.socketConnect(Native Method)
>        at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
>        at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
>        at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
>        at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
>        at java.net.Socket.connect(Socket.java:519)
>        at java.net.Socket.connect(Socket.java:469)
>        at java.net.Socket.<init>(Socket.java:366)
>        at java.net.Socket.<init>(Socket.java:209)
>        at org.apache.jk.common.ChannelSocket.unLockSocket(ChannelSocket.java:473)
>        at org.apache.jk.common.ChannelSocket.pause(ChannelSocket.java:270)
>        at org.apache.jk.server.JkMain.pause(JkMain.java:679)
>        at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
>        at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
>        at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
>        at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
>        at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
>        at
> org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
> Jun 12, 2008 10:30:44 AM org.apache.catalina.core.AprLifecycleListener
> lifecycleEvent
> INFO: The Apache Tomcat Native library which allows optimal performance in
> production environments was not found on the java.library.path:
> /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
> Jun 12, 2008 10:30:45 AM org.apache.coyote.http11.Http11BaseProtocol init
> INFO: Initializing Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 10:30:45 AM org.apache.catalina.startup.Catalina load
> INFO: Initialization processed in 1599 ms
> Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardService start
> INFO: Starting service Catalina
> Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardEngine start
> INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
> Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardHost start
> INFO: XML validation disabled
> Jun 12, 2008 10:30:46 AM org.apache.catalina.startup.HostConfig deployWAR
> INFO: Deploying web application archive nutch-0.8.1.war
> Jun 12, 2008 10:30:47 AM org.apache.coyote.http11.Http11BaseProtocol start
> INFO: Starting Coyote HTTP/1.1 on http-8080
> Jun 12, 2008 10:30:47 AM org.apache.jk.common.ChannelSocket init
> INFO: JK: ajp13 listening on /0.0.0.0:8009
> Jun 12, 2008 10:30:47 AM org.apache.jk.server.JkMain start
> INFO: Jk running ID=0 time=0/30  config=null
> Jun 12, 2008 10:30:47 AM org.apache.catalina.storeconfig.StoreLoader load
> INFO: Find registry server-registry.xml at classpath resource
> Jun 12, 2008 10:30:47 AM org.apache.catalina.startup.Catalina start
> INFO: Server startup in 2626 ms
> 2008-06-12 10:30:56,210 INFO  Configuration - parsing
> jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
> 2008-06-12 10:30:56,288 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
> 2008-06-12 10:30:56,317 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
> 2008-06-12 10:30:56,319 INFO  Configuration - parsing
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
> 2008-06-12 10:30:56,339 INFO  PluginRepository - Plugins: looking in:
> /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
> 2008-06-12 10:30:56,599 INFO  PluginRepository - Plugin Auto-activation
> mode: [true]
> 2008-06-12 10:30:56,599 INFO  PluginRepository - Registered Plugins:
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        the nutch core extension
> points (nutch-extensionpoints)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Basic Query Filter
> (query-basic)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Basic Indexing Filter
> (index-basic)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Html Parse Plug-in
> (parse-html)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Site Query Filter
> (query-site)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Basic Summarizer Plug-in
> (summary-basic)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        HTTP Framework (lib-http)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Text Parse Plug-in
> (parse-text)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Regex URL Filter
> (urlfilter-regex)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Http Protocol Plug-in
> (protocol-http)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        OPIC Scoring Plug-in
> (scoring-opic)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        CyberNeko HTML Parser
> (lib-nekohtml)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        JavaScript Parser
> (parse-js)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        URL Query Filter
> (query-url)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Regex URL Filter Framework
> (lib-regex-filter)
> 2008-06-12 10:30:56,599 INFO  PluginRepository - Registered
> Extension-Points:
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch Summarizer
> (org.apache.nutch.searcher.Summarizer)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch Analysis
> (org.apache.nutch.analysis.NutchAnalyzer)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch Indexing Filter
> (org.apache.nutch.indexer.IndexingFilter)
> 2008-06-12 10:30:56,599 INFO  PluginRepository -        Nutch Online Search
> Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
> 2008-06-12 10:30:56,600 INFO  PluginRepository -        HTML Parse Filter
> (org.apache.nutch.parse.HtmlParseFilter)
> 2008-06-12 10:30:56,600 INFO  PluginRepository -        Nutch Content Parser
> (org.apache.nutch.parse.Parser)
> 2008-06-12 10:30:56,600 INFO  PluginRepository -        Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2008-06-12 10:30:56,600 INFO  PluginRepository -        Nutch Query Filter
> (org.apache.nutch.searcher.QueryFilter)
> 2008-06-12 10:30:56,600 INFO  PluginRepository -        Ontology Model Loader
> (org.apache.nutch.ontology.Ontology)
> 2008-06-12 10:30:56,608 INFO  NutchBean - creating new bean
> 2008-06-12 10:30:56,625 INFO  NutchBean - opening indexes in crawl/indexes
> 2008-06-12 10:30:56,724 INFO  Configuration - found resource
> common-terms.utf8 at
> file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
> 2008-06-12 10:30:56,730 INFO  NutchBean - opening segments in crawl/segments
> 2008-06-12 10:30:56,752 INFO  SummarizerFactory - Using the first summarizer
> extension found: Basic Summarizer
> 2008-06-12 10:30:56,753 INFO  NutchBean - opening linkdb in crawl/linkdb
> 2008-06-12 10:30:56,776 INFO  NutchBean - query request from 127.0.0.1
> 2008-06-12 10:30:56,788 INFO  NutchBean - query: horses
> 2008-06-12 10:30:56,788 INFO  NutchBean - lang: en
> 2008-06-12 10:30:56,822 INFO  NutchBean - searching for 20 raw hits
> 2008-06-12 10:30:56,889 INFO  NutchBean - total hits: 0
> "
>
> I don't know if that helps, but it does talk about Nutch, so...
> By the way, big thanks for your fast response; I'm really short on time.
>
> Post your Tomcat logs as you do a search...
>
> Jason
>

Re: Nutch- crawling?

Posted by nutch_newbie <ka...@hotmail.com>.

Which ones? There are 11 of them in the "logs" folder...
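
With that many log files, one way to find the relevant parts is to pull out only the SEVERE lines and the NutchBean query/hit pairs. The following is a minimal sketch, not part of Nutch or Tomcat: the names `summarize`, `QUERY_RE` and `HITS_RE` are made up for illustration, and the patterns assume the log lines look like the ones pasted in this thread.

```python
import re

# Patterns assumed from the NutchBean lines in this thread, e.g.
# "2008-06-12 09:04:36,127 INFO  NutchBean - query: horses"
# "2008-06-12 09:04:36,202 INFO  NutchBean - total hits: 0"
QUERY_RE = re.compile(r"NutchBean - query: (.*)$")
HITS_RE = re.compile(r"NutchBean - total hits: (\d+)")


def summarize(lines):
    """Return (queries, severe) for an iterable of log lines.

    queries is a list of (query_text, total_hits) tuples;
    severe is a list of the lines that start with "SEVERE:".
    """
    queries, severe, pending = [], [], None
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith("SEVERE:"):
            severe.append(line)
            continue
        match = QUERY_RE.search(line)
        if match:
            # Remember the query until its "total hits" line shows up.
            pending = match.group(1).strip()
            continue
        match = HITS_RE.search(line)
        if match and pending is not None:
            queries.append((pending, int(match.group(1))))
            pending = None
    return queries, severe
```

To use it, open a log file (for example catalina.out) and pass the file object to `summarize`; every query that comes back with 0 hits, together with any SEVERE lines, is probably what is worth posting.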

Here is one of them, catalina.out:

"Jun 11, 2008 1:11:03 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 11, 2008 1:11:03 PM org.apache.coyote.http11.Http11BaseProtocol init
SEVERE: Error initializing endpoint
java.net.BindException: Address already in use:8080
	at
org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
	at
org.apache.coyote.http11.Http11BaseProtocol.init(Http11BaseProtocol.java:138)
	at org.apache.catalina.connector.Connector.initialize(Connector.java:1016)
	at
org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
	at
org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
Jun 11, 2008 1:11:03 PM org.apache.catalina.startup.Catalina load
SEVERE: Catalina.start
LifecycleException:  Protocol handler initialization failed:
java.net.BindException: Address already in use:8080
	at org.apache.catalina.connector.Connector.initialize(Connector.java:1018)
	at
org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
	at
org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
Jun 11, 2008 1:11:03 PM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 491 ms
Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 11, 2008 1:11:03 PM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 11, 2008 1:11:04 PM org.apache.coyote.http11.Http11BaseProtocol start
SEVERE: Error starting endpoint
java.net.BindException: Address already in use:8080
	at
org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
	at
org.apache.tomcat.util.net.PoolTcpEndpoint.startEndpoint(PoolTcpEndpoint.java:312)
	at
org.apache.coyote.http11.Http11BaseProtocol.start(Http11BaseProtocol.java:150)
	at org.apache.coyote.http11.Http11Protocol.start(Http11Protocol.java:75)
	at org.apache.catalina.connector.Connector.start(Connector.java:1089)
	at org.apache.catalina.core.StandardService.start(StandardService.java:459)
	at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 11, 2008 1:11:04 PM org.apache.catalina.startup.Catalina start
SEVERE: Catalina.start: 
LifecycleException:  service.getName(): "Catalina";  Protocol handler start
failed: java.net.BindException: Address already in use:8080
	at org.apache.catalina.connector.Connector.start(Connector.java:1096)
	at org.apache.catalina.core.StandardService.start(StandardService.java:459)
	at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 11, 2008 1:11:04 PM org.apache.catalina.startup.Catalina start
INFO: Server startup in 822 ms
Jun 11, 2008 1:11:04 PM org.apache.catalina.core.StandardServer await
SEVERE: StandardServer.await: create[8005]: 
java.net.BindException: Address already in use
	at java.net.PlainSocketImpl.socketBind(Native Method)
	at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:359)
	at java.net.ServerSocket.bind(ServerSocket.java:319)
	at java.net.ServerSocket.<init>(ServerSocket.java:185)
	at org.apache.catalina.core.StandardServer.await(StandardServer.java:372)
	at org.apache.catalina.startup.Catalina.await(Catalina.java:615)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:575)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 11, 2008 1:11:04 PM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:11:04 PM org.apache.catalina.connector.Connector pause
SEVERE: Protocol handler pause failed
java.lang.NullPointerException
	at org.apache.jk.server.JkMain.pause(JkMain.java:677)
	at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
	at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
	at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
	at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
	at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
	at
org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
Jun 11, 2008 1:21:40 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 11, 2008 1:21:40 PM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:21:40 PM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 506 ms
Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 11, 2008 1:21:40 PM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 11, 2008 1:21:41 PM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:21:41 PM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 11, 2008 1:21:41 PM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/13  config=null
Jun 11, 2008 1:21:41 PM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 11, 2008 1:21:41 PM org.apache.catalina.startup.Catalina start
INFO: Server startup in 957 ms
Jun 11, 2008 1:30:33 PM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 11, 2008 1:41:00 PM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:41:01 PM org.apache.catalina.core.StandardService stop
INFO: Stopping service Catalina
Jun 11, 2008 1:41:01 PM org.apache.coyote.http11.Http11BaseProtocol destroy
INFO: Stopping Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:41:01 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: Failed shutdown of Apache Portable Runtime
Jun 11, 2008 1:41:08 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 11, 2008 1:41:08 PM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:41:08 PM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 484 ms
Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 11, 2008 1:41:08 PM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 11, 2008 1:41:08 PM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 11, 2008 1:41:08 PM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:41:09 PM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 11, 2008 1:41:09 PM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/13  config=null
Jun 11, 2008 1:41:09 PM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 11, 2008 1:41:09 PM org.apache.catalina.startup.Catalina start
INFO: Server startup in 1043 ms
Jun 11, 2008 1:52:22 PM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:52:23 PM org.apache.catalina.core.StandardService stop
INFO: Stopping service Catalina
Jun 11, 2008 1:52:23 PM org.apache.coyote.http11.Http11BaseProtocol destroy
INFO: Stopping Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:52:23 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: Failed shutdown of Apache Portable Runtime
Jun 11, 2008 1:52:30 PM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 11, 2008 1:52:30 PM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:52:30 PM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 518 ms
Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 11, 2008 1:52:30 PM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 11, 2008 1:52:30 PM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 11, 2008 1:52:31 PM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 11, 2008 1:52:31 PM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 11, 2008 1:52:31 PM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/12  config=null
Jun 11, 2008 1:52:31 PM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 11, 2008 1:52:31 PM org.apache.catalina.startup.Catalina start
INFO: Server startup in 1018 ms
2008-06-11 13:56:49,919 INFO  Configuration - parsing
jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-11 13:56:49,928 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-11 13:56:49,942 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-11 13:56:49,944 INFO  Configuration - parsing
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-11 13:56:49,952 INFO  PluginRepository - Plugins: looking in:
/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-11 13:56:50,052 INFO  PluginRepository - Plugin Auto-activation
mode: [true]
2008-06-11 13:56:50,052 INFO  PluginRepository - Registered Plugins:
2008-06-11 13:56:50,052 INFO  PluginRepository - 	the nutch core extension
points (nutch-extensionpoints)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Basic Query Filter
(query-basic)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Basic Indexing Filter
(index-basic)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Html Parse Plug-in
(parse-html)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Site Query Filter
(query-site)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Basic Summarizer Plug-in
(summary-basic)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Text Parse Plug-in
(parse-text)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Regex URL Filter
(urlfilter-regex)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Http Protocol Plug-in
(protocol-http)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	OPIC Scoring Plug-in
(scoring-opic)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	CyberNeko HTML Parser
(lib-nekohtml)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	JavaScript Parser
(parse-js)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	URL Query Filter
(query-url)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Regex URL Filter Framework
(lib-regex-filter)
2008-06-11 13:56:50,052 INFO  PluginRepository - Registered
Extension-Points:
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Nutch Summarizer
(org.apache.nutch.searcher.Summarizer)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Nutch Protocol
(org.apache.nutch.protocol.Protocol)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Nutch Analysis
(org.apache.nutch.analysis.NutchAnalyzer)
2008-06-11 13:56:50,052 INFO  PluginRepository - 	Nutch URL Filter
(org.apache.nutch.net.URLFilter)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Nutch Indexing Filter
(org.apache.nutch.indexer.IndexingFilter)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Nutch Online Search
Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	HTML Parse Filter
(org.apache.nutch.parse.HtmlParseFilter)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Nutch Content Parser
(org.apache.nutch.parse.Parser)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Nutch Scoring
(org.apache.nutch.scoring.ScoringFilter)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Nutch Query Filter
(org.apache.nutch.searcher.QueryFilter)
2008-06-11 13:56:50,053 INFO  PluginRepository - 	Ontology Model Loader
(org.apache.nutch.ontology.Ontology)
2008-06-11 13:56:50,061 INFO  NutchBean - creating new bean
2008-06-11 13:56:50,070 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-11 13:56:50,122 INFO  Configuration - found resource
common-terms.utf8 at
file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-11 13:56:50,129 INFO  NutchBean - opening segments in crawl/segments
2008-06-11 13:56:50,144 INFO  SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2008-06-11 13:56:50,144 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-11 13:56:50,151 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 13:56:50,165 INFO  NutchBean - query: horses
2008-06-11 13:56:50,165 INFO  NutchBean - lang: en
2008-06-11 13:56:50,196 INFO  NutchBean - searching for 20 raw hits
2008-06-11 13:56:50,231 INFO  NutchBean - total hits: 0
2008-06-11 13:56:57,489 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 13:56:57,490 INFO  NutchBean - query: wikipedia
2008-06-11 13:56:57,490 INFO  NutchBean - lang: en
2008-06-11 13:56:57,495 INFO  NutchBean - searching for 20 raw hits
2008-06-11 13:56:57,495 INFO  NutchBean - total hits: 0
2008-06-11 13:57:07,127 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 13:57:07,127 INFO  NutchBean - query: h
2008-06-11 13:57:07,127 INFO  NutchBean - lang: en
2008-06-11 13:57:07,129 INFO  NutchBean - searching for 20 raw hits
2008-06-11 13:57:07,129 INFO  NutchBean - total hits: 0
2008-06-11 13:57:17,289 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 13:57:17,289 INFO  NutchBean - query: nutch
2008-06-11 13:57:17,289 INFO  NutchBean - lang: en
2008-06-11 13:57:17,290 INFO  NutchBean - searching for 20 raw hits
2008-06-11 13:57:17,291 INFO  NutchBean - total hits: 0
2008-06-11 13:59:07,446 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 13:59:07,447 INFO  NutchBean - query: pest
2008-06-11 13:59:07,447 INFO  NutchBean - lang: en
2008-06-11 13:59:07,448 INFO  NutchBean - searching for 20 raw hits
2008-06-11 13:59:07,449 INFO  NutchBean - total hits: 0
2008-06-11 14:00:16,126 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 14:00:16,126 INFO  NutchBean - query: horses
2008-06-11 14:00:16,126 INFO  NutchBean - lang: en
2008-06-11 14:00:16,127 INFO  NutchBean - searching for 20 raw hits
2008-06-11 14:00:16,128 INFO  NutchBean - total hits: 0
2008-06-11 14:03:22,463 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 14:03:22,464 INFO  NutchBean - query: horse
2008-06-11 14:03:22,464 INFO  NutchBean - lang: en
2008-06-11 14:03:22,465 INFO  NutchBean - searching for 20 raw hits
2008-06-11 14:03:22,465 INFO  NutchBean - total hits: 0
2008-06-11 14:31:18,657 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 14:31:18,657 INFO  NutchBean - query: 78
2008-06-11 14:31:18,657 INFO  NutchBean - lang: pt
2008-06-11 14:31:18,658 INFO  NutchBean - searching for 20 raw hits
2008-06-11 14:31:18,659 INFO  NutchBean - total hits: 0
2008-06-11 14:31:24,065 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 14:31:24,066 INFO  NutchBean - query: 7
2008-06-11 14:31:24,066 INFO  NutchBean - lang: en
2008-06-11 14:31:24,066 INFO  NutchBean - searching for 20 raw hits
2008-06-11 14:31:24,067 INFO  NutchBean - total hits: 0
2008-06-11 18:14:33,501 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:14:33,501 INFO  NutchBean - query: horsse
2008-06-11 18:14:33,501 INFO  NutchBean - lang: en
2008-06-11 18:14:33,503 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:14:33,504 INFO  NutchBean - total hits: 0
2008-06-11 18:14:37,954 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:14:37,955 INFO  NutchBean - query: horse
2008-06-11 18:14:37,955 INFO  NutchBean - lang: en
2008-06-11 18:14:37,956 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:14:37,956 INFO  NutchBean - total hits: 0
2008-06-11 18:14:40,675 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:14:40,675 INFO  NutchBean - query: horse
2008-06-11 18:14:40,675 INFO  NutchBean - lang: en
2008-06-11 18:14:40,676 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:14:40,676 INFO  NutchBean - total hits: 0
2008-06-11 18:14:41,971 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:14:41,971 INFO  NutchBean - query: horse
2008-06-11 18:14:41,972 INFO  NutchBean - lang: en
2008-06-11 18:14:41,972 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:14:41,973 INFO  NutchBean - total hits: 0
2008-06-11 18:14:47,557 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:14:47,558 INFO  NutchBean - query: google
2008-06-11 18:14:47,558 INFO  NutchBean - lang: en
2008-06-11 18:14:47,559 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:14:47,559 INFO  NutchBean - total hits: 0
2008-06-11 18:45:29,142 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:45:29,143 INFO  NutchBean - query: 
2008-06-11 18:45:29,143 INFO  NutchBean - lang: en
2008-06-11 18:45:29,144 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:45:29,148 INFO  NutchBean - total hits: 0
2008-06-11 18:47:12,968 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 18:47:12,968 INFO  NutchBean - query: horses
2008-06-11 18:47:12,969 INFO  NutchBean - lang: en
2008-06-11 18:47:12,969 INFO  NutchBean - searching for 20 raw hits
2008-06-11 18:47:12,970 INFO  NutchBean - total hits: 0
2008-06-11 21:08:27,848 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 21:08:27,848 INFO  NutchBean - query: check
2008-06-11 21:08:27,848 INFO  NutchBean - lang: en
2008-06-11 21:08:27,849 INFO  NutchBean - searching for 20 raw hits
2008-06-11 21:08:27,850 INFO  NutchBean - total hits: 0
2008-06-11 21:08:32,650 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 21:08:32,650 INFO  NutchBean - query: hits
2008-06-11 21:08:32,650 INFO  NutchBean - lang: en
2008-06-11 21:08:32,651 INFO  NutchBean - searching for 20 raw hits
2008-06-11 21:08:32,651 INFO  NutchBean - total hits: 0
2008-06-11 21:08:40,582 INFO  NutchBean - query request from 127.0.0.1
2008-06-11 21:08:40,582 INFO  NutchBean - query: google
2008-06-11 21:08:40,582 INFO  NutchBean - lang: en
2008-06-11 21:08:40,583 INFO  NutchBean - searching for 20 raw hits
2008-06-11 21:08:40,584 INFO  NutchBean - total hits: 0
Jun 11, 2008 9:14:01 PM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 11, 2008 9:14:01 PM org.apache.catalina.connector.Connector pause
SEVERE: Protocol handler pause failed
java.net.SocketException: Network is unreachable
	at java.net.PlainSocketImpl.socketConnect(Native Method)
	at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
	at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
	at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
	at java.net.Socket.connect(Socket.java:519)
	at java.net.Socket.connect(Socket.java:469)
	at java.net.Socket.<init>(Socket.java:366)
	at java.net.Socket.<init>(Socket.java:209)
	at org.apache.jk.common.ChannelSocket.unLockSocket(ChannelSocket.java:473)
	at org.apache.jk.common.ChannelSocket.pause(ChannelSocket.java:270)
	at org.apache.jk.server.JkMain.pause(JkMain.java:679)
	at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
	at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
	at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
	at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
	at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
	at
org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
Jun 12, 2008 8:09:21 AM org.apache.catalina.core.AprLifecycleListener
lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in
production environments was not found on the java.library.path:
/opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 8:09:21 AM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:09:21 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 3557 ms
Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 8:09:22 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 8:09:26 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 8:09:28 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:09:29 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 12, 2008 8:09:29 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/58  config=null
Jun 12, 2008 8:09:29 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 12, 2008 8:09:29 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 7427 ms
2008-06-12 08:09:43,382 INFO  Configuration - parsing jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-12 08:09:43,466 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-12 08:09:43,493 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-12 08:09:43,495 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-12 08:09:43,515 INFO  PluginRepository - Plugins: looking in: /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-12 08:09:43,782 INFO  PluginRepository - Plugin Auto-activation mode: [true]
2008-06-12 08:09:43,782 INFO  PluginRepository - Registered Plugins:
2008-06-12 08:09:43,782 INFO  PluginRepository - 	the nutch core extension points (nutch-extensionpoints)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Basic Query Filter (query-basic)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Basic Indexing Filter (index-basic)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Html Parse Plug-in (parse-html)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Site Query Filter (query-site)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Basic Summarizer Plug-in (summary-basic)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Text Parse Plug-in (parse-text)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Regex URL Filter (urlfilter-regex)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Http Protocol Plug-in (protocol-http)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	OPIC Scoring Plug-in (scoring-opic)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	CyberNeko HTML Parser (lib-nekohtml)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	JavaScript Parser (parse-js)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	URL Query Filter (query-url)
2008-06-12 08:09:43,782 INFO  PluginRepository - 	Regex URL Filter Framework (lib-regex-filter)
2008-06-12 08:09:43,782 INFO  PluginRepository - Registered Extension-Points:
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Summarizer (org.apache.nutch.searcher.Summarizer)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Protocol (org.apache.nutch.protocol.Protocol)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch URL Filter (org.apache.nutch.net.URLFilter)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Indexing Filter (org.apache.nutch.indexer.IndexingFilter)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Online Search Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	HTML Parse Filter (org.apache.nutch.parse.HtmlParseFilter)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Content Parser (org.apache.nutch.parse.Parser)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Scoring (org.apache.nutch.scoring.ScoringFilter)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
2008-06-12 08:09:43,783 INFO  PluginRepository - 	Ontology Model Loader (org.apache.nutch.ontology.Ontology)
2008-06-12 08:09:43,792 INFO  NutchBean - creating new bean
2008-06-12 08:09:43,809 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 08:09:43,922 INFO  Configuration - found resource common-terms.utf8 at file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-12 08:09:43,928 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 08:09:43,948 INFO  SummarizerFactory - Using the first summarizer extension found: Basic Summarizer
2008-06-12 08:09:43,948 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 08:09:43,972 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:09:43,984 INFO  NutchBean - query: hellp
2008-06-12 08:09:43,984 INFO  NutchBean - lang: en
2008-06-12 08:09:44,044 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:09:44,108 INFO  NutchBean - total hits: 0
2008-06-12 08:09:49,223 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:09:49,224 INFO  NutchBean - query: horses
2008-06-12 08:09:49,224 INFO  NutchBean - lang: en
2008-06-12 08:09:49,225 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:09:49,225 INFO  NutchBean - total hits: 0
Jun 12, 2008 8:26:02 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in production environments was not found on the java.library.path: /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 8:26:02 AM org.apache.coyote.http11.Http11BaseProtocol init
SEVERE: Error initializing endpoint
java.net.BindException: Address already in use:8080
	at org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
	at org.apache.coyote.http11.Http11BaseProtocol.init(Http11BaseProtocol.java:138)
	at org.apache.catalina.connector.Connector.initialize(Connector.java:1016)
	at org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
	at org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
Jun 12, 2008 8:26:02 AM org.apache.catalina.startup.Catalina load
SEVERE: Catalina.start
LifecycleException:  Protocol handler initialization failed: java.net.BindException: Address already in use:8080
	at org.apache.catalina.connector.Connector.initialize(Connector.java:1018)
	at org.apache.catalina.core.StandardService.initialize(StandardService.java:580)
	at org.apache.catalina.core.StandardServer.initialize(StandardServer.java:791)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:503)
	at org.apache.catalina.startup.Catalina.load(Catalina.java:523)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.load(Bootstrap.java:247)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:412)
Jun 12, 2008 8:26:02 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 480 ms
Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 8:26:02 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 8:26:03 AM org.apache.coyote.http11.Http11BaseProtocol start
SEVERE: Error starting endpoint
java.net.BindException: Address already in use:8080
	at org.apache.tomcat.util.net.PoolTcpEndpoint.initEndpoint(PoolTcpEndpoint.java:297)
	at org.apache.tomcat.util.net.PoolTcpEndpoint.startEndpoint(PoolTcpEndpoint.java:312)
	at org.apache.coyote.http11.Http11BaseProtocol.start(Http11BaseProtocol.java:150)
	at org.apache.coyote.http11.Http11Protocol.start(Http11Protocol.java:75)
	at org.apache.catalina.connector.Connector.start(Connector.java:1089)
	at org.apache.catalina.core.StandardService.start(StandardService.java:459)
	at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.Catalina start
SEVERE: Catalina.start: 
LifecycleException:  service.getName(): "Catalina";  Protocol handler start failed: java.net.BindException: Address already in use:8080
	at org.apache.catalina.connector.Connector.start(Connector.java:1096)
	at org.apache.catalina.core.StandardService.start(StandardService.java:459)
	at org.apache.catalina.core.StandardServer.start(StandardServer.java:709)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:551)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 12, 2008 8:26:03 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 893 ms
Jun 12, 2008 8:26:03 AM org.apache.catalina.core.StandardServer await
SEVERE: StandardServer.await: create[8005]: 
java.net.BindException: Address already in use
	at java.net.PlainSocketImpl.socketBind(Native Method)
	at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:359)
	at java.net.ServerSocket.bind(ServerSocket.java:319)
	at java.net.ServerSocket.<init>(ServerSocket.java:185)
	at org.apache.catalina.core.StandardServer.await(StandardServer.java:372)
	at org.apache.catalina.startup.Catalina.await(Catalina.java:615)
	at org.apache.catalina.startup.Catalina.start(Catalina.java:575)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:275)
	at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
Jun 12, 2008 8:26:03 AM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:26:03 AM org.apache.catalina.connector.Connector pause
SEVERE: Protocol handler pause failed
java.lang.NullPointerException
	at org.apache.jk.server.JkMain.pause(JkMain.java:677)
	at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
	at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
	at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
	at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
	at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
	at org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
2008-06-12 08:26:32,086 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:26:32,089 INFO  NutchBean - query: horses
2008-06-12 08:26:32,089 INFO  NutchBean - lang: en
2008-06-12 08:26:32,090 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:26:32,091 INFO  NutchBean - total hits: 0
2008-06-12 08:29:09,089 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:29:09,089 INFO  NutchBean - query: horses
2008-06-12 08:29:09,089 INFO  NutchBean - lang: en
2008-06-12 08:29:09,090 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:29:09,091 INFO  NutchBean - total hits: 0
Jun 12, 2008 8:36:45 AM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:36:46 AM org.apache.catalina.core.StandardService stop
INFO: Stopping service Catalina
Jun 12, 2008 8:36:47 AM org.apache.coyote.http11.Http11BaseProtocol destroy
INFO: Stopping Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:36:47 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: Failed shutdown of Apache Portable Runtime
Jun 12, 2008 8:36:50 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in production environments was not found on the java.library.path: /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 8:36:50 AM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:36:50 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 414 ms
Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 8:36:50 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 8:36:51 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 8:36:51 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 12, 2008 8:36:51 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 12, 2008 8:36:51 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/34  config=null
Jun 12, 2008 8:36:51 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 12, 2008 8:36:51 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 968 ms
2008-06-12 08:37:04,238 INFO  Configuration - parsing jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-12 08:37:04,296 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-12 08:37:04,321 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-12 08:37:04,322 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-12 08:37:04,330 INFO  PluginRepository - Plugins: looking in: /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-12 08:37:04,425 INFO  PluginRepository - Plugin Auto-activation mode: [true]
2008-06-12 08:37:04,425 INFO  PluginRepository - Registered Plugins:
2008-06-12 08:37:04,425 INFO  PluginRepository - 	the nutch core extension points (nutch-extensionpoints)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Basic Query Filter (query-basic)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Basic Indexing Filter (index-basic)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Html Parse Plug-in (parse-html)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Site Query Filter (query-site)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Basic Summarizer Plug-in (summary-basic)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Text Parse Plug-in (parse-text)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Regex URL Filter (urlfilter-regex)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Http Protocol Plug-in (protocol-http)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	OPIC Scoring Plug-in (scoring-opic)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	CyberNeko HTML Parser (lib-nekohtml)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	JavaScript Parser (parse-js)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	URL Query Filter (query-url)
2008-06-12 08:37:04,425 INFO  PluginRepository - 	Regex URL Filter Framework (lib-regex-filter)
2008-06-12 08:37:04,425 INFO  PluginRepository - Registered Extension-Points:
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Summarizer (org.apache.nutch.searcher.Summarizer)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Protocol (org.apache.nutch.protocol.Protocol)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch URL Filter (org.apache.nutch.net.URLFilter)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Indexing Filter (org.apache.nutch.indexer.IndexingFilter)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Online Search Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	HTML Parse Filter (org.apache.nutch.parse.HtmlParseFilter)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Content Parser (org.apache.nutch.parse.Parser)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Scoring (org.apache.nutch.scoring.ScoringFilter)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
2008-06-12 08:37:04,426 INFO  PluginRepository - 	Ontology Model Loader (org.apache.nutch.ontology.Ontology)
2008-06-12 08:37:04,435 INFO  NutchBean - creating new bean
2008-06-12 08:37:04,443 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 08:37:04,485 INFO  Configuration - found resource common-terms.utf8 at file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-12 08:37:04,491 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 08:37:04,504 INFO  SummarizerFactory - Using the first summarizer extension found: Basic Summarizer
2008-06-12 08:37:04,504 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 08:37:04,510 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:37:04,522 INFO  NutchBean - query: horses
2008-06-12 08:37:04,522 INFO  NutchBean - lang: en
2008-06-12 08:37:04,564 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:37:04,607 INFO  NutchBean - total hits: 0
2008-06-12 08:37:15,225 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 08:37:15,226 INFO  NutchBean - query: pest
2008-06-12 08:37:15,226 INFO  NutchBean - lang: en
2008-06-12 08:37:15,229 INFO  NutchBean - searching for 20 raw hits
2008-06-12 08:37:15,230 INFO  NutchBean - total hits: 0
Jun 12, 2008 9:04:11 AM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 9:04:12 AM org.apache.catalina.core.StandardService stop
INFO: Stopping service Catalina
Jun 12, 2008 9:04:12 AM org.apache.coyote.http11.Http11BaseProtocol destroy
INFO: Stopping Coyote HTTP/1.1 on http-8080
Jun 12, 2008 9:04:12 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: Failed shutdown of Apache Portable Runtime
Jun 12, 2008 9:04:19 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in production environments was not found on the java.library.path: /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 9:04:20 AM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 9:04:20 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 481 ms
Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 9:04:20 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 9:04:20 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 9:04:21 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 12, 2008 9:04:21 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 12, 2008 9:04:21 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/14  config=null
Jun 12, 2008 9:04:21 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 12, 2008 9:04:21 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 1131 ms
2008-06-12 09:04:35,798 INFO  Configuration - parsing jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-12 09:04:35,857 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-12 09:04:35,882 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-12 09:04:35,884 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-12 09:04:35,892 INFO  PluginRepository - Plugins: looking in: /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-12 09:04:36,021 INFO  PluginRepository - Plugin Auto-activation mode: [true]
2008-06-12 09:04:36,021 INFO  PluginRepository - Registered Plugins:
2008-06-12 09:04:36,021 INFO  PluginRepository - 	the nutch core extension points (nutch-extensionpoints)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Basic Query Filter (query-basic)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Basic Indexing Filter (index-basic)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Html Parse Plug-in (parse-html)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Site Query Filter (query-site)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Basic Summarizer Plug-in (summary-basic)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Text Parse Plug-in (parse-text)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Regex URL Filter (urlfilter-regex)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	Http Protocol Plug-in (protocol-http)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	OPIC Scoring Plug-in (scoring-opic)
2008-06-12 09:04:36,021 INFO  PluginRepository - 	CyberNeko HTML Parser (lib-nekohtml)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	JavaScript Parser (parse-js)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	URL Query Filter (query-url)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Regex URL Filter Framework (lib-regex-filter)
2008-06-12 09:04:36,022 INFO  PluginRepository - Registered Extension-Points:
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Summarizer (org.apache.nutch.searcher.Summarizer)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Protocol (org.apache.nutch.protocol.Protocol)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch URL Filter (org.apache.nutch.net.URLFilter)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Indexing Filter (org.apache.nutch.indexer.IndexingFilter)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Online Search Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	HTML Parse Filter (org.apache.nutch.parse.HtmlParseFilter)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Content Parser (org.apache.nutch.parse.Parser)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Scoring (org.apache.nutch.scoring.ScoringFilter)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
2008-06-12 09:04:36,022 INFO  PluginRepository - 	Ontology Model Loader (org.apache.nutch.ontology.Ontology)
2008-06-12 09:04:36,035 INFO  NutchBean - creating new bean
2008-06-12 09:04:36,048 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 09:04:36,090 INFO  Configuration - found resource common-terms.utf8 at file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-12 09:04:36,096 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 09:04:36,110 INFO  SummarizerFactory - Using the first summarizer extension found: Basic Summarizer
2008-06-12 09:04:36,110 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 09:04:36,115 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 09:04:36,127 INFO  NutchBean - query: horses
2008-06-12 09:04:36,127 INFO  NutchBean - lang: en
2008-06-12 09:04:36,171 INFO  NutchBean - searching for 20 raw hits
2008-06-12 09:04:36,202 INFO  NutchBean - total hits: 0
2008-06-12 09:16:45,571 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 09:16:45,571 INFO  NutchBean - query: horses
2008-06-12 09:16:45,571 INFO  NutchBean - lang: en
2008-06-12 09:16:45,572 INFO  NutchBean - searching for 20 raw hits
2008-06-12 09:16:45,573 INFO  NutchBean - total hits: 0
2008-06-12 09:16:48,412 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 09:16:48,412 INFO  NutchBean - query: horses
2008-06-12 09:16:48,412 INFO  NutchBean - lang: en
2008-06-12 09:16:48,413 INFO  NutchBean - searching for 20 raw hits
2008-06-12 09:16:48,413 INFO  NutchBean - total hits: 0
Jun 12, 2008 9:36:23 AM org.apache.coyote.http11.Http11BaseProtocol pause
INFO: Pausing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 9:36:23 AM org.apache.catalina.connector.Connector pause
SEVERE: Protocol handler pause failed
java.net.SocketException: Network is unreachable
	at java.net.PlainSocketImpl.socketConnect(Native Method)
	at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:333)
	at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:195)
	at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:182)
	at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
	at java.net.Socket.connect(Socket.java:519)
	at java.net.Socket.connect(Socket.java:469)
	at java.net.Socket.<init>(Socket.java:366)
	at java.net.Socket.<init>(Socket.java:209)
	at org.apache.jk.common.ChannelSocket.unLockSocket(ChannelSocket.java:473)
	at org.apache.jk.common.ChannelSocket.pause(ChannelSocket.java:270)
	at org.apache.jk.server.JkMain.pause(JkMain.java:679)
	at org.apache.jk.server.JkCoyoteHandler.pause(JkCoyoteHandler.java:162)
	at org.apache.catalina.connector.Connector.pause(Connector.java:1031)
	at org.apache.catalina.core.StandardService.stop(StandardService.java:491)
	at org.apache.catalina.core.StandardServer.stop(StandardServer.java:743)
	at org.apache.catalina.startup.Catalina.stop(Catalina.java:601)
	at org.apache.catalina.startup.Catalina$CatalinaShutdownHook.run(Catalina.java:644)
Jun 12, 2008 10:30:44 AM org.apache.catalina.core.AprLifecycleListener lifecycleEvent
INFO: The Apache Tomcat Native library which allows optimal performance in production environments was not found on the java.library.path: /opt/jdk/jdk1.6.0_06/jre/lib/i386/server:/opt/jdk/jdk1.6.0_06/jre/lib/i386:/opt/jdk/jdk1.6.0_06/jre/../lib/i386:/usr/java/packages/lib/i386:/lib:/usr/lib
Jun 12, 2008 10:30:45 AM org.apache.coyote.http11.Http11BaseProtocol init
INFO: Initializing Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:30:45 AM org.apache.catalina.startup.Catalina load
INFO: Initialization processed in 1599 ms
Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardService start
INFO: Starting service Catalina
Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardEngine start
INFO: Starting Servlet Engine: Apache Tomcat/5.5.16
Jun 12, 2008 10:30:45 AM org.apache.catalina.core.StandardHost start
INFO: XML validation disabled
Jun 12, 2008 10:30:46 AM org.apache.catalina.startup.HostConfig deployWAR
INFO: Deploying web application archive nutch-0.8.1.war
Jun 12, 2008 10:30:47 AM org.apache.coyote.http11.Http11BaseProtocol start
INFO: Starting Coyote HTTP/1.1 on http-8080
Jun 12, 2008 10:30:47 AM org.apache.jk.common.ChannelSocket init
INFO: JK: ajp13 listening on /0.0.0.0:8009
Jun 12, 2008 10:30:47 AM org.apache.jk.server.JkMain start
INFO: Jk running ID=0 time=0/30  config=null
Jun 12, 2008 10:30:47 AM org.apache.catalina.storeconfig.StoreLoader load
INFO: Find registry server-registry.xml at classpath resource
Jun 12, 2008 10:30:47 AM org.apache.catalina.startup.Catalina start
INFO: Server startup in 2626 ms
2008-06-12 10:30:56,210 INFO  Configuration - parsing jar:file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/lib/hadoop-0.4.0-patched.jar!/hadoop-default.xml
2008-06-12 10:30:56,288 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-default.xml
2008-06-12 10:30:56,317 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/nutch-site.xml
2008-06-12 10:30:56,319 INFO  Configuration - parsing file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/hadoop-site.xml
2008-06-12 10:30:56,339 INFO  PluginRepository - Plugins: looking in: /opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/plugins
2008-06-12 10:30:56,599 INFO  PluginRepository - Plugin Auto-activation mode: [true]
2008-06-12 10:30:56,599 INFO  PluginRepository - Registered Plugins:
2008-06-12 10:30:56,599 INFO  PluginRepository - 	the nutch core extension points (nutch-extensionpoints)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Basic Query Filter (query-basic)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Basic Indexing Filter (index-basic)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Html Parse Plug-in (parse-html)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Site Query Filter (query-site)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Basic Summarizer Plug-in (summary-basic)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	HTTP Framework (lib-http)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Text Parse Plug-in (parse-text)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Regex URL Filter (urlfilter-regex)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Http Protocol Plug-in (protocol-http)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	OPIC Scoring Plug-in (scoring-opic)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	CyberNeko HTML Parser (lib-nekohtml)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	JavaScript Parser (parse-js)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	URL Query Filter (query-url)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Regex URL Filter Framework (lib-regex-filter)
2008-06-12 10:30:56,599 INFO  PluginRepository - Registered Extension-Points:
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch Summarizer (org.apache.nutch.searcher.Summarizer)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch Protocol (org.apache.nutch.protocol.Protocol)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch Analysis (org.apache.nutch.analysis.NutchAnalyzer)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch URL Filter (org.apache.nutch.net.URLFilter)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch Indexing Filter (org.apache.nutch.indexer.IndexingFilter)
2008-06-12 10:30:56,599 INFO  PluginRepository - 	Nutch Online Search Results Clustering Plugin (org.apache.nutch.clustering.OnlineClusterer)
2008-06-12 10:30:56,600 INFO  PluginRepository - 	HTML Parse Filter (org.apache.nutch.parse.HtmlParseFilter)
2008-06-12 10:30:56,600 INFO  PluginRepository - 	Nutch Content Parser (org.apache.nutch.parse.Parser)
2008-06-12 10:30:56,600 INFO  PluginRepository - 	Nutch Scoring (org.apache.nutch.scoring.ScoringFilter)
2008-06-12 10:30:56,600 INFO  PluginRepository - 	Nutch Query Filter (org.apache.nutch.searcher.QueryFilter)
2008-06-12 10:30:56,600 INFO  PluginRepository - 	Ontology Model Loader (org.apache.nutch.ontology.Ontology)
2008-06-12 10:30:56,608 INFO  NutchBean - creating new bean
2008-06-12 10:30:56,625 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 10:30:56,724 INFO  Configuration - found resource common-terms.utf8 at file:/opt/apache-tomcat-5.5.16/webapps/nutch-0.8.1/WEB-INF/classes/common-terms.utf8
2008-06-12 10:30:56,730 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 10:30:56,752 INFO  SummarizerFactory - Using the first summarizer
extension found: Basic Summarizer
2008-06-12 10:30:56,753 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 10:30:56,776 INFO  NutchBean - query request from 127.0.0.1
2008-06-12 10:30:56,788 INFO  NutchBean - query: horses
2008-06-12 10:30:56,788 INFO  NutchBean - lang: en
2008-06-12 10:30:56,822 INFO  NutchBean - searching for 20 raw hits
2008-06-12 10:30:56,889 INFO  NutchBean - total hits: 0
"

I don't know if that helps, but it does mention Nutch, so...
By the way, big thanks for your fast response. I'm really short on time.
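
For reference, the log above shows NutchBean opening `crawl/indexes`, `crawl/segments` and `crawl/linkdb`, all relative paths, and then reporting `total hits: 0`. A relative path is normally resolved against the directory Tomcat was started from, so it may not point at the directory the crawl actually wrote to. The sketch below is a minimal, hypothetical helper (the function names are made up for illustration) that pulls those two facts out of a catalina.out excerpt. It assumes the log line wording shown in this thread; the related assumption that the path comes from the `searcher.dir` property should be checked against the nutch-default.xml of the installed version.

```python
import re
from pathlib import PurePosixPath

# Patterns for the NutchBean lines seen in the log excerpt of this thread.
OPENING = re.compile(r"NutchBean - opening (\w+) in (\S+)")
TOTAL_HITS = re.compile(r"NutchBean - total hits: (\d+)")


def summarize(log_text):
    """Return (paths NutchBean opened, list of hit counts) found in log_text."""
    opened = {kind: path for kind, path in OPENING.findall(log_text)}
    hits = [int(n) for n in TOTAL_HITS.findall(log_text)]
    return opened, hits


def relative_paths(opened):
    """Return the opened paths that are not absolute, sorted by name."""
    return sorted(p for p in opened.values()
                  if not PurePosixPath(p).is_absolute())


# Lines copied from the log posted in this thread.
sample = """\
2008-06-12 10:30:56,625 INFO  NutchBean - opening indexes in crawl/indexes
2008-06-12 10:30:56,730 INFO  NutchBean - opening segments in crawl/segments
2008-06-12 10:30:56,753 INFO  NutchBean - opening linkdb in crawl/linkdb
2008-06-12 10:30:56,889 INFO  NutchBean - total hits: 0
"""

opened, hits = summarize(sample)
print("relative paths:", relative_paths(opened))
print("hit counts:", hits)
```

If the paths come back relative and every hit count is 0, it is worth comparing the directory Tomcat was launched from with the directory that holds the crawl output.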

Jason Boss wrote:
> 
> post your tomcat logs as you do a search...
> 
> Jason
> 

-- 
View this message in context: http://www.nabble.com/Nutch--crawling--tp17801131p17802984.html


Re: Nutch- crawling?

Posted by Jason Boss <jb...@gmail.com>.
post your tomcat logs as you do a search...

Jason

On Thu, Jun 12, 2008 at 7:30 AM, nutch_newbie <ka...@hotmail.com> wrote:
>
> Yes, I keep restarting Tomcat, but that doesn't help. Versions: Java
> jdk-1.6.0, Nutch 0.8.1, Tomcat 5.5.16
>
> Jason Boss wrote:
>>
>> What version of nutch, java, and tomcat?
>>
>> Make sure you are restarting tomcat.  Read your tomcat logs and you
>> will see what the issue is.
>>
>> Jason
>>
>>
>> On Thu, Jun 12, 2008 at 7:19 AM, nutch_newbie <ka...@hotmail.com>
>> wrote:
>>>
>>> I ran the crawler, and it seems just fine. and  in
>>> localhost:8080/nutch-0.8.1
>>> the nutch search window is displayed, but whenever something is searched,
>>> the results always say "Hits 0-0 (out of about 0 total matching pages): "
>>> here is the piece of my crawl-urlfilter.txt that i modified:
>>>
>>> # accept hosts in MY.DOMAIN.NAME
>>> +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
>>> +^http://([a-z0-9]*\.)*www.en.wikipedia.org
>>> +^http://([a-z0-9]*\.)*www.google.com
>>> +^http://([a-z0-9]*\.)*www.search.yahoo.com/
>>>
>>> what else am i supposed to do?  i'm really confused and running short on
>>> time. any and all help would be greatly appreciated. thanks in advance.
>>>
>>> PS: my computer is linux- FC5- but the folders and config files are still
>>> the same. and i also tried restarting tomcat- which didn't help.
>>>
>>>
>>> --
>>> View this message in context:
>>> http://www.nabble.com/Nutch--crawling--tp17801131p17801131.html
>>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>>
>>>
>>
>>
>
> --
> View this message in context: http://www.nabble.com/Nutch--crawling--tp17801131p17801391.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>

Re: Nutch- crawling?

Posted by nutch_newbie <ka...@hotmail.com>.
Yes, I keep restarting Tomcat, but that doesn't help. Versions: Java
jdk-1.6.0, Nutch 0.8.1, Tomcat 5.5.16

Jason Boss wrote:
> 
> What version of nutch, java, and tomcat?
> 
> Make sure you are restarting tomcat.  Read your tomcat logs and you
> will see what the issue is.
> 
> Jason
> 
> 
> On Thu, Jun 12, 2008 at 7:19 AM, nutch_newbie <ka...@hotmail.com>
> wrote:
>>
>> I ran the crawler, and it seems just fine. and  in
>> localhost:8080/nutch-0.8.1
>> the nutch search window is displayed, but whenever something is searched,
>> the results always say "Hits 0-0 (out of about 0 total matching pages): "
>> here is the piece of my crawl-urlfilter.txt that i modified:
>>
>> # accept hosts in MY.DOMAIN.NAME
>> +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
>> +^http://([a-z0-9]*\.)*www.en.wikipedia.org
>> +^http://([a-z0-9]*\.)*www.google.com
>> +^http://([a-z0-9]*\.)*www.search.yahoo.com/
>>
>> what else am i supposed to do?  i'm really confused and running short on
>> time. any and all help would be greatly appreciated. thanks in advance.
>>
>> PS: my computer is linux- FC5- but the folders and config files are still
>> the same. and i also tried restarting tomcat- which didn't help.
>>
>>
>> --
>> View this message in context:
>> http://www.nabble.com/Nutch--crawling--tp17801131p17801131.html
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>
>>
> 
> 

-- 
View this message in context: http://www.nabble.com/Nutch--crawling--tp17801131p17801391.html


Re: Nutch- crawling?

Posted by Jason Boss <jb...@gmail.com>.
What version of nutch, java, and tomcat?

Make sure you are restarting tomcat.  Read your tomcat logs and you
will see what the issue is.

Jason


On Thu, Jun 12, 2008 at 7:19 AM, nutch_newbie <ka...@hotmail.com> wrote:
>
> I ran the crawler, and it seems just fine. and  in localhost:8080/nutch-0.8.1
> the nutch search window is displayed, but whenever something is searched,
> the results always say "Hits 0-0 (out of about 0 total matching pages): "
> here is the piece of my crawl-urlfilter.txt that i modified:
>
> # accept hosts in MY.DOMAIN.NAME
> +^http://([a-z0-9]*\.)*MY.DOMAIN.NAME/
> +^http://([a-z0-9]*\.)*www.en.wikipedia.org
> +^http://([a-z0-9]*\.)*www.google.com
> +^http://([a-z0-9]*\.)*www.search.yahoo.com/
>
> what else am i supposed to do?  i'm really confused and running short on
> time. any and all help would be greatly appreciated. thanks in advance.
>
> PS: my computer is linux- FC5- but the folders and config files are still
> the same. and i also tried restarting tomcat- which didn't help.
>
>
> --
> View this message in context: http://www.nabble.com/Nutch--crawling--tp17801131p17801131.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>
>
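
The accept rules quoted above can be tried against sample URLs before crawling. Each rule has a literal `www` in front of the host, so `http://www.en.wikipedia.org/` would pass while `http://en.wikipedia.org/...` would not. The sketch below checks this with Python's `re` module. It assumes the URL filter applies each rule with "find" (search) semantics and that these simple patterns behave the same in Python as in Java. The sample URLs and the looser pattern are hypothetical and not taken from the thread. Whether this relates to the zero hits depends on the seed URLs, which the thread does not show.

```python
import re

# Accept patterns copied from the quoted crawl-urlfilter.txt (leading "+" dropped).
QUOTED_PATTERNS = [
    r"^http://([a-z0-9]*\.)*www.en.wikipedia.org",
    r"^http://([a-z0-9]*\.)*www.google.com",
    r"^http://([a-z0-9]*\.)*www.search.yahoo.com/",
]

# Hypothetical looser pattern, not from the thread: no literal "www" required.
LOOSER_PATTERN = r"^http://([a-z0-9]*\.)*wikipedia\.org/"


def accepted(url, patterns):
    """True if any pattern is found in url (search semantics, an assumption)."""
    return any(re.search(p, url) is not None for p in patterns)


# Illustrative URLs only.
SAMPLE_URLS = [
    "http://www.en.wikipedia.org/",
    "http://en.wikipedia.org/wiki/Horse",
    "http://www.google.com/",
]

for url in SAMPLE_URLS:
    print(url,
          "quoted rules:", accepted(url, QUOTED_PATTERNS),
          "looser rule:", accepted(url, [LOOSER_PATTERN]))
```

A URL that no accept rule matches is typically dropped by the filter, so it is never fetched or indexed.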