You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] Commented: (NUTCH-191) InputFormat used in job must be in JobTracker classpath (not loaded from job JAR) - posted by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2006/02/01 00:01:32 UTC, 2 replies.
- [jira] Created: (NUTCH-195) RPC call times out while indexing map task is computing splits - posted by "Chris Schneider (JIRA)" <ji...@apache.org> on 2006/02/01 00:26:32 UTC, 0 replies.
- [jira] Updated: (NUTCH-192) meta data support for CrawlDatum - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/01 02:12:34 UTC, 5 replies.
- Nutch Adminstration Interface - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/01 02:43:12 UTC, 0 replies.
- Proxy Exceptions - posted by "Guenter, Matthias" <Ma...@ipi.ch> on 2006/02/01 10:21:05 UTC, 0 replies.
- [jira] Commented: (NUTCH-192) meta data support for CrawlDatum - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/01 10:31:04 UTC, 13 replies.
- [jira] Closed: (NUTCH-194) Nutch-169 introduced two tiny bugs - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/01 14:01:50 UTC, 0 replies.
- Integrating Nutch w/Alexa - posted by Ken Krugler <kk...@transpac.com> on 2006/02/01 17:26:08 UTC, 1 replies.
- [jira] Created: (NUTCH-196) lib-xml and lib-log4j plugins - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/01 18:36:40 UTC, 0 replies.
- [jira] Commented: (NUTCH-196) lib-xml and lib-log4j plugins - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/01 19:15:32 UTC, 5 replies.
- [jira] Updated: (NUTCH-185) XMLParser is configurable xml parser plugin. - posted by "Rida Benjelloun (JIRA)" <ji...@apache.org> on 2006/02/01 21:11:28 UTC, 0 replies.
- Cmd line for running plugins - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/01 22:35:27 UTC, 5 replies.
- [jira] Created: (NUTCH-197) NullPointerException in TaskRunner if application jar does not have "lib" directory - posted by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2006/02/02 00:02:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-197) NullPointerException in TaskRunner if application jar does not have "lib" directory - posted by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2006/02/02 00:04:03 UTC, 0 replies.
- [jira] Resolved: (NUTCH-197) NullPointerException in TaskRunner if application jar does not have "lib" directory - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/02 00:22:03 UTC, 0 replies.
- [jira] Created: (NUTCH-198) SWF parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/02 13:26:04 UTC, 0 replies.
- [jira] Updated: (NUTCH-198) SWF parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/02 13:26:05 UTC, 0 replies.
- [jira] Commented: (NUTCH-198) SWF parser - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/02 20:14:03 UTC, 4 replies.
- Some bugs I'm trying to characterize.... - posted by "Bryan A. Pendleton" <bp...@geekdom.net> on 2006/02/02 21:06:43 UTC, 1 replies.
- [jira] Commented: (NUTCH-59) meta data support in webdb - posted by "James Jonas (JIRA)" <ji...@apache.org> on 2006/02/02 23:45:04 UTC, 2 replies.
- [jira] Created: (NUTCH-199) tool to mount ndfs on linux - posted by "John Xing (JIRA)" <ji...@apache.org> on 2006/02/03 08:00:04 UTC, 0 replies.
- [jira] Updated: (NUTCH-199) tool to mount ndfs on linux - posted by "John Xing (JIRA)" <ji...@apache.org> on 2006/02/03 08:04:03 UTC, 0 replies.
- Re: tool to mount nutch filesystem - posted by John X <jo...@neasys.com> on 2006/02/03 08:19:41 UTC, 2 replies.
- [jira] Commented: (NUTCH-193) move NDFS and MapReduce to a separate project - posted by "John Xing (JIRA)" <ji...@apache.org> on 2006/02/03 08:29:03 UTC, 3 replies.
- [jira] Commented: (NUTCH-81) Webapp only works when deployed in root - posted by "Michael Nebel (JIRA)" <ji...@apache.org> on 2006/02/03 09:07:05 UTC, 0 replies.
- Carrot2 v. 1.0.1. [clustering plugin] - posted by Dawid Weiss <da...@cs.put.poznan.pl> on 2006/02/03 11:03:32 UTC, 3 replies.
- [jira] Commented: (NUTCH-139) Standard metadata property names in the ParseData metadata - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/03 12:36:04 UTC, 6 replies.
- [jira] Created: (NUTCH-200) OpenSearch Servlet ist broken - posted by "Michael Nebel (JIRA)" <ji...@apache.org> on 2006/02/03 14:35:03 UTC, 0 replies.
- incremental index task - posted by Derek Young <dm...@gmail.com> on 2006/02/03 15:36:23 UTC, 0 replies.
- [jira] Closed: (NUTCH-198) SWF parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/03 19:51:22 UTC, 0 replies.
- [jira] Assigned: (NUTCH-178) in search.jsp must be session creation "false" - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/03 20:27:03 UTC, 0 replies.
- [jira] Resolved: (NUTCH-178) in search.jsp must be session creation "false" - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/03 20:29:25 UTC, 0 replies.
- [jira] Created: (NUTCH-201) add support for subcollections - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/03 21:23:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-201) add support for subcollections - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/03 21:25:04 UTC, 1 replies.
- [jira] Created: (NUTCH-202) Mapper, Reducer need an occasion to cleanup after the last record is processed. - posted by "Michel Tourn (JIRA)" <ji...@apache.org> on 2006/02/04 01:07:17 UTC, 0 replies.
- Re: svn commit: r374731 - in /lucene/nutch/trunk/src/web/jsp: anchors.jsp cached.jsp explain.jsp index.jsp search.jsp text.jsp - posted by Doug Cutting <cu...@apache.org> on 2006/02/04 01:43:45 UTC, 1 replies.
- [jira] Resolved: (NUTCH-193) move NDFS and MapReduce to a separate project - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/04 01:50:15 UTC, 3 replies.
- Nutch plugin - posted by Rida Benjelloun <ri...@doculibre.com> on 2006/02/04 02:39:17 UTC, 0 replies.
- RE: takes too long to remove a page from WEBDB - posted by Fuad Efendi <fu...@efendi.ca> on 2006/02/04 05:47:25 UTC, 0 replies.
- ProtocolStatus.MOVED - posted by Fuad Efendi <fu...@efendi.ca> on 2006/02/04 06:31:57 UTC, 1 replies.
- [jira] Commented: (NUTCH-170) Crash with multiple temp directories - posted by "Rod Taylor (JIRA)" <ji...@apache.org> on 2006/02/04 06:54:05 UTC, 0 replies.
- Wrong 'Next Fetch' Date - posted by mos <mo...@gmail.com> on 2006/02/04 17:44:09 UTC, 0 replies.
- Re: svn commit: r374842 - in /lucene/nutch/trunk/src/web/jsp: anchors.jsp cached.jsp explain.jsp refine-query-init.jsp search.jsp text.jsp - posted by Doug Cutting <cu...@apache.org> on 2006/02/04 23:14:07 UTC, 0 replies.
- ArrayIndexOutOfBoundsException during invert link phase - posted by Ken Krugler <kk...@transpac.com> on 2006/02/04 23:14:35 UTC, 1 replies.
- Re: Ideas for enhancements - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/05 17:32:59 UTC, 0 replies.
- [jira] Created: (NUTCH-203) ParseSegment throws InstantiationException - posted by "Marko Bauhardt (JIRA)" <ji...@apache.org> on 2006/02/06 09:42:04 UTC, 0 replies.
- [jira] Updated: (NUTCH-203) ParseSegment throws InstantiationException - posted by "Marko Bauhardt (JIRA)" <ji...@apache.org> on 2006/02/06 09:51:03 UTC, 0 replies.
- [jira] Updated: (NUTCH-200) OpenSearch Servlet ist broken - posted by "Marko Bauhardt (JIRA)" <ji...@apache.org> on 2006/02/06 11:28:09 UTC, 0 replies.
- [jira] Assigned: (NUTCH-200) OpenSearch Servlet ist broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/06 21:10:24 UTC, 0 replies.
- [jira] Resolved: (NUTCH-200) OpenSearch Servlet ist broken - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/06 21:14:16 UTC, 0 replies.
- [jira] Assigned: (NUTCH-81) Webapp only works when deployed in root - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/06 21:55:57 UTC, 0 replies.
- javaswf.jar - posted by Jérôme Charron <je...@gmail.com> on 2006/02/06 22:11:21 UTC, 2 replies.
- [jira] Resolved: (NUTCH-81) Webapp only works when deployed in root - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/06 22:12:57 UTC, 0 replies.
- [jira] Created: (NUTCH-204) multiple field values in HitDetails - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/07 00:21:57 UTC, 0 replies.
- [jira] Updated: (NUTCH-204) multiple field values in HitDetails - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/07 00:23:58 UTC, 1 replies.
- [jira] Updated: (NUTCH-81) Webapp only works when deployed in root - posted by "Michael Nebel (JIRA)" <ji...@apache.org> on 2006/02/07 10:09:57 UTC, 0 replies.
- [jira] Created: (NUTCH-205) Wrong 'fetch date' for non available pages - posted by "M.Oliver Scheele (JIRA)" <ji...@apache.org> on 2006/02/07 11:39:56 UTC, 0 replies.
- [jira] Commented: (NUTCH-205) Wrong 'fetch date' for non available pages - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/07 14:11:57 UTC, 2 replies.
- [jira] Created: (NUTCH-206) search server throws InstantiationException - posted by "jimmy (JIRA)" <ji...@apache.org> on 2006/02/07 16:46:15 UTC, 2 replies.
- [OT] Mailing lists - posted by Andrew McNabb <am...@mcnabbs.org> on 2006/02/07 19:27:26 UTC, 1 replies.
- [jira] Created: (NUTCH-207) Bandwidth target for fetcher rather than a thread count - posted by "Rod Taylor (JIRA)" <ji...@apache.org> on 2006/02/07 19:45:56 UTC, 0 replies.
- [jira] Updated: (NUTCH-207) Bandwidth target for fetcher rather than a thread count - posted by "Rod Taylor (JIRA)" <ji...@apache.org> on 2006/02/07 19:45:57 UTC, 0 replies.
- [jira] Commented: (NUTCH-207) Bandwidth target for fetcher rather than a thread count - posted by "Rod Taylor (JIRA)" <ji...@apache.org> on 2006/02/07 19:47:57 UTC, 0 replies.
- [jira] Commented: (NUTCH-158) Process Sitemap data in text, rss or xml format as well as OAI-PMH - posted by "raghavendra prabhu (JIRA)" <ji...@apache.org> on 2006/02/07 22:03:25 UTC, 0 replies.
- [jira] Resolved: (NUTCH-149) outlinks not shown properly in cached.jsp - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/07 22:23:57 UTC, 0 replies.
- [jira] Closed: (NUTCH-149) outlinks not shown properly in cached.jsp - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/07 22:23:58 UTC, 0 replies.
- [jira] Updated: (NUTCH-196) lib-xml and lib-log4j plugins - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/08 01:45:02 UTC, 0 replies.
- No node available for block errors - posted by Chris Schneider <Sc...@TransPac.com> on 2006/02/08 04:47:54 UTC, 0 replies.
- ignore eclipse .project and .classpath - posted by Chris Mattmann <ch...@jpl.nasa.gov> on 2006/02/08 06:16:51 UTC, 1 replies.
- [jira] Created: (NUTCH-208) http: proxy exception list: - posted by "Matthias Günter (JIRA)" <ji...@apache.org> on 2006/02/08 16:29:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-208) http: proxy exception list: - posted by "Matthias Günter (JIRA)" <ji...@apache.org> on 2006/02/08 16:31:04 UTC, 1 replies.
- [jira] Updated: (NUTCH-139) Standard metadata property names in the ParseData metadata - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/08 17:45:41 UTC, 0 replies.
- Success with Nutch & GCJ - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/08 18:38:43 UTC, 0 replies.
- [jira] Resolved: (NUTCH-139) Standard metadata property names in the ParseData metadata - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/08 22:52:57 UTC, 0 replies.
- whitespaces was: meta data support for CrawlDatum - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/09 00:05:56 UTC, 3 replies.
- process/create/hand over: crawl meta data - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/09 00:17:52 UTC, 1 replies.
- Empty Parse - posted by Jérôme Charron <je...@gmail.com> on 2006/02/09 16:30:45 UTC, 3 replies.
- Jakarta-POI 3.0-alpha1 - posted by Jérôme Charron <je...@gmail.com> on 2006/02/09 16:41:05 UTC, 0 replies.
- [jira] Created: (NUTCH-209) include nutch jar in mapred jobs - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/09 19:39:57 UTC, 0 replies.
- Re: ignore eclipse .project and .classpath - posted by og...@yahoo.com on 2006/02/09 21:13:10 UTC, 1 replies.
- Jira was: Re: whitespaces - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/09 22:50:24 UTC, 0 replies.
- [jira] Commented: (NUTCH-209) include nutch jar in mapred jobs - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/09 23:07:55 UTC, 3 replies.
- [jira] Resolved: (NUTCH-209) include nutch jar in mapred jobs - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/10 00:19:57 UTC, 0 replies.
- [jira] Closed: (NUTCH-192) meta data support for CrawlDatum - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/10 02:05:55 UTC, 0 replies.
- Fetch same URL two times - posted by Fuad Efendi <fu...@efendi.ca> on 2006/02/10 05:57:11 UTC, 0 replies.
- Word, Powerpoint and Excel parsers - posted by Jérôme Charron <je...@gmail.com> on 2006/02/10 16:21:27 UTC, 0 replies.
- [jira] Resolved: (NUTCH-52) Parser plugin for MS Excel files - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/10 18:09:55 UTC, 0 replies.
- RE: Authentication / Content-type - posted by Teruhiko Kurosaka <Ku...@basistech.com> on 2006/02/11 01:24:15 UTC, 2 replies.
- [jira] Commented: (NUTCH-53) Parser plugin for Zip files - posted by "ilango gurusamy (JIRA)" <ji...@apache.org> on 2006/02/12 06:19:28 UTC, 0 replies.
- [jira] Commented: (NUTCH-125) OpenOffice Parser plugin - posted by "ilango gurusamy (JIRA)" <ji...@apache.org> on 2006/02/12 06:38:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-23) content text/xml parser - posted by "ilango gurusamy (JIRA)" <ji...@apache.org> on 2006/02/12 06:40:28 UTC, 1 replies.
- [jira] Commented: (NUTCH-74) French Analyzer Plugin - posted by "ilango gurusamy (JIRA)" <ji...@apache.org> on 2006/02/12 06:48:29 UTC, 0 replies.
- duplicate libs - posted by Doug Cutting <cu...@apache.org> on 2006/02/14 00:26:04 UTC, 22 replies.
- Which extension point should I extend? - posted by Elwin <ma...@gmail.com> on 2006/02/14 08:11:28 UTC, 3 replies.
- A little hack: retrieve only new urls - posted by Enrico Triolo <en...@gmail.com> on 2006/02/14 11:54:29 UTC, 0 replies.
- Plugin dependencies - posted by Enrico Triolo <en...@gmail.com> on 2006/02/14 16:51:20 UTC, 2 replies.
- [jira] Resolved: (NUTCH-137) footer is not displayed in search result page - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 17:00:13 UTC, 0 replies.
- [jira] Resolved: (NUTCH-118) FAQ link points to invalid URL - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 17:02:09 UTC, 0 replies.
- [jira] Assigned: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 17:02:10 UTC, 0 replies.
- [jira] Commented: (NUTCH-165) object pooling for nutch bean --- to impriove performance - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 17:06:10 UTC, 0 replies.
- [jira] Closed: (NUTCH-123) Cache.jsp some times generate NullPointerException - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 17:10:09 UTC, 0 replies.
- [jira] Resolved: (NUTCH-184) Serbian (sr, Cyrilic) and Serbo-Croatian (sh, Latin) translation - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 20:38:09 UTC, 0 replies.
- [jira] Assigned: (NUTCH-48) "Did you mean" query enhancement/refignment feature request - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 20:40:09 UTC, 0 replies.
- [jira] Resolved: (NUTCH-64) no results after a restart of a search--server (without tomcat restart) - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 20:55:09 UTC, 0 replies.
- [jira] Resolved: (NUTCH-90) reduce logging output of IndexSegment - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/02/14 21:01:09 UTC, 0 replies.
- [jira] Commented: (NUTCH-140) Add alias capability in parse-plugins.xml file that allows mimeType->extensionId mapping - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/14 21:06:08 UTC, 0 replies.
- [jira] Created: (NUTCH-210) Context.xml file for Nutch web application - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/15 08:30:41 UTC, 0 replies.
- [jira] Created: (NUTCH-211) FetchedSegments leave readers open - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/15 12:11:08 UTC, 0 replies.
- [jira] Commented: (NUTCH-204) multiple field values in HitDetails - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/15 13:31:14 UTC, 8 replies.
- [jira] Commented: (NUTCH-211) FetchedSegments leave readers open - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/15 18:21:09 UTC, 4 replies.
- [jira] Assigned: (NUTCH-211) FetchedSegments leave readers open - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/15 21:06:09 UTC, 0 replies.
- All tasktrackers access same site at the same time (hadoop) please help - posted by Gal Nitzan <gn...@usa.net> on 2006/02/15 21:33:57 UTC, 0 replies.
- Re: All tasktrackers access same site at the same time (hadoop) please help - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/15 21:56:25 UTC, 2 replies.
- [jira] Updated: (NUTCH-211) FetchedSegments leave readers open - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/16 01:08:44 UTC, 1 replies.
- [jira] Updated: (NUTCH-140) Add alias capability in parse-plugins.xml file that allows mimeType->extensionId mapping - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/16 03:21:59 UTC, 0 replies.
- fExtensionPoints is a HashMap ? - posted by Elwin <ma...@gmail.com> on 2006/02/16 05:49:14 UTC, 1 replies.
- Maven - posted by Fuad Efendi <fu...@efendi.ca> on 2006/02/16 06:03:29 UTC, 0 replies.
- Re: All tasktrackers access same site at the same time (hadoop) please help - posted by Gal Nitzan <gn...@usa.net> on 2006/02/16 10:18:27 UTC, 0 replies.
- Unable to complete a full fetch, reason Child Error - posted by Gal Nitzan <gn...@usa.net> on 2006/02/16 10:21:05 UTC, 5 replies.
- updatedb does not work for already indexed segment - posted by Rozina Sorathia <Ro...@KPITCummins.com> on 2006/02/16 12:58:16 UTC, 1 replies.
- How to supprt multi-fields highlight? - posted by Jack Tang <hi...@gmail.com> on 2006/02/16 18:34:28 UTC, 0 replies.
- Global locking - posted by Gal Nitzan <gn...@usa.net> on 2006/02/16 22:29:07 UTC, 2 replies.
- [jira] Resolved: (NUTCH-211) FetchedSegments leave readers open - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/17 00:33:24 UTC, 0 replies.
- [jira] Created: (NUTCH-212) ant build problem with locale-sr - posted by "Alain Fankhauser (JIRA)" <ji...@apache.org> on 2006/02/17 13:08:24 UTC, 0 replies.
- [jira] Commented: (NUTCH-212) ant build problem with locale-sr - posted by "Alain Fankhauser (JIRA)" <ji...@apache.org> on 2006/02/17 13:10:25 UTC, 0 replies.
- [jira] Updated: (NUTCH-143) Improper error numbers returned on exit - posted by "Rod Taylor (JIRA)" <ji...@apache.org> on 2006/02/18 00:05:28 UTC, 0 replies.
- Nutch Improvement - HTML Parser - posted by Fuad Efendi <fu...@efendi.ca> on 2006/02/18 04:12:46 UTC, 10 replies.
- [jira] Created: (NUTCH-213) checkstyle - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/18 22:25:24 UTC, 0 replies.
- [jira] Updated: (NUTCH-213) checkstyle - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/02/18 22:55:25 UTC, 0 replies.
- Summarier threads in nutch - posted by Jack Tang <hi...@gmail.com> on 2006/02/19 18:20:54 UTC, 9 replies.
- Thread in nutch - posted by Jack Tang <hi...@gmail.com> on 2006/02/20 15:54:57 UTC, 0 replies.
- [jira] Created: (NUTCH-214) Added Links to web site to search mailling list - posted by "Jake Vanderdray (JIRA)" <ji...@apache.org> on 2006/02/20 22:00:25 UTC, 0 replies.
- URL Partitioning (Lexical vs. IP Address) - posted by Chris Schneider <Sc...@TransPac.com> on 2006/02/21 05:05:01 UTC, 4 replies.
- Single Map Task Requirement for Fetching - posted by Chris Schneider <Sc...@TransPac.com> on 2006/02/21 05:07:30 UTC, 3 replies.
- Redirection and Partitioning - posted by Chris Schneider <Sc...@TransPac.com> on 2006/02/21 05:16:01 UTC, 0 replies.
- SWF Parser on Nutch 0.7 - posted by Dima Mazmanov <di...@proservice.ge> on 2006/02/21 10:10:04 UTC, 0 replies.
- [jira] Closed: (NUTCH-140) Add alias capability in parse-plugins.xml file that allows mimeType->extensionId mapping - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/21 11:04:11 UTC, 0 replies.
- [jira] Created: (NUTCH-215) Plugin execution order - posted by "Enrico Triolo (JIRA)" <ji...@apache.org> on 2006/02/21 11:27:27 UTC, 0 replies.
- [jira] Updated: (NUTCH-215) Plugin execution order - posted by "Enrico Triolo (JIRA)" <ji...@apache.org> on 2006/02/21 11:49:28 UTC, 0 replies.
- [jira] Closed: (NUTCH-214) Added Links to web site to search mailling list - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/21 12:16:56 UTC, 0 replies.
- [jira] Commented: (NUTCH-215) Plugin execution order - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/21 12:54:01 UTC, 0 replies.
- [jira] Resolved: (NUTCH-212) ant build problem with locale-sr - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/21 15:13:37 UTC, 0 replies.
- [jira] Closed: (NUTCH-188) Add searchable mailing list links to http://lucene.apache.org/nutch/mailing_lists.html - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/22 11:56:40 UTC, 0 replies.
- Problem with DB_GONE status - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/23 14:23:44 UTC, 2 replies.
- HEADS-UP: cmd-line change for "invertlinks" - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/23 18:28:36 UTC, 1 replies.
- still need jetty jars? - posted by Stefan Groschupf <sg...@media-style.com> on 2006/02/23 22:46:21 UTC, 1 replies.
- Bug and Fix for DistributedSearch$Client - posted by Heiko Dietze <he...@biotec.tu-dresden.de> on 2006/02/24 11:06:11 UTC, 1 replies.
- [jira] Created: (NUTCH-216) cannot build in windows - posted by "bin zhu (JIRA)" <ji...@apache.org> on 2006/02/24 17:08:38 UTC, 0 replies.
- [jira] Updated: (NUTCH-216) cannot build in windows - posted by "bin zhu (JIRA)" <ji...@apache.org> on 2006/02/24 17:15:50 UTC, 0 replies.
- [jira] Resolved: (NUTCH-216) cannot build in windows - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/24 20:14:37 UTC, 0 replies.
- [jira] Commented: (NUTCH-100) New plugin urlfilter-db - posted by "Fuad Efendi (JIRA)" <ji...@apache.org> on 2006/02/24 21:40:44 UTC, 2 replies.
- FW: Good reading/research on PDF text extraction - posted by Richard Braman <rb...@bramantax.com> on 2006/02/26 21:43:28 UTC, 1 replies.
- Release Planning - posted by Nutch developer <nu...@googlemail.com> on 2006/02/26 22:31:47 UTC, 1 replies.
- [jira] Created: (NUTCH-217) InstantiationException when deserializing Query (no parameterless constructor) - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2006/02/27 08:42:41 UTC, 0 replies.
- Help need Nutch crawler. - posted by Rajpaul Cheenath <Ra...@mindtree.com> on 2006/02/27 12:53:10 UTC, 0 replies.
- [jira] Closed: (NUTCH-204) multiple field values in HitDetails - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/27 23:31:54 UTC, 0 replies.
- OPIC score calculation issues - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/02/28 00:14:41 UTC, 1 replies.
- [jira] Updated: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/02/28 00:48:54 UTC, 0 replies.
- [jira] Commented: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/28 01:03:56 UTC, 1 replies.
- Nutch Parsing PDFs, and general PDF extraction - posted by Richard Braman <rb...@bramantax.com> on 2006/02/28 13:43:00 UTC, 10 replies.
- FW: Index aborted crawl. - posted by Richard Braman <rb...@bramantax.com> on 2006/02/28 17:46:43 UTC, 1 replies.
- [jira] Created: (NUTCH-218) need DOAP file for Nutch - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/02/28 18:13:47 UTC, 0 replies.
- [jira] Assigned: (NUTCH-218) need DOAP file for Nutch - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/02/28 19:23:41 UTC, 0 replies.
- [jira] Updated: (NUTCH-218) need DOAP file for Nutch - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/02/28 19:34:40 UTC, 0 replies.
- PDF Parse Error - posted by Richard Braman <rb...@bramantax.com> on 2006/02/28 22:00:33 UTC, 1 replies.
- FW: pdf to xml - posted by Richard Braman <rb...@bramantax.com> on 2006/02/28 22:59:39 UTC, 0 replies.