- Re: refetching interval - posted by YourSoft <yo...@freemail.hu> on 2006/06/01 12:02:20 UTC, 0 replies.
- webgraph - posted by YourSoft <yo...@freemail.hu> on 2006/06/01 12:02:37 UTC, 0 replies.
- [jira] Created: (NUTCH-293) support for Crawl-delay in Robots.txt - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/01 19:24:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-293) support for Crawl-delay in Robots.txt - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/01 19:26:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-289) CrawlDatum should store IP address - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/01 20:41:30 UTC, 0 replies.
- how to turn on logging, excersize analyzer, tips on debugging plugins? - posted by Teruhiko Kurosaka <Ku...@basistech.com> on 2006/06/01 22:01:09 UTC, 0 replies.
- i18n in nutch home page is misnomor - posted by Teruhiko Kurosaka <Ku...@basistech.com> on 2006/06/01 23:54:38 UTC, 0 replies.
- [jira] Created: (NUTCH-294) Topic-maps of related searchwords - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/06/02 08:56:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-282) Showing too few results on a page (Paging not correct) - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:08:30 UTC, 1 replies.
- [jira] Commented: (NUTCH-286) Handling common error-pages as 404 - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:13:31 UTC, 1 replies.
- [jira] Commented: (NUTCH-288) hitsPerSite-functionality "flawed": problems writing a page-navigation - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:20:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:31:31 UTC, 0 replies.
- [jira] Commented: (NUTCH-291) OpenSearchServlet should return "date" as well as "lastModified" - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:39:30 UTC, 1 replies.
- [jira] Commented: (NUTCH-290) parse-pdf: Garbage indexed when text-extraction not allowed - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:45:31 UTC, 3 replies.
- [jira] Updated: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/06/02 17:51:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-287) Exception when searching with sort - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:55:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-284) NullPointerException during index - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:57:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-284) NullPointerException during index - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:59:31 UTC, 0 replies.
- [jira] Commented: (NUTCH-281) cached.jsp: base-href needs to be outside comments - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 17:59:32 UTC, 0 replies.
- [jira] Commented: (NUTCH-275) Fetcher not parsing XHTML-pages at all - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 18:07:31 UTC, 2 replies.
- [jira] Commented: (NUTCH-274) Empty row in/at end of URL-list results in error - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 18:13:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-274) Empty row in/at end of URL-list results in error - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 18:25:31 UTC, 0 replies.
- [jira] Resolved: (NUTCH-282) Showing too few results on a page (Paging not correct) - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 18:34:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-286) Handling common error-pages as 404 - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/02 18:36:30 UTC, 0 replies.
- [jira] Created: (NUTCH-295) More description for fetcher.threads.fetch property - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2006/06/02 18:58:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-295) More description for fetcher.threads.fetch property - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2006/06/02 19:00:30 UTC, 0 replies.
- [jira] Created: (NUTCH-296) Image Search - posted by "Thomas Delnoij (JIRA)" <ji...@apache.org> on 2006/06/03 18:53:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-296) Image Search - posted by "Thomas Delnoij (JIRA)" <ji...@apache.org> on 2006/06/03 19:05:30 UTC, 0 replies.
- [jira] Created: (NUTCH-297) sandbox svn folder - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/03 19:13:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-294) Topic-maps of related searchwords - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 19:59:30 UTC, 4 replies.
- [jira] Commented: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 20:10:30 UTC, 12 replies.
- [jira] Assigned: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 20:16:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 20:18:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 20:18:31 UTC, 1 replies.
- [jira] Updated: (NUTCH-187) Cannot start Nutch datanodes on Windows outside of a cygwin environment because of DF - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/03 20:44:30 UTC, 0 replies.
- [jira] Created: (NUTCH-298) if a 404 for a robots.txt is returned no page is fetched at all from the host - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/03 21:44:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-298) if a 404 for a robots.txt is returned no page is fetched at all from the host - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/03 21:53:30 UTC, 0 replies.
- RobotRuleSet - posted by Stefan Groschupf <sg...@media-style.com> on 2006/06/03 21:58:35 UTC, 0 replies.
- [jira] Created: (NUTCH-299) Bittorrent Parser - posted by "Hasan Diwan (JIRA)" <ji...@apache.org> on 2006/06/04 01:04:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-299) Bittorrent Parser - posted by "Hasan Diwan (JIRA)" <ji...@apache.org> on 2006/06/04 01:07:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-299) Bittorrent Parser - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/06/04 16:15:30 UTC, 1 replies.
- [jira] Commented: (NUTCH-298) if a 404 for a robots.txt is returned no page is fetched at all from the host - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/06/04 17:56:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-298) if a 404 for a robots.txt is returned a NPE is thrown - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/04 18:27:30 UTC, 0 replies.
- search engine spam detector - posted by Stefan Groschupf <sg...@media-style.com> on 2006/06/04 19:14:47 UTC, 4 replies.
- [jira] Resolved: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/04 20:20:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/04 20:22:30 UTC, 0 replies.
- Re: [Nutch-cvs] svn commit: r411594 - /lucene/nutch/trunk/contrib/web2/plugins/build.xml - posted by og...@yahoo.com on 2006/06/05 07:33:11 UTC, 5 replies.
- summary - posted by an...@orbita1.ru on 2006/06/05 11:43:05 UTC, 3 replies.
- parse OutOfMemoryError? - posted by Uygar Yüzsüren <uy...@gmail.com> on 2006/06/05 14:35:53 UTC, 0 replies.
- [jira] Created: (NUTCH-300) Clustering API improvements - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/05 17:20:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-289) CrawlDatum should store IP address - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/05 17:53:30 UTC, 2 replies.
- [jira] Updated: (NUTCH-300) Clustering API improvements - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/05 20:23:30 UTC, 0 replies.
- [jira] Reopened: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/05 20:40:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-201) add support for subcollections - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/05 22:14:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-298) if a 404 for a robots.txt is returned a NPE is thrown - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/05 23:48:30 UTC, 0 replies.
- Re: svn commit: r411943 - in /lucene/nutch/trunk/lib: commons-logging-1.0.4.jar hadoop-0.2.1.jar hadoop-0.3.1.jar log4j-1.2.13.jar - posted by Jérôme Charron <je...@gmail.com> on 2006/06/06 11:02:56 UTC, 3 replies.
- [jira] Commented: (NUTCH-48) "Did you mean" query enhancement/refignment feature request - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/06 21:04:31 UTC, 1 replies.
- Re: Nutch web site - posted by Sami Siren <ss...@gmail.com> on 2006/06/06 21:17:16 UTC, 0 replies.
- wildcard / regular expression searches - posted by Björn Wilmsmann <bj...@wilmsmann.de> on 2006/06/07 00:12:23 UTC, 0 replies.
- classloading problem hadoop .3.1 - posted by Stefan Groschupf <sg...@media-style.com> on 2006/06/07 01:41:09 UTC, 0 replies.
- [jira] Created: (NUTCH-301) CommonGrams loads analysis.common.terms.file for each query - posted by "Chris Schneider (JIRA)" <ji...@apache.org> on 2006/06/07 04:50:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-301) CommonGrams loads analysis.common.terms.file for each query - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/07 10:29:30 UTC, 0 replies.
- Re: Status of language plugin - posted by Jérôme Charron <je...@gmail.com> on 2006/06/07 10:58:08 UTC, 0 replies.
- [jira] Resolved: (NUTCH-275) Fetcher not parsing XHTML-pages at all - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/07 15:08:31 UTC, 0 replies.
- [jira] Created: (NUTCH-302) java doc of CrawlDb is wrong - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/07 18:28:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-301) CommonGrams loads analysis.common.terms.file for each query - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/07 18:53:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-293) support for Crawl-delay in Robots.txt - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/07 18:53:10 UTC, 2 replies.
- [jira] Created: (NUTCH-303) logging improvements - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/07 18:55:29 UTC, 0 replies.
- resolving IP in... - posted by Stefan Groschupf <sg...@media-style.com> on 2006/06/07 19:11:32 UTC, 6 replies.
- a little deterrent - posted by khz <kh...@tzi.org> on 2006/06/07 22:26:35 UTC, 0 replies.
- [jira] Resolved: (NUTCH-301) CommonGrams loads analysis.common.terms.file for each query - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/08 00:20:30 UTC, 0 replies.
- [jira] Created: (NUTCH-304) Change JIRA email address for nutch issues from apache incubator - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/09 06:13:29 UTC, 0 replies.
- How do I use nuch tomerge multiple webdb? - posted by Nutch开发邮件 <pr...@gmail.com> on 2006/06/09 06:41:51 UTC, 1 replies.
- anchor text modifications - posted by Brian Higgins <su...@gmail.com> on 2006/06/09 07:36:29 UTC, 1 replies.
- [jira] Created: (NUTCH-305) Update crawl and url filter lists to exclude jpeg|JPEG|bmp|BMP - posted by "chris finne (JIRA)" <ji...@apache.org> on 2006/06/09 08:51:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-305) Update crawl and url filter lists to exclude jpeg|JPEG|bmp|BMP - posted by "Stefan Neufeind (JIRA)" <ji...@apache.org> on 2006/06/09 12:08:30 UTC, 0 replies.
- Adding new urls in WebDB - posted by Lourival Júnior <ju...@gmail.com> on 2006/06/09 13:46:05 UTC, 4 replies.
- Nutch logging questions - posted by Jérôme Charron <je...@gmail.com> on 2006/06/09 19:35:17 UTC, 1 replies.
- [jira] Updated: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2006/06/09 22:02:30 UTC, 0 replies.
- [jira] Created: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem - posted by "Grant Glouser (JIRA)" <ji...@apache.org> on 2006/06/10 03:34:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem - posted by "Grant Glouser (JIRA)" <ji...@apache.org> on 2006/06/10 03:36:30 UTC, 1 replies.
- 0.8 release - posted by Sami Siren <ss...@gmail.com> on 2006/06/10 07:32:34 UTC, 2 replies.
- how to manipulate with MapWritable metaData in CrawlDatum structure - posted by Feng Ji <fe...@gmail.com> on 2006/06/12 04:15:58 UTC, 2 replies.
- nutch-default.xml configuration - posted by Lourival Júnior <ju...@gmail.com> on 2006/06/12 16:33:15 UTC, 4 replies.
- Re: Re[2]: nutch-default.xml configuration - posted by Lourival Júnior <ju...@gmail.com> on 2006/06/12 17:06:52 UTC, 1 replies.
- Re[4]: nutch-default.xml configuration - posted by Dima Mazmanov <nu...@proservice.ge> on 2006/06/12 18:11:52 UTC, 0 replies.
- [jira] Resolved: (NUTCH-303) logging improvements - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/12 23:01:30 UTC, 3 replies.
- Cached.jsp to show images - posted by Marco Pereira <ma...@gmail.com> on 2006/06/12 23:47:22 UTC, 0 replies.
- [jira] Closed: (NUTCH-236) PdfParser and RSSParser Log4j appender redirection - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/13 12:07:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-224) Nutch doesn't handle Korean text at all - posted by "Sean Dean (JIRA)" <ji...@apache.org> on 2006/06/14 02:58:30 UTC, 0 replies.
- free disk space - posted by an...@orbita1.ru on 2006/06/14 08:43:55 UTC, 0 replies.
- No space left on device - posted by an...@orbita1.ru on 2006/06/14 14:57:49 UTC, 2 replies.
- IncrediBILL's Random Rants: How Much Nutch is TOO MUCH Nutch? - posted by Doug Cutting <cu...@apache.org> on 2006/06/14 19:03:10 UTC, 11 replies.
- nutch .72 out-of-the-box build issue - posted by "Dagum, Leo" <ld...@business.com> on 2006/06/14 19:34:15 UTC, 0 replies.
- search speed - posted by an...@orbita1.ru on 2006/06/15 10:08:48 UTC, 1 replies.
- [jira] Assigned: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/15 16:56:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-122) block numbers need a better random number generator - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/15 17:03:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-187) Cannot start Nutch datanodes on Windows outside of a cygwin environment because of DF - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/15 17:07:30 UTC, 0 replies.
- Re: [Nutch-cvs] svn commit: r414681 - /lucene/nutch/trunk/src/java/org/apache/nutch/protocol/ProtocolFactory.java - posted by Andrzej Bialecki <ab...@getopt.org> on 2006/06/16 10:59:19 UTC, 2 replies.
- [jira] Updated: (NUTCH-110) OpenSearchServlet outputs illegal xml characters - posted by "John VanDyk (JIRA)" <ji...@apache.org> on 2006/06/16 16:14:31 UTC, 4 replies.
- [jira] Commented: (NUTCH-110) OpenSearchServlet outputs illegal xml characters - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/16 16:30:32 UTC, 1 replies.
- does nutch follow HEAD element? - posted by AJ Chen <ca...@gmail.com> on 2006/06/16 22:33:55 UTC, 2 replies.
- which web app? - posted by Bill de hÓra <bi...@dehora.net> on 2006/06/17 00:28:51 UTC, 0 replies.
- [jira] Commented: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/18 20:22:30 UTC, 0 replies.
- [jira] Created: (NUTCH-307) wrong configured log4j.properties - posted by "Stefan Groschupf (JIRA)" <ji...@apache.org> on 2006/06/19 15:36:29 UTC, 0 replies.
- [jira] Commented: (NUTCH-307) wrong configured log4j.properties - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/20 15:31:30 UTC, 0 replies.
- [jira] Assigned: (NUTCH-110) OpenSearchServlet outputs illegal xml characters - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 16:21:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-266) hadoop bug when doing updatedb - posted by "KuroSaka TeruHiko (JIRA)" <ji...@apache.org> on 2006/06/20 19:49:30 UTC, 7 replies.
- [jira] Resolved: (NUTCH-302) java doc of CrawlDb is wrong - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 20:56:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-166) secure jobtracker info pages with a password - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 20:58:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-110) OpenSearchServlet outputs illegal xml characters - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 21:14:31 UTC, 0 replies.
- [jira] Resolved: (NUTCH-292) OpenSearchServlet: OutOfMemoryError: Java heap space - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 21:55:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-156) nutch-daemon.sh should not overwrite old logs by default - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 22:03:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-180) Performance problem with widely used keywords - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/20 22:14:30 UTC, 0 replies.
- [jira] Resolved: (NUTCH-307) wrong configured log4j.properties - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/21 22:22:35 UTC, 0 replies.
- webdb: old code <-> new code - posted by Francesco Cipriani <f....@mclink.net> on 2006/06/22 00:15:11 UTC, 0 replies.
- [jira] Created: (NUTCH-308) Maximum search time limit - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/22 02:59:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-308) Maximum search time limit - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/22 03:01:30 UTC, 0 replies.
- following forms using nutch... - posted by bruce <be...@earthlink.net> on 2006/06/22 05:17:17 UTC, 1 replies.
- [jira] Created: (NUTCH-309) Uses commons logging Code Guards - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/22 12:48:31 UTC, 0 replies.
- [jira] Created: (NUTCH-310) Review Log Levels - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/22 12:54:30 UTC, 0 replies.
- Problem opening checksum file - posted by an...@orbita1.ru on 2006/06/22 13:49:13 UTC, 0 replies.
- [jira] Resolved: (NUTCH-309) Uses commons logging Code Guards - posted by "Jerome Charron (JIRA)" <ji...@apache.org> on 2006/06/22 14:22:31 UTC, 0 replies.
- do not index - posted by Stefan Groschupf <sg...@media-style.com> on 2006/06/22 15:26:21 UTC, 1 replies.
- Re: svn commit: r416346 [1/3] - in /lucene/nutch/trunk/src: java/org/apache/nutch/analysis/ java/org/apache/nutch/clustering/ java/org/apache/nutch/crawl/ java/org/apache/nutch/fetcher/ java/org/apache/nutch/indexer/ java/org/apache/nutch/net/ java/org/apa... - posted by Doug Cutting <cu...@apache.org> on 2006/06/22 20:09:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-303) logging improvements - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/06/22 20:26:30 UTC, 0 replies.
- Re: svn commit: r416346 [1/3] - in /lucene/nutch/trunk/src: java/org/apache/nutch/analysis/ java/org/apache/nutch/clustering/ java/org/apache/nutch/crawl/ java/org/apache/nutch/fetcher/ java/org/apache/nutch/indexer/ java/org/apache/nutch/net/ java/o - posted by Jérôme Charron <je...@gmail.com> on 2006/06/22 22:58:11 UTC, 0 replies.
- [jira] Created: (NUTCH-311) Page with tens of thousands of links OOME'd. - posted by "stack@archive.org (JIRA)" <ji...@apache.org> on 2006/06/23 06:08:31 UTC, 0 replies.
- [jira] Updated: (NUTCH-311) Page with tens of thousands of links OOME'd. - posted by "stack@archive.org (JIRA)" <ji...@apache.org> on 2006/06/23 06:08:31 UTC, 0 replies.
- Plugin Repository caching - posted by Jérôme Charron <je...@gmail.com> on 2006/06/23 11:10:25 UTC, 0 replies.
- nutch - functionality.. - posted by bruce <be...@earthlink.net> on 2006/06/23 21:38:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-129) rtf-parser does not work when opened with wordpad files and saved - posted by "Andy Hedges (JIRA)" <ji...@apache.org> on 2006/06/25 22:22:30 UTC, 0 replies.
- [jira] Closed: (NUTCH-308) Maximum search time limit - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/26 21:42:30 UTC, 0 replies.
- [jira] Updated: (NUTCH-61) Adaptive re-fetch interval. Detecting umodified content - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2006/06/26 22:33:30 UTC, 0 replies.
- org.farng and com.etranslate - posted by og...@yahoo.com on 2006/06/27 20:34:19 UTC, 2 replies.
- [jira] Resolved: (NUTCH-306) DistributedSearch.Client liveAddresses concurrency problem - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/27 21:34:30 UTC, 0 replies.
- [jira] Created: (NUTCH-312) Fix for upcoming incompatibility with Hadoop-0.4 - posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org> on 2006/06/27 22:43:29 UTC, 0 replies.
- [jira] Updated: (NUTCH-312) Fix for upcoming incompatibility with Hadoop-0.4 - posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org> on 2006/06/27 22:47:30 UTC, 1 replies.
- [jira] Created: (NUTCH-313) moreFrom property in search.properties cannot be translated into Japanese. Compound text issue. - posted by "KuroSaka TeruHiko (JIRA)" <ji...@apache.org> on 2006/06/28 00:31:29 UTC, 0 replies.
- Possible memory leak? - posted by Enrico Triolo <en...@gmail.com> on 2006/06/28 12:44:20 UTC, 3 replies.
- [jira] Created: (NUTCH-314) Multiple language identifier instances - posted by "Enrico Triolo (JIRA)" <ji...@apache.org> on 2006/06/28 14:30:29 UTC, 0 replies.
- Lucene Vs. Nutch features? - posted by Tonal Web Design - Stijn <St...@tonalweb.com> on 2006/06/28 18:01:22 UTC, 0 replies.
- Nutch Vs. other indexers - posted by Tonal Web Design - Stijn <St...@tonalweb.com> on 2006/06/28 18:02:01 UTC, 0 replies.
- [jira] Created: (NUTCH-315) CrawlDbReader usage text - implementation mismatch - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2006/06/28 19:44:31 UTC, 0 replies.
- [jira] Resolved: (NUTCH-312) Fix for upcoming incompatibility with Hadoop-0.4 - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2006/06/28 23:55:30 UTC, 0 replies.
- [jira] Commented: (NUTCH-309) Uses commons logging Code Guards - posted by "Dawid Weiss (JIRA)" <ji...@apache.org> on 2006/06/29 08:56:30 UTC, 1 replies.
- RE: compile search.jsp - posted by aoki <ao...@team-lab.com> on 2006/06/30 06:47:58 UTC, 0 replies.