You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Created] (NUTCH-1347) fetcher politeness related to map-reduce - posted by "behnam nikbakht (JIRA)" <ji...@apache.org> on 2012/05/01 09:29:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1347) fetcher politeness related to map-reduce - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/05/01 11:36:50 UTC, 2 replies.
- [jira] [Closed] (NUTCH-1343) Crawl sites with hashtags in url - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 13:39:51 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1343) Crawl sites with hashtags in url - posted by "Roberto Gardenier (JIRA)" <ji...@apache.org> on 2012/05/01 13:57:52 UTC, 2 replies.
- [jira] [Closed] (NUTCH-1332) db.max.outlinks.per.page not honored - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 14:43:51 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1346) Follow outlinks to ignore external - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 15:45:50 UTC, 1 replies.
- [jira] [Created] (NUTCH-1348) Solrindexer fails with a java.io.IOException error. - posted by "Christian Johnsson (JIRA)" <ji...@apache.org> on 2012/05/01 21:40:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1348) Solrindexer fails with a java.io.IOException error. - posted by "Christian Johnsson (JIRA)" <ji...@apache.org> on 2012/05/01 23:12:51 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1348) Solrindexer fails with a java.io.IOException error. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 23:30:52 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1339) Default URL normalization rules to remove page anchors completely - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/01 23:34:50 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #242 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/02 06:07:01 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1830 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/02 06:08:22 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/02 11:26:52 UTC, 5 replies.
- [jira] [Commented] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/02 13:24:50 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1323) AjaxNormalizer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/02 16:40:57 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1300) Indexer to normalize URL's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/02 17:16:50 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1294) IndexClean job with solr implementation. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/02 22:42:49 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #243 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/03 06:52:05 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1831 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/03 06:59:32 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/03 11:16:57 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/03 11:18:58 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1293) IndexingFiltersChecker to store detected content type in crawldatum metadata - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/03 11:18:58 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1348) Solrindexer fails with a java.io.IOException error. - posted by "Christian Johnsson (JIRA)" <ji...@apache.org> on 2012/05/03 13:46:50 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/03 14:58:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1321) IDNNormalizer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/03 16:18:49 UTC, 0 replies.
- [jira] [Closed] (NUTCH-896) Gora-based tests need to have their own config files - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/03 17:30:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-896) Gora-based tests need to have their own config files - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/03 17:30:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/03 20:16:50 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1205) Upgrade gora modules to 0.2 in ivy/ivy.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/03 20:18:50 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/05/03 20:43:32 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "CommandLineOptions" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/05/03 20:44:43 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1294) IndexClean job with solr implementation. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/03 21:28:48 UTC, 1 replies.
- [jira] [Created] (NUTCH-1349) Make batchId explcit within debug logging. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/03 21:48:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/03 22:06:48 UTC, 0 replies.
- [jira] [Created] (NUTCH-1350) remove unused dependancy because of access restriction - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/04 10:04:53 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1350) remove unused dependancy because of access restriction - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/04 10:04:55 UTC, 0 replies.
- [jira] [Created] (NUTCH-1351) DomainStatistics to aggregate by TLD - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/04 13:56:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1351) DomainStatistics to aggregate by TLD - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/04 14:02:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/04 14:28:48 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/04 16:18:51 UTC, 2 replies.
- Mapping file specifics - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/05/04 17:01:47 UTC, 2 replies.
- [jira] [Commented] (NUTCH-809) Parse-metatags plugin - posted by "Kristof (JIRA)" <ji...@apache.org> on 2012/05/04 22:04:49 UTC, 4 replies.
- [jira] [Closed] (NUTCH-902) Add all necessary files and configuration so that nutch can be used with different backends out-of-the-box - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2012/05/05 02:59:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1350) remove unused dependancy because of access restriction - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/05/05 06:21:37 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #246 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/06 06:07:11 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1834 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/06 06:09:05 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #247 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/07 06:18:40 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1835 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/07 06:30:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 10:56:10 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 10:58:19 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1352) Improve regex urlfilters/normalizers synchronization - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/07 11:02:05 UTC, 5 replies.
- [jira] [Updated] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:19:59 UTC, 0 replies.
- [jira] [Created] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:19:59 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:22:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:48:03 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:49:54 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 11:49:55 UTC, 0 replies.
- How to crawl https sites with certificate - posted by Siddharth Jain <si...@gmail.com> on 2012/05/07 12:02:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 13:45:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 13:47:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 14:11:48 UTC, 3 replies.
- [jira] [Created] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 14:11:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1356) ParseUtil use ExecutorService instead of manually thread handling. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/07 14:32:50 UTC, 7 replies.
- [jira] [Closed] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/07 17:30:51 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1342) Read time out protocol-http - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/08 13:37:47 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/08 13:49:50 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1349) Make batchId explcit within debug logging and improve CLI - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/08 13:49:51 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1301) Index job resume switch to resume a failed job - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/08 13:59:50 UTC, 1 replies.
- Jason Trost Nutchgora Fork - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/05/08 19:05:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1353) nutchgora DomainStatistics support crawlId, counter bug and reformatting - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/05/09 06:17:07 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1354) nutchgora support fetcher.queue.depth.multiplier property - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/05/09 06:17:08 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1355) nutchgora Configure minimum throughput for fetcher - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/05/09 06:17:11 UTC, 0 replies.
- store additional information from page at outlinks - topic specific crawl - posted by Armin Nagel <ar...@neofonie.de> on 2012/05/09 09:56:37 UTC, 3 replies.
- Re: [VOTE] Apache Nutch 1.5 release rc #1 - posted by Julien Nioche <li...@gmail.com> on 2012/05/09 12:11:37 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1016) Strip UTF-8 non-character codepoints - posted by "Christian Johnsson (JIRA)" <ji...@apache.org> on 2012/05/09 13:53:50 UTC, 3 replies.
- [jira] [Created] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 15:09:48 UTC, 0 replies.
- [jira] [Created] (NUTCH-1358) Do not accept bogus arguments - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 15:39:48 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1358) Do not accept bogus arguments - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 15:41:48 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1358) Do not accept bogus arguments - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 15:43:49 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1358) Do not accept bogus arguments - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 15:47:55 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 15:59:54 UTC, 0 replies.
- [jira] [Created] (NUTCH-1359) Add raw_headers support - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:01:48 UTC, 0 replies.
- [jira] [Created] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:03:49 UTC, 0 replies.
- [jira] [Created] (NUTCH-1361) Fix mishandling of malformed urls in generator job - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:05:49 UTC, 0 replies.
- [jira] [Created] (NUTCH-1362) Fix error handling of urls with empty fields - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:09:49 UTC, 0 replies.
- [jira] [Created] (NUTCH-1363) Make parsing in FetcherJob actually work. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:11:49 UTC, 0 replies.
- [jira] [Created] (NUTCH-1364) Add a counter for malformed urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/09 16:15:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1363) Make parsing in FetcherJob actually work. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 16:27:48 UTC, 5 replies.
- [jira] [Issue Comment Edited] (NUTCH-1363) Make parsing in FetcherJob actually work. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 16:27:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1357) All gora mapreduce functionality should go through StorageUtils - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/09 16:31:49 UTC, 1 replies.
- Date format issue Nutch-Solr with NUTCH-809 Parse-metatags plugin - posted by Kristof Kessler <kr...@googlemail.com> on 2012/05/10 07:54:30 UTC, 0 replies.
- [jira] [Created] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 09:54:54 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 10:12:57 UTC, 0 replies.
- (Unknown) - posted by 柳胜兵 <co...@gmail.com> on 2012/05/10 10:37:00 UTC, 0 replies.
- Re: - posted by Markus Jelsma <ma...@openindex.io> on 2012/05/10 10:43:12 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1306) Commit after finished writing to solr index - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 12:09:46 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/10 12:11:45 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/10 12:13:47 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1325) HostDB for Nutch - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/10 13:39:56 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1026) Strip UTF-8 non-character codepoints - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 14:47:48 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1026) Strip UTF-8 non-character codepoints - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/10 15:41:51 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1306) Commit after finished writing to solr index - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 18:46:52 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1077) Nutch 2 DbUpdateMapper throws ArrayOutOfBoundsException when running update - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/10 20:26:51 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1363) Make parsing in FetcherJob actually work. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/10 22:06:50 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1363) Make parsing in FetcherJob actually work. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/10 22:06:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1362) Fix error handling of urls with empty fields - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 11:09:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1362) Fix error handling of urls with empty fields - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 11:11:54 UTC, 2 replies.
- [jira] [Closed] (NUTCH-1077) Nutch 2 DbUpdateMapper throws ArrayOutOfBoundsException when running update - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 11:13:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/11 11:31:55 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1362) Fix error handling of urls with empty fields - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 11:51:52 UTC, 0 replies.
- [jira] [Created] (NUTCH-1366) speed up indexing by eliminating the indexreducer - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 14:39:50 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1366) speed up indexing by eliminating the indexreducer - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/11 14:41:50 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1366) speed up indexing by eliminating the indexreducer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/11 16:58:52 UTC, 2 replies.
- [jira] [Created] (NUTCH-1367) Port ParserChecker to Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/12 02:05:53 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #264 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/12 07:01:55 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1323) AjaxNormalizer - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/05/12 14:01:48 UTC, 3 replies.
- Jenkins build is back to normal : nutch-trunk-maven #265 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/13 07:02:26 UTC, 0 replies.
- Unsubscribe me please - posted by arul velusamy <ar...@gmail.com> on 2012/05/14 09:51:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1367) Port ParserChecker to Nutchgora - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/14 15:31:51 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1366) speed up indexing by eliminating the indexreducer - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/14 16:23:51 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1367) Port ParserChecker to Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/16 12:33:02 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1367) Port ParserChecker to Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/16 12:33:03 UTC, 0 replies.
- [jira] [Commented] (NUTCH-923) Multilingual support for Solr-index-mapping - posted by "Markus Agethle (JIRA)" <ji...@apache.org> on 2012/05/16 17:25:02 UTC, 2 replies.
- [jira] [Created] (NUTCH-1369) Improve ParserChecker in Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/17 00:49:06 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1369) Improve ParserChecker in Nutchgora - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/17 00:57:06 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/17 12:51:07 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/17 13:15:06 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #273 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/19 23:49:11 UTC, 0 replies.
- Jenkins build is back to normal : nutch-trunk-maven #274 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/20 07:01:44 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #259 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/21 06:07:44 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1847 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/21 06:08:58 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1361) Fix mishandling of malformed urls in generator job - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/21 18:56:41 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1361) Fix mishandling of malformed urls in generator job - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/21 18:58:41 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1364) Add a counter for malformed urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/21 20:27:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1364) Add a counter for malformed urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/21 20:27:41 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1359) Add raw_headers support - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/21 20:29:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1100) SolrDedup broken - posted by "Ashish Shrowty (JIRA)" <ji...@apache.org> on 2012/05/21 20:55:41 UTC, 0 replies.
- Re: Bug in Trunk Generator mapper? - posted by Julien Nioche <li...@gmail.com> on 2012/05/21 21:32:26 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #260 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/22 06:20:02 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1848 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/22 06:32:42 UTC, 0 replies.
- 1.5 RC2 - posted by Julien Nioche <li...@gmail.com> on 2012/05/22 11:15:49 UTC, 11 replies.
- [jira] [Created] (NUTCH-1370) Expose exact number of urls injected @runtime - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 11:24:41 UTC, 0 replies.
- Re: svn commit: r1341365 - /nutch/trunk/ivy/mvn.template - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/05/22 11:33:05 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1370) Expose exact number of urls injected @runtime - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/05/22 11:38:40 UTC, 0 replies.
- [jira] [Created] (NUTCH-1371) Replace Ivy with Maven Ant tasks - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/05/22 12:12:40 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1371) Replace Ivy with Maven Ant tasks - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/05/22 12:14:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1372) Improve execution of normalisers - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 12:36:40 UTC, 0 replies.
- [jira] [Created] (NUTCH-1373) Implement consistent execution of normalising and filtering in Generator - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 12:40:40 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1372) Improve execution of normalisers - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 12:42:40 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1372) Improve execution of normalisers - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 12:42:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1374) Workaround for license headers - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 13:58:40 UTC, 0 replies.
- [jira] [Created] (NUTCH-1375) extract main content of a html file - posted by "behnam nikbakht (JIRA)" <ji...@apache.org> on 2012/05/22 14:20:41 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1375) extract main content of a html file - posted by "behnam nikbakht (JIRA)" <ji...@apache.org> on 2012/05/22 14:20:41 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1375) extract main content of a html file - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/05/22 17:09:41 UTC, 0 replies.
- [jira] [Updated] (NUTCH-879) URL-s getting lost - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 20:59:42 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1301) Index job resume switch to resume a failed job - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 21:21:40 UTC, 1 replies.
- Apache Nutch release 1.5 RC2 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/05/22 21:59:24 UTC, 8 replies.
- [jira] [Created] (NUTCH-1376) Add description parameter to every ant task - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/05/22 23:34:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-879) URL-s getting lost - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/23 10:45:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1377) Add option to index via CloudSolrServer instead - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/23 15:50:40 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1377) Add option to index via CloudSolrServer instead - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/05/23 15:50:41 UTC, 0 replies.
- [jira] [Created] (NUTCH-1378) HostDb NullPointerException - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/23 16:46:41 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1378) HostDb NullPointerException - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/23 16:48:41 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1378) HostDb NullPointerException - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/23 16:48:41 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1378) HostDb NullPointerException - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/05/24 07:26:41 UTC, 0 replies.
- How to run Nutch after build - posted by Jabol Stéphane <st...@gmail.com> on 2012/05/24 16:44:19 UTC, 1 replies.
- New Nutch Committer and PMC member : Sebastian Nagel - posted by Julien Nioche <li...@gmail.com> on 2012/05/25 17:56:35 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #267 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/29 06:04:41 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1855 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/29 06:06:08 UTC, 0 replies.
- stackoverflow / stackexchange for user problems - posted by Ferdy Galema <fe...@kalooga.com> on 2012/05/29 11:45:21 UTC, 3 replies.
- Build failed in Jenkins: Nutch-nutchgora #268 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/30 06:05:35 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1856 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/30 06:07:14 UTC, 0 replies.
- [jira] [Created] (NUTCH-1379) NPE when reprUrl is null in ParseUtil - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/30 11:26:23 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1379) NPE when reprUrl is null in ParseUtil - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/30 11:28:23 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-1379) NPE when reprUrl is null in ParseUtil - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/30 11:28:23 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1379) NPE when reprUrl is null in ParseUtil - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/30 11:28:23 UTC, 1 replies.
- Using Nutch for Web Site Mirroring - posted by Vlad Paunescu <vl...@gmail.com> on 2012/05/30 14:39:08 UTC, 0 replies.
- [VOTE] Apache Nutch release 1.5 RC3 - posted by lewis john mcgibbney <le...@apache.org> on 2012/05/30 22:59:59 UTC, 11 replies.
- Build failed in Jenkins: Nutch-nutchgora #269 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/31 06:07:57 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1857 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/05/31 06:10:00 UTC, 0 replies.
- [VOTE] Apache Nutch 1.5 release-1.5RC4 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/05/31 22:37:52 UTC, 1 replies.