- Build failed in Jenkins: Nutch-nutchgora #420 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/01 05:16:41 UTC, 1 reply.
- [jira] [Commented] (NUTCH-1499) Usage of multiple ipv4 addresses and network cards on fetcher machines - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/12/01 13:21:58 UTC, 1 reply.
- [jira] [Commented] (NUTCH-842) AutoGenerate WebPage code - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/01 17:15:58 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #421 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/02 05:17:30 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #423 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/04 05:13:05 UTC, 0 replies.
- Re: [VOTE] Apache Nutch 1.6 Release Candidate - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/12/05 15:34:16 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1038) Port IndexingFiltersChecker to 2.0 - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/12/05 22:53:59 UTC, 2 replies.
- [jira] [Created] (NUTCH-1501) Harmonize behavior of parsechecker and indexchecker - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/12/05 23:09:58 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #424 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/06 05:17:49 UTC, 0 replies.
- [RESULT] WAS: [VOTE] Apache Nutch 1.6 Release Candidate - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/12/06 15:27:40 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/06 15:49:09 UTC, 6 replies.
- [jira] [Created] (NUTCH-1502) Test for CrawlDatum state transitions - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/12/06 22:41:09 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1477) NPE when injecting with DataFileAvroStore - posted by "Alfonso Nishikawa (JIRA)" <ji...@apache.org> on 2012/12/07 03:01:21 UTC, 3 replies.
- Build failed in Jenkins: Nutch-nutchgora #425 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/07 05:12:55 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1477) NPE when injecting with DataFileAvroStore - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/07 11:41:21 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1038) Port IndexingFiltersChecker to 2.0 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/07 12:25:22 UTC, 2 replies.
- [jira] [Comment Edited] (NUTCH-1477) NPE when injecting with DataFileAvroStore - posted by "Alfonso Nishikawa (JIRA)" <ji...@apache.org> on 2012/12/07 17:47:22 UTC, 3 replies.
- Jenkins build is back to normal : Nutch-nutchgora #426 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/08 05:10:25 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1394) backport NUTCH-1232 Remove site field from index-basic - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/08 21:29:21 UTC, 0 replies.
- [ANNOUNCE] Apache Nutch 1.6 Released - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/12/08 22:50:12 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1183) Summary task for adding command line usage instructions to webgraph classes - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/08 22:59:21 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1140) index-more plugin, resetTitle method creates multiple values in the Title field - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/08 23:03:23 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1409) Remove deprecated properties in nutch-default.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/08 23:11:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-840) Port tests from parse-html to parse-tika - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/09 00:49:21 UTC, 2 replies.
- [jira] [Commented] (NUTCH-840) Port tests from parse-html to parse-tika - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:27:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-891) Nutch build should not depend on unversioned local deps - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:39:21 UTC, 0 replies.
- [jira] [Closed] (NUTCH-807) JSParseFilter produces malformed URL - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:41:21 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-62) Add html META tag information into metaData in index-more plugin - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:53:22 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1267) urlmeta to delegate indexing to index-metadata - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/09 08:55:20 UTC, 1 reply.
- [jira] [Commented] (NUTCH-1232) Remove host field from index-basic - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/12/10 05:35:21 UTC, 0 replies.
- [jira] [Closed] (NUTCH-412) plugin to parse the feed-url (rss/atom) of a blog - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/10 17:51:21 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-648) debian style autocomplete - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/10 17:59:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1314) Impose a limit on the length of outlink target urls - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/10 18:03:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-710) Support for rel="canonical" attribute - posted by "zm (JIRA)" <ji...@apache.org> on 2012/12/11 08:17:23 UTC, 1 reply.
- [jira] [Created] (NUTCH-1503) Configuration properties not in sync between FetcherReducer and nutch-default.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/11 21:13:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1503) Configuration properties not in sync between FetcherReducer and nutch-default.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/11 21:15:24 UTC, 2 replies.
- [Nutch Wiki] Update of "NutchPropertiesCompleteList" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2012/12/12 00:15:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1503) Configuration properties not in sync between FetcherReducer and nutch-default.xml - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/12/12 00:57:21 UTC, 2 replies.
- Jenkins build is back to normal : nutch-trunk-maven #523 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/12 06:03:44 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/12/12 12:13:45 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchPropertiesCompleteList" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/12/12 12:15:08 UTC, 1 reply.
- More than one way to skin a cat - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/12/12 13:06:55 UTC, 0 replies.
- Message in MoreIndexingFilter - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/12/12 20:18:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-956) solrindex issues - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/12 20:36:20 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1503) Configuration properties not in sync between FetcherReducer and nutch-default.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/12 20:54:20 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Articles" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/12/13 16:12:51 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1087) Deprecate crawl command and replace with example script - posted by "Tristan Buckner (JIRA)" <ji...@apache.org> on 2012/12/14 03:00:12 UTC, 0 replies.
- Additional patch for NUTCH-1087 - posted by Tristan Buckner <bu...@adobe.com> on 2012/12/14 19:46:29 UTC, 5 replies.
- [jira] [Created] (NUTCH-1504) Pluggable url partitioner - posted by "Sourajit Basak (JIRA)" <ji...@apache.org> on 2012/12/17 18:40:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1504) Pluggable url partitioner - posted by "Sourajit Basak (JIRA)" <ji...@apache.org> on 2012/12/17 18:40:14 UTC, 2 replies.
- Comparing Nutch and Common Crawl - posted by Julien Nioche <li...@gmail.com> on 2012/12/17 21:53:42 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1347) fetcher politeness related to map-reduce - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/19 14:57:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1331) limit crawler to defined depth - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/19 15:01:13 UTC, 3 replies.
- [jira] [Updated] (NUTCH-710) Support for rel="canonical" attribute - posted by "zm (JIRA)" <ji...@apache.org> on 2012/12/20 08:25:13 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-710) Support for rel="canonical" attribute - posted by "zm (JIRA)" <ji...@apache.org> on 2012/12/20 08:27:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #439 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/21 05:20:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-1505) java.lang.IllegalArgumentException during updatedb - posted by "Stanley Orlenko (JIRA)" <ji...@apache.org> on 2012/12/21 10:31:15 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1506) Add UPDATE action to NutchIndexAction - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 11:47:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1506) Add UPDATE action to NutchIndexAction - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 11:47:12 UTC, 2 replies.
- [jira] [Created] (NUTCH-1506) Add UPDATE action to NutchIndexAction - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 11:47:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-1507) Remove FetcherOutput - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 11:53:12 UTC, 0 replies.
- 1.8 in Jira - posted by Markus Jelsma <ma...@openindex.io> on 2012/12/21 11:54:04 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1508) Port limit crawler to defined depth to 2.x - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 12:29:12 UTC, 1 reply.
- [jira] [Created] (NUTCH-1508) Port limit crawler to defined depth to 23 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 12:29:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1508) Port limit crawler to defined depth to 2.x - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 12:29:13 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1331) limit crawler to defined depth - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 12:37:13 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1331) limit crawler to defined depth - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 12:39:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-1509) Implement read/write in NutchField - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 13:07:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-1510) Upgrade to Hadoop 1.1.1 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 14:31:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1510) Upgrade to Hadoop 1.1.1 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 14:33:13 UTC, 1 reply.
- [jira] [Commented] (NUTCH-1510) Upgrade to Hadoop 1.1.1 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 14:41:14 UTC, 8 replies.
- [jira] [Updated] (NUTCH-1509) Implement read/write in NutchField - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 15:19:13 UTC, 1 reply.
- [jira] [Updated] (NUTCH-1507) Remove FetcherOutput - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 15:25:14 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1507) Remove FetcherOutput - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/12/21 16:35:12 UTC, 5 replies.
- [jira] [Commented] (NUTCH-1509) Implement read/write in NutchField - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/21 17:05:12 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #440 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/22 05:17:38 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1284) Add site fetcher.max.crawl.delay as log output by default. - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/22 11:53:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1284) Add site fetcher.max.crawl.delay as log output by default. - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/22 11:53:13 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1284) Add site fetcher.max.crawl.delay as log output by default. - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/22 11:55:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1118) JUnit test for index-basic - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/23 01:51:12 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1118) JUnit test for index-basic - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/12/23 18:46:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1118) JUnit test for index-basic - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/12/23 19:10:12 UTC, 1 reply.
- [jira] [Updated] (NUTCH-1119) JUnit test for index-static - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/23 21:34:12 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #442 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/24 05:11:21 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #444 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/26 05:18:38 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2057 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/26 05:22:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing. - posted by "Emmanuel Colin (JIRA)" <ji...@apache.org> on 2012/12/26 14:32:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2058 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/27 05:27:38 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #445 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/27 05:27:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1510) Upgrade to Hadoop 1.1.1 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/27 13:38:12 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1245) URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/12/27 13:54:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2059 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/28 05:10:53 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #447 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/29 05:09:54 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2060 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/29 05:13:11 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1224) Migrate FreeGenerator to MapReduce API - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/29 11:08:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1127) JUnit test for urlfilter-validator - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2012/12/29 13:10:12 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #448 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/30 05:11:53 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2061 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/30 05:11:53 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series - posted by "J. Gobel (JIRA)" <ji...@apache.org> on 2012/12/30 23:54:12 UTC, 5 replies.
- [jira] [Comment Edited] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series - posted by "J. Gobel (JIRA)" <ji...@apache.org> on 2012/12/30 23:54:12 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1478) Parse-metatags and index-metadata plugin for Nutch 2.x series - posted by "kiran (JIRA)" <ji...@apache.org> on 2012/12/31 04:26:13 UTC, 1 reply.
- Build failed in Jenkins: Nutch-trunk #2062 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/31 05:15:11 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #449 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/12/31 05:16:41 UTC, 0 replies.
- Problems with activation.jar - posted by Jorge Moreira <j....@gmail.com> on 2012/12/31 20:00:44 UTC, 0 replies.