You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (NUTCH-1525) Generator to record external links even when db.ignore.external.links set to true - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/01 04:39:13 UTC, 2 replies.
- Build failed in Jenkins: Nutch-trunk #2107 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/01 05:10:55 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #482 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/01 05:20:08 UTC, 0 replies.
- Intermittent Errors with Nutch-trunk build - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/02/01 05:20:55 UTC, 0 replies.
- RE: Outlinks in parse filter - posted by Markus Jelsma <ma...@openindex.io> on 2013/02/01 15:37:42 UTC, 2 replies.
- Jenkins build is back to normal : Nutch-trunk #2108 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/02 05:20:36 UTC, 0 replies.
- Re: Addition to Pluggable Backends - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/02/03 00:41:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1521) CrawlDbFilter pass null url to urlNormailzers - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/04 07:36:13 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "ErrorMessagesInNutch2" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/05 00:57:28 UTC, 3 replies.
- Build failed in Jenkins: Nutch-trunk #2112 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/06 05:19:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1253) Incompatible neko and xerces versions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/06 22:17:13 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1253) Incompatible neko and xerces versions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/06 22:17:13 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FAQ" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/07 03:49:59 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2113 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/07 05:18:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1253) Incompatible neko and xerces versions - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/07 05:21:39 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchGotchas" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/07 22:30:48 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-trunk #2114 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/08 05:20:24 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1511) Metadata in MYSQL updated with 'garbage' - posted by "Roland (JIRA)" <ji...@apache.org> on 2013/02/08 22:25:14 UTC, 4 replies.
- [jira] [Comment Edited] (NUTCH-1511) Metadata in MYSQL updated with 'garbage' - posted by "Roland (JIRA)" <ji...@apache.org> on 2013/02/08 22:27:12 UTC, 2 replies.
- [jira] [Created] (NUTCH-1527) Port nutch-elasticsearch-indexer to Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/09 19:45:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/09 19:47:12 UTC, 0 replies.
- [jira] [Created] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/09 19:53:12 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/10 03:09:49 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1390) readdb -url $url throws NPE with gora-cassandra - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/11 02:02:19 UTC, 0 replies.
- FW: [GSoC Mentors] Google Summer of Code 2013 - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2013/02/11 21:24:32 UTC, 0 replies.
- [jira] [Created] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true) - posted by "Edward Ackroyd (JIRA)" <ji...@apache.org> on 2013/02/12 19:39:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true) - posted by "Roland (JIRA)" <ji...@apache.org> on 2013/02/12 20:07:13 UTC, 6 replies.
- Nutch JAVA Application - posted by Shann <st...@mailoo.org> on 2013/02/12 21:25:55 UTC, 6 replies.
- [jira] [Comment Edited] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true) - posted by "Roland (JIRA)" <ji...@apache.org> on 2013/02/12 23:13:14 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1530) Umlauts (üäö) garbled when fetch and parse in separate calls (OK when fetcher.parse is true) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/13 03:47:14 UTC, 0 replies.
- [jira] [Created] (NUTCH-1531) URL filtering takes long time for very long URLs - posted by "Fırat KÜÇÜK (JIRA)" <ji...@apache.org> on 2013/02/13 09:20:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1531) URL filtering takes long time for very long URLs - posted by "Fırat KÜÇÜK (JIRA)" <ji...@apache.org> on 2013/02/13 09:22:16 UTC, 3 replies.
- [jira] [Comment Edited] (NUTCH-1531) URL filtering takes long time for very long URLs - posted by "Fırat KÜÇÜK (JIRA)" <ji...@apache.org> on 2013/02/13 09:28:12 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1531) URL filtering takes long time for very long URLs - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/02/13 11:46:21 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #496 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/15 05:12:16 UTC, 0 replies.
- slf4j issue with nutch 2.x over hadoop 1.1.1 - posted by kaveh minooie <ka...@plutoz.com> on 2013/02/16 01:53:56 UTC, 1 replies.
- [jira] [Created] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/16 02:59:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/16 03:39:12 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-nutchgora #497 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/16 05:18:06 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "IndexStructure" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/16 05:45:51 UTC, 0 replies.
- [jira] [Created] (NUTCH-1533) Implement getPrevModifiedTime(), setPrevModifiedTime(), getBatchId() and setBatchId() accessors in o.a.n.storage.WebPage - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/16 06:07:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1532) Replace 'segment' mapping field with batchId - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/16 06:07:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1528) Port nutch-mongodb-indexer to Nutch - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/02/16 11:03:13 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1047) Pluggable indexing backends - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2013/02/16 18:37:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1486) schema-solr4.xml does not work with Solr 4.1.0 - posted by "Bharat Shrinevas (JIRA)" <ji...@apache.org> on 2013/02/17 04:13:13 UTC, 1 replies.
- Build failed in Jenkins: Nutch-trunk #2124 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/18 05:22:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1420) Get rid of the dreaded � - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/18 23:51:13 UTC, 5 replies.
- [jira] [Resolved] (NUTCH-1420) Get rid of the dreaded � - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/19 01:47:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/19 04:27:14 UTC, 8 replies.
- Build failed in Jenkins: Nutch-nutchgora #500 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/19 07:58:51 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2125 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/19 08:05:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-1534) cassandra/hector exception: InvalidRequestException(why:column name must not be empty) - posted by "Roland (JIRA)" <ji...@apache.org> on 2013/02/19 11:15:13 UTC, 0 replies.
- [jira] [Commented] (NUTCH-961) Expose Tika's boilerpipe support - posted by "kiran (JIRA)" <ji...@apache.org> on 2013/02/19 18:23:12 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1534) cassandra/hector exception: InvalidRequestException(why:column name must not be empty) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/19 20:13:12 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1534) cassandra/hector exception: InvalidRequestException(why:column name must not be empty) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/19 20:15:13 UTC, 6 replies.
- NUTCH-1047: Pluggable indexing backends - posted by Julien Nioche <li...@gmail.com> on 2013/02/19 22:01:23 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1047) Pluggable indexing backends - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/20 04:07:12 UTC, 7 replies.
- Build failed in Jenkins: Nutch-nutchgora #501 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/20 10:44:35 UTC, 0 replies.
- Configuration improvements to GeneratorJob - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/02/20 22:05:01 UTC, 8 replies.
- Build failed in Jenkins: Nutch-nutchgora #502 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/21 06:52:30 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1031) Delegate parsing of robots.txt to crawler-commons - posted by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/02/21 09:46:14 UTC, 6 replies.
- [jira] [Assigned] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/22 04:26:12 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #503 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/22 05:07:37 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1373) Implement consistent execution of normalising and filtering in Generator - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/22 08:40:14 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #504 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/22 08:54:31 UTC, 0 replies.
- [jira] [Created] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Adam Pioch (JIRA)" <ji...@apache.org> on 2013/02/22 13:06:12 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/02/22 20:02:13 UTC, 3 replies.
- [Nutch Wiki] Trivial Update of "Presentations" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2013/02/23 02:18:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2129 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/23 05:11:21 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #505 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/23 05:11:21 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Adam Pioch (JIRA)" <ji...@apache.org> on 2013/02/23 14:46:12 UTC, 2 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Adam Pioch (JIRA)" <ji...@apache.org> on 2013/02/23 14:48:12 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/02/23 15:08:12 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2013/02/23 15:08:12 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #506 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/24 06:27:03 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2130 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/24 06:35:46 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Adam Pioch (JIRA)" <ji...@apache.org> on 2013/02/25 11:36:15 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1535) Crawl crashes with java.io.exception - posted by "Adam Pioch (JIRA)" <ji...@apache.org> on 2013/02/25 11:40:12 UTC, 0 replies.
- Re: dev Digest 25 Feb 2013 02:27:44 -0000 Issue 1555 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2013/02/25 22:17:58 UTC, 2 replies.
- Eclipse Error - posted by Danilo Fernandes <fe...@gmail.com> on 2013/02/26 02:12:19 UTC, 5 replies.
- Build failed in Jenkins: Nutch-nutchgora #508 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 05:11:44 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2132 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 05:11:44 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/26 06:36:14 UTC, 1 replies.
- [jira] [Created] (NUTCH-1536) Ant build file has hardcoded conf dir location - posted by "zm (JIRA)" <ji...@apache.org> on 2013/02/26 10:18:12 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1536) Ant build file has hardcoded conf dir location - posted by "zm (JIRA)" <ji...@apache.org> on 2013/02/26 10:20:12 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (NUTCH-1536) Ant build file has hardcoded conf dir location - posted by "zm (JIRA)" <ji...@apache.org> on 2013/02/26 10:20:13 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1186) FreeGenerator always normalizes - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/02/26 12:06:12 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1186) FreeGenerator always normalizes - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/26 19:38:12 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/26 19:58:13 UTC, 4 replies.
- [jira] [Resolved] (NUTCH-1536) Ant build file has hardcoded conf dir location - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/26 20:56:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #509 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 21:19:42 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #2133 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 21:19:52 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #2134 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 22:42:51 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #510 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2013/02/26 22:58:46 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1529) Port nutch-mongdb-parser to trunk - posted by "lufeng (JIRA)" <ji...@apache.org> on 2013/02/27 03:51:11 UTC, 1 replies.
- [jira] [Created] (NUTCH-1537) Legacy metadata package needs to take advantage of Apache Tika metadata package more. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/02/28 00:11:12 UTC, 0 replies.
- Crawling - posted by vivek <vi...@gmail.com> on 2013/02/28 08:26:38 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1537) Legacy metadata package needs to take advantage of Apache Tika metadata package more. - posted by "kiran (JIRA)" <ji...@apache.org> on 2013/02/28 17:01:12 UTC, 1 replies.