You are viewing a plain text version of this content. The canonical link for it is here.
- Build failed in Jenkins: nutch-trunk-maven #337 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/01 07:31:50 UTC, 0 replies.
- [jira] [Created] (NUTCH-1416) Can not update the index - posted by "Jianyun He (JIRA)" <ji...@apache.org> on 2012/07/01 10:10:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1416) Can not update the index - posted by "Jianyun He (JIRA)" <ji...@apache.org> on 2012/07/01 14:54:47 UTC, 1 replies.
- Add me to the Mailing list - posted by michael F <mi...@bionic8.com> on 2012/07/01 16:48:57 UTC, 1 replies.
- nucth and mahout integration - posted by Alexander Aristov <al...@gmail.com> on 2012/07/01 21:02:51 UTC, 2 replies.
- Jenkins build is back to normal : Nutch-nutchgora #297 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/02 06:20:21 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1885 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/02 06:32:45 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #338 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/02 07:02:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1087) Deprecate crawl command and replace with example script - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/02 14:14:22 UTC, 6 replies.
- [jira] [Assigned] (NUTCH-1087) Deprecate crawl command and replace with example script - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/02 14:16:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1415) release packages to contain top level folder apache-nutch-x.x - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/02 14:29:22 UTC, 1 replies.
- Re: Nutch Author, Publication, and Religion Detection - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/02 14:32:39 UTC, 2 replies.
- [jira] [Created] (NUTCH-1417) Remove o.a.n.metadata.Office - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/02 14:37:21 UTC, 0 replies.
- Re: [VOTE] Apache Nutch 2.0 Release Candidate #3 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/02 19:49:36 UTC, 22 replies.
- Re: [VOTE] Apache Nutch 1.5.1 Release Candidate - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/02 20:01:19 UTC, 0 replies.
- [jira] [Created] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ - posted by "Arijit Mukherjee (JIRA)" <ji...@apache.org> on 2012/07/02 20:07:21 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1418) error parsing robots rules- can't decode path: /wiki/Wikipedia%3Mediation_Committee/ - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2012/07/02 20:35:21 UTC, 2 replies.
- Jenkins build is back to normal : nutch-trunk-maven #339 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/03 10:33:49 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1415) release packages to contain top level folder apache-nutch-x.x - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/03 11:05:10 UTC, 0 replies.
- [jira] [Created] (NUTCH-1419) parsechecker and indexchecker to report protocol status - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/03 14:47:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1419) parsechecker and indexchecker to report protocol status - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/03 15:41:19 UTC, 0 replies.
- [VOTE] Apache Nutch 1.5.1 RC#3 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/03 20:42:25 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/04 10:34:35 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/04 10:36:34 UTC, 2 replies.
- [jira] [Created] (NUTCH-1420) Get rid of the dreaded � - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/04 14:58:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1420) Get rid of the dreaded � - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/04 14:58:35 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/04 15:08:34 UTC, 5 replies.
- [jira] [Commented] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/04 15:24:35 UTC, 6 replies.
- [jira] [Comment Edited] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/04 15:42:34 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/04 22:33:34 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #299 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/05 06:04:56 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1887 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/05 06:06:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/05 11:25:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1233) Rely on Tika for outlink extraction - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/05 11:33:35 UTC, 0 replies.
- [jira] [Created] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/05 11:47:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/05 11:59:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1421) RegexURLNormalizer to only skip rules with invalid patterns - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/05 12:33:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1414) Date extraction parse filter - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/05 17:59:33 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1405) Allow to overwrite CrawlDatum's with injected entries - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/05 18:59:34 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #342 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/06 07:53:01 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #300 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/06 07:58:24 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1888 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/06 08:09:40 UTC, 0 replies.
- [jira] [Created] (NUTCH-1422) reset signature for redirects - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/06 16:05:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1422) reset signature for redirects - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/06 16:09:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1414) Date extraction parse filter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/06 17:11:34 UTC, 3 replies.
- [Nutch Wiki] Trivial Update of "ContributorsGroup" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/07/06 22:36:15 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "AdminGroup" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/07/06 22:36:29 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1411) nutchgora fetcher.store.content does not work - posted by "Alexander Kingson (JIRA)" <ji...@apache.org> on 2012/07/06 22:54:34 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1411) nutchgora fetcher.store.content does not work - posted by "Alexander Kingson (JIRA)" <ji...@apache.org> on 2012/07/06 22:56:34 UTC, 0 replies.
- [Nutch Wiki] Update of "Support" by subhankarray - posted by Apache Wiki <wi...@apache.org> on 2012/07/07 02:45:26 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #343 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/07 07:01:39 UTC, 0 replies.
- [ANNOUNCEMENT] Apache Nutch v2.0 Release - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/08 00:37:22 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #344 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/08 07:02:01 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #345 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/09 07:02:21 UTC, 0 replies.
- [PROPOSAL] Rename branch nutchgora into 2.x - posted by Julien Nioche <li...@gmail.com> on 2012/07/09 12:37:51 UTC, 4 replies.
- [jira] [Resolved] (NUTCH-1306) Add option to not commit and clarify existing solr.commit.size - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:40:34 UTC, 0 replies.
- [jira] [Created] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:42:40 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:44:35 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1423) Remove unused fields in LanguageIndexingFilter - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:44:35 UTC, 0 replies.
- [jira] [Created] (NUTCH-1424) fix fetcher timelimit logging - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:48:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1424) fix fetcher timelimit logging - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:50:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1424) fix fetcher timelimit logging - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:50:36 UTC, 0 replies.
- [jira] [Created] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:54:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:54:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1425) DbUpdaterJob declares PREV_SIGNATURE on input twice - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:54:34 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1025) Add option not to commit to Solr - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 13:57:34 UTC, 0 replies.
- [jira] [Created] (NUTCH-1426) HostDb close() should close store instead of flush - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 14:15:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1426) HostDb close() should close store instead of flush - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 14:17:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1426) HostDb close() should close store instead of flush - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 14:19:33 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1411) nutchgora fetcher.store.content does not work - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 14:46:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-628) Host database to keep track of host-level information - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 15:54:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1233) Rely on Tika for outlink extraction - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/09 16:32:33 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1411) nutchgora fetcher.store.content does not work - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 17:23:36 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1411) nutchgora fetcher.store.content does not work - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/09 17:23:36 UTC, 0 replies.
- [Nutch Wiki] Update of "FAQ" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2012/07/09 17:35:44 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1360) Suport the storing of IP address connected to when web crawling - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/09 17:42:35 UTC, 0 replies.
- [RESULT][VOTE] Apache Nutch 1.5.1 RC#3 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/09 19:10:31 UTC, 0 replies.
- [DONE] Renamed branch nutchgora into 2.x - posted by Julien Nioche <li...@gmail.com> on 2012/07/10 10:50:56 UTC, 1 replies.
- Build failed in Jenkins: nutch-trunk-maven #346 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/10 12:05:20 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #304 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/10 12:35:42 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1087) Deprecate crawl command and replace with example script - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/10 15:14:43 UTC, 3 replies.
- [jira] [Created] (NUTCH-1427) Reuse SelectorEntry in Generator. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 16:23:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1427) Reuse SelectorEntry in Generator. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 16:27:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1427) Reuse SelectorEntry in Generator. - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 16:27:35 UTC, 0 replies.
- [ANNOUNCEMENT] Apache Nutch v1.5.1 Released - posted by lewis john mcgibbney <le...@apache.org> on 2012/07/10 16:40:43 UTC, 2 replies.
- [jira] [Created] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 17:00:40 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 17:02:35 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1428) GeneratorMapper should not initialize filters/normalizers when they are disabled - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/10 17:02:35 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #347 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/10 19:45:59 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/07/10 22:46:20 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1328) a problem with regex-normalize.xml - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/10 23:10:35 UTC, 1 replies.
- [jira] [Updated] (NUTCH-706) Url regex normalizer - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/10 23:22:34 UTC, 1 replies.
- [jira] [Closed] (NUTCH-1328) a problem with regex-normalize.xml - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/10 23:22:34 UTC, 0 replies.
- [jira] [Created] (NUTCH-1429) CrawlDBReader to dump on exception and HTTP code - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/10 23:24:35 UTC, 0 replies.
- Build failed in Jenkins: nutch-trunk-maven #348 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/11 00:16:34 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #305 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/11 06:07:36 UTC, 0 replies.
- Jenkins build is back to normal : nutch-trunk-maven #349 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/11 07:04:58 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #306 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/12 07:08:43 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #307 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/13 06:15:11 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #308 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/14 06:05:22 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1896 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/14 06:07:13 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #309 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/15 06:06:02 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1897 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/15 06:18:06 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #310 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/16 06:06:32 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #311 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/17 06:09:37 UTC, 0 replies.
- [jira] [Created] (NUTCH-1430) Freegenerator records overwrite CrawlDB records with AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/17 14:02:36 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1430) Freegenerator records overwrite CrawlDB records with AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/17 14:16:33 UTC, 2 replies.
- [jira] [Assigned] (NUTCH-1430) Freegenerator records overwrite CrawlDB records with AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/17 14:30:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1430) Freegenerator records overwrite CrawlDB records with AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/17 14:36:34 UTC, 0 replies.
- Apache Nutch being used at National Snow and Ice Data Center: ESIP Federation - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/17 21:24:20 UTC, 6 replies.
- Build failed in Jenkins: Nutch-nutchgora #312 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/18 06:06:27 UTC, 0 replies.
- [jira] [Created] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/18 12:19:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/18 12:23:33 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1431) Introduce link 'distance' and add configurable max distance in the generator - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/18 12:45:41 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/18 16:02:33 UTC, 4 replies.
- [jira] [Commented] (NUTCH-1365) Fix crawlId functionalilty by making using of new gora configuration - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/18 16:25:35 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #313 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/19 06:11:07 UTC, 0 replies.
- [jira] [Created] (NUTCH-1432) property storage.schema does not work anymore, should be storage.schema.webpage and storage.schema.host - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/19 09:35:34 UTC, 0 replies.
- [jira] [Created] (NUTCH-1433) Upgrade to Tika 1.2 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/19 14:44:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1433) Upgrade to Tika 1.2 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2012/07/19 17:58:36 UTC, 2 replies.
- Fwd: Call for Papers for ApacheCon Europe 2012 now open! - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/07/20 01:08:15 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #314 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/20 06:09:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2 - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 10:13:35 UTC, 8 replies.
- [jira] [Created] (NUTCH-1434) Indexer to delete robots noIndex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 13:37:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1434) Indexer to delete robots noIndex - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 13:41:34 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 13:52:34 UTC, 3 replies.
- [jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 13:52:34 UTC, 9 replies.
- [jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/20 15:28:35 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 15:54:35 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/20 16:24:35 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1904 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/22 06:05:58 UTC, 0 replies.
- [jira] [Created] (NUTCH-1435) Host jobs throw NullPointerException with MySQL - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/22 14:45:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1435) Host jobs throw NullPointerException with MySQL - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/22 14:49:35 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RunNutchInEclipse" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2012/07/22 21:24:05 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #1905 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/23 06:17:55 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1435) Host jobs throw NullPointerException with MySQL - posted by "Joan Espasa Arxer (JIRA)" <ji...@apache.org> on 2012/07/23 08:26:34 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1435) Host jobs throw NullPointerException with MySQL - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/23 13:13:39 UTC, 0 replies.
- [jira] [Created] (NUTCH-1436) bin/nutch absent in zip package - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/23 21:59:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1436) bin/nutch absent in zip package - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2012/07/23 22:03:36 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "RunNutchInEclipse" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2012/07/24 16:13:44 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "applepear (JIRA)" <ji...@apache.org> on 2012/07/24 23:59:34 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1238) Fetcher throughput threshold must start before feeder finished - posted by "applepear (JIRA)" <ji...@apache.org> on 2012/07/25 01:25:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/25 14:51:35 UTC, 0 replies.
- [jira] [Created] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/25 14:51:35 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/25 14:53:33 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-1437) HostInjectorJob to accept lines with or without protocol - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/25 14:53:34 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #1908 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/26 06:06:35 UTC, 0 replies.
- [jira] [Created] (NUTCH-1438) ParserJob support for option -reparse - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/26 14:04:33 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1438) ParserJob support for option -reparse - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/26 14:08:33 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1438) ParserJob support for option -reparse - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/26 14:08:35 UTC, 0 replies.
- [Nutch Wiki] Update of "Presentations" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2012/07/26 17:00:51 UTC, 1 replies.
- [Nutch Wiki] Trivial Update of "Presentations" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2012/07/26 17:02:05 UTC, 0 replies.
- [jira] [Created] (NUTCH-1439) Define boost field as type float in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/26 21:18:35 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1439) Define boost field as type float in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/26 21:22:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1439) Define boost field as type float in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/26 21:40:34 UTC, 1 replies.
- Jenkins build is back to normal : Nutch-trunk #1909 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2012/07/27 06:20:18 UTC, 0 replies.
- [jira] [Created] (NUTCH-1440) reconfigure non-existent stopwords_en.txt in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/27 13:37:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1440) reconfigure non-existent stopwords_en.txt in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/27 13:57:34 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1440) reconfigure non-existent stopwords_en.txt in schema-solr4.xml - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/27 13:57:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1440) reconfigure non-existent stopwords_en.txt in schema-solr4.xml - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/07/27 14:23:35 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-1376) Add description parameter to every ant task - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/29 15:04:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1376) Add description parameter to every ant task - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/29 15:06:33 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1376) Add description parameter to every ant task - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/07/29 15:08:34 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1417) Remove o.a.n.metadata.Office - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/29 15:15:33 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1417) Remove o.a.n.metadata.Office - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/29 15:15:34 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1417) Remove o.a.n.metadata.Office - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/29 15:15:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1416) Can not update the index - posted by "Hudson (JIRA)" <ji...@apache.org> on 2012/07/29 16:04:35 UTC, 1 replies.
- Re: Javadoc incorrect or missing code in 1.5.1 Generator - posted by Lewis John Mcgibbney <le...@gmail.com> on 2012/07/30 13:16:08 UTC, 0 replies.
- [jira] [Created] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 14:21:34 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 14:21:35 UTC, 2 replies.
- [jira] [Closed] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 14:21:35 UTC, 0 replies.
- [jira] [Created] (NUTCH-1442) indexingfilter.order is property is misread in code - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 14:41:33 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/07/30 14:43:34 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1441) AnchorIndexingFilter should use plain HashSet - posted by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/07/30 15:32:34 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1100) SolrDedup broken - posted by "Hernan (JIRA)" <ji...@apache.org> on 2012/07/30 22:39:36 UTC, 0 replies.
- [jira] [Created] (NUTCH-1443) Solr schema version is invalid - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/31 23:20:34 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1443) Solr schema version is invalid - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/31 23:24:35 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1443) Solr schema version is invalid - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/31 23:26:33 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1443) Solr schema version is invalid - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2012/07/31 23:28:33 UTC, 0 replies.