- [jira] [Updated] (NUTCH-1559) parse-metatags duplicates extracted metatags in combination with parse-tika - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/04 08:51:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2673) EOFException protocol-http - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2018/11/07 11:26:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2673) EOFException protocol-http - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/08 14:43:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2674) HostDb: dump shows wrong column headers - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/08 15:13:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2674) HostDb: dump shows wrong column headers - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/08 15:16:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1559) parse-metatags duplicates extracted metatags in combination with parse-tika - posted by "Igor Kanshyn (JIRA)" <ji...@apache.org> on 2018/11/09 15:46:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2675) Give parsers the capability to read and write CrawlDatum - posted by "Junqiang Zhang (JIRA)" <ji...@apache.org> on 2018/11/09 16:51:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2658) Add README file to all plugins in src/plugin - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/11 01:54:00 UTC, 1 reply.
- Jenkins build is back to normal : Nutch-trunk #3584 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/11 01:54:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/13 21:31:00 UTC, 4 replies.
- [jira] [Commented] (NUTCH-2655) Update Solr schema.xml for Solr 7.x - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/14 09:11:00 UTC, 2 replies.
- [jira] [Assigned] (NUTCH-2655) Update Solr schema.xml for Solr 7.x - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/14 09:12:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2655) Update Solr schema.xml for Solr 7.x - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/14 09:13:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2606) MIME detection is wrong for plain-text documents send as Content-Type "application/msword" - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/14 09:28:00 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2630) Fetcher to log skipped records by robots.txt - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/14 12:05:00 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-2630) Fetcher to log skipped records by robots.txt - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/14 12:06:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3586 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/14 12:44:09 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3587 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/14 12:51:55 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3588 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/14 12:53:39 UTC, 0 replies.
- [jira] [Created] (NUTCH-2676) Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver - posted by "Stas Batururimi (JIRA)" <ji...@apache.org> on 2018/11/15 07:48:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2460) use the headless option of firefox and chrome in protocol-selenium - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 10:23:00 UTC, 1 reply.
- [jira] [Commented] (NUTCH-1842) crawl.gen.delay has a wrong default value in nutch-default.xml or is being parsed incorrectly - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/15 10:35:01 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-1842) crawl.gen.delay has a wrong default value in nutch-default.xml or is being parsed incorrectly - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 10:36:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2656) Update description to configure Solr 7.x in tutorial - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 10:39:01 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2661) Move TestOutlinks to the proper path - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 10:43:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3589 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/15 10:44:22 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2651) Upgrade to Tika 1.19.1 (from 1.18) - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2652) Fetcher launches more fetch tasks than fetch lists - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2659) Add missing Apache license headers - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2671) Upgrade ant ivy library - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2661) Move TestOutlinks to the proper path - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2625) ProtocolFactory.getProtocol(url) may create multiple plugin instances - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2660) Unit tests of plugins parse-js, headings, index-jexl-filter to be executed during build - posted by "Hudson (JIRA)" <ji...@apache.org> on 2018/11/15 10:45:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2676) Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 13:32:01 UTC, 3 replies.
- [jira] [Commented] (NUTCH-2675) Give parsers the capability to read and write CrawlDatum - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/15 13:48:02 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2676) Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver - posted by "Stas Batururimi (JIRA)" <ji...@apache.org> on 2018/11/16 07:50:00 UTC, 7 replies.
- [jira] [Comment Edited] (NUTCH-2676) Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver - posted by "Stas Batururimi (JIRA)" <ji...@apache.org> on 2018/11/19 07:34:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/19 15:22:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2606) MIME detection is wrong for plain-text documents send as Content-Type "application/msword" - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/19 20:54:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2668) Integrate OWASP dependency checks as ant target - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/11/19 20:58:00 UTC, 5 replies.
- [jira] [Resolved] (NUTCH-2668) Integrate OWASP dependency checks as ant target - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/19 21:00:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2675) Give parsers the capability to read and write CrawlDatum - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/19 21:02:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-nutchgora #1622 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/19 21:38:21 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3590 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/19 21:43:59 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-2668) Integrate OWASP dependency checks as ant target - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2018/11/19 21:50:00 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #1623 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/19 22:09:36 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3591 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2018/11/19 22:11:08 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (NUTCH-2676) Update to the latest selenium and add code to use chrome and firefox headless mode with the remote web driver - posted by "Stas Batururimi (JIRA)" <ji...@apache.org> on 2018/11/20 08:08:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2613) Documentation for exchange component - posted by "Roannel Fernández Hernández (JIRA)" <ji...@apache.org> on 2018/11/26 16:53:00 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2659) Add missing Apache license headers - posted by "Roannel Fernández Hernández (JIRA)" <ji...@apache.org> on 2018/11/26 21:42:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2677) Update Jest client in indexer-elastic-rest plugin - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/11/28 22:31:00 UTC, 0 replies.
- Maven vs Gradle for Nutch Build System - posted by lewis john mcgibbney <le...@apache.org> on 2018/11/29 21:14:18 UTC, 5 replies.
- [jira] [Commented] (NUTCH-2292) Mavenize the build for nutch-core and nutch-plugins - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2018/11/30 14:22:00 UTC, 0 replies.