You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Commented] (NUTCH-2718) Names of index writers and exchanges configuration files to be configurable - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 04:51:00 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2719) NPE if exchanges.xml uses index writer not available - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 04:51:00 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2718) Names of index writers and exchanges configuration files to be configurable - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/01 04:54:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2719) NPE if exchanges.xml uses index writer not available - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/01 04:54:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3637 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/09/01 05:47:36 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 06:53:00 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2729) protocol-okhttp: fix marking of truncated content - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 06:54:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 06:54:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2033) parse-tika skips valid documents. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 07:07:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2157) Parent Issue for Addressing Miredot REST API Warnings - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 07:07:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2247) Protocol resolver - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 07:08:00 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3638 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/09/01 08:02:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2729) protocol-okhttp: fix marking of truncated content - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 08:48:00 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2729) protocol-okhttp: fix marking of truncated content - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 08:48:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2598) URLNormalizerChecker fails on invalid URLs in input - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 10:06:00 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2598) URLNormalizerChecker fails on invalid URLs in input - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 10:07:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2598) URLNormalizerChecker fails on invalid URLs in input - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 10:07:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2696) Nutch SegmentReader does not dump non-ASCII characters with Hadoop 3.x - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 15:49:00 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2696) Nutch SegmentReader does not dump non-ASCII characters with Hadoop 3.x - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/01 15:50:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2654) Remove obsolete index-writer configuration in conf/ - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/01 19:07:00 UTC, 10 replies.
- [jira] [Created] (NUTCH-2732) Ignored and tracked configuration files by git - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/01 19:59:00 UTC, 0 replies.
- [DISCUSS] Release 1.16? - posted by Sebastian Nagel <wa...@googlemail.com> on 2019/09/02 15:05:08 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2731) Solr Cleanup Step Fails when Authentication is Required - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/02 15:10:02 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1749) Optionally exclude title from content field - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/04 11:10:00 UTC, 5 replies.
- [jira] [Commented] (NUTCH-2732) Ignored and tracked configuration files by git - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/05 14:39:00 UTC, 9 replies.
- [jira] [Commented] (NUTCH-2612) Support for sitemap processing by hostname - posted by "Markus Jelsma (Jira)" <ji...@apache.org> on 2019/09/06 12:02:00 UTC, 5 replies.
- [jira] [Created] (NUTCH-2733) protocol-okhttp: add support for Brotli compression (Content-Encoding) - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/09 11:46:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2612) Support for sitemap processing by hostname - posted by "Markus Jelsma (Jira)" <ji...@apache.org> on 2019/09/09 13:02:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2732) Ignored and tracked configuration files by git - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/09 13:10:00 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2615) Publisher for Telegram - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/09 13:15:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2140) Atomic update and optimistic concurrency update using Solr - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/09 13:15:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2334) Extension point for schedulers - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/09 13:15:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2707) protocol-okhttp fails to decompress content if Content-Encoding header is wrong - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/09 15:57:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2734) Upgrade 2.x to use Tika 1.22 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/09 16:07:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2734) Upgrade 2.x to use Tika 1.22 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/09 16:12:00 UTC, 3 replies.
- [jira] [Assigned] (NUTCH-2734) Upgrade 2.x to use Tika 1.22 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/09 16:12:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1086) Rewrite protocol-httpclient - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/10 08:40:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1982) Make Git ignore IDE project files and add note about IDE setup - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/10 11:04:00 UTC, 4 replies.
- [jira] [Assigned] (NUTCH-1982) Make Git ignore IDE project files and add note about IDE setup - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/10 11:04:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2735) Update the indexer-solr documentation about the schema.xml usage - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/12 17:36:00 UTC, 0 replies.
- Injection from webservice - posted by Roannel Fernandez Hernandez <ro...@uci.cu> on 2019/09/16 14:59:14 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2735) Update the indexer-solr documentation about the schema.xml usage - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/19 12:35:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2735) Update the indexer-solr documentation about the schema.xml usage - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/19 12:36:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2654) Remove obsolete index-writer configuration in conf/ - posted by "Roannel Fernández Hernández (Jira)" <ji...@apache.org> on 2019/09/19 13:13:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3643 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/09/19 13:53:36 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2734) Upgrade 2.x to use Tika 1.22 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 08:41:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1982) Make Git ignore IDE project files and add note about IDE setup - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 08:46:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2721) Make the plugin lib-htmlunit depend on lib-selenium - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 08:48:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2705) urlfilter-validator rejects IPv6 URLs - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 08:48:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2649) Optionally skip TLS/SSL certificate validation for protocol-selenium and protocol-htmlunit - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 08:48:00 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3644 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/09/23 10:17:44 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3645 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2019/09/23 11:06:02 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2669) Reliable solution for javax.ws packaging.type - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/23 11:17:00 UTC, 0 replies.
- [VOTE] Release Apache Nutch 2.4 RC#1 - posted by Sebastian Nagel <wa...@googlemail.com> on 2019/09/24 09:54:48 UTC, 3 replies.
- [jira] [Updated] (NUTCH-2636) protocol-okhttp: http.proxy.exclusion.list does not work if http.proxy.username - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:18:00 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2636) protocol-okhttp: http.proxy.exclusion.list does not work if http.proxy.username - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:18:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2672) Ant build erronously installs *-test.jar instead *.jar for target "nightly" - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:19:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2730) SitemapProcessor to treat sitemap URLs as Set instead of List - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:20:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2736) Upgrade Dockerfile to be based on recent Ubuntu LTS version - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:23:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2736) Upgrade Dockerfile to be based on recent Ubuntu LTS version - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/25 08:32:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2736) Upgrade Dockerfile to be based on recent Ubuntu LTS version - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/25 08:34:00 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1943) Form authentication should not be global and ignore - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:15:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2133) Transfer Selenium Documentation to WIki - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:15:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2207) Remove class duplication and smarten-up scoring-similarity plugin - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:16:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2156) Dump via Services end point - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:16:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2151) Service endpoint for REST API - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:16:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2214) Index clean to be flexible on what it deletes - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:17:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2237) DeduplicationJob: Add extra order criteria based on slug - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:18:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2237) DeduplicationJob: Add extra order criteria based on slug - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:18:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2292) Mavenize the build for nutch-core and nutch-plugins - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:20:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2290) Update licenses of bundled libraries - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:20:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2697) Upgrade Ivy to fix the issue of an unset packaging.type property. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:24:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2030) ParseZip plugin is not able to extract language from zip document,this could solve that problem. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:24:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1807) avoid methods relying on system-specific default locale / charset - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:24:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2671) Upgrade ant ivy library - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:25:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2605) The Feed plugin causes a NumberFormatException - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:26:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2605) The Feed plugin causes a NumberFormatException - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:27:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2596) Upgrade from org.mortbay.jetty to org.eclipse.jetty - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:28:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2573) Suspend crawling if robots.txt fails to fetch with 5xx status - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:28:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2512) Nutch does not build under JDK9 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:28:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2429) Fix Plugin System to allow protocol plugins to bundle their URLStreamHandlers - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2459) Nutch cannot download/parse some files via FTP - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2417) Support for variable fetch delay via FreeGenerator - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2382) indexer-hbase Nutch 1.x branch - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2363) Fetcher support for reading and setting cookies - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/26 16:30:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2162) Nutch Webapp Crawl fails as it tries to index - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 09:13:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2736) Upgrade Dockerfile to be based on recent Ubuntu LTS version - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 09:19:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2184) Enable IndexingJob to function with no crawldb - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 09:22:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2381) In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 11:30:00 UTC, 4 replies.
- [jira] [Assigned] (NUTCH-2381) In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 11:30:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1403) Add default ScoringFilter for manipulating metadata - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 12:13:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1403) Add default ScoringFilter for manipulating metadata - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 12:13:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/27 13:04:00 UTC, 11 replies.
- [jira] [Comment Edited] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 13:13:00 UTC, 2 replies.
- [jira] [Reopened] (NUTCH-2732) Ignored and tracked configuration files by git - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 14:36:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 14:51:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2381) In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 14:53:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2710) Normalize outlinks before checking for internal or external links - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 20:17:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2261) ParseSegment job does not pass metadata for content-level redirects - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 20:24:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-685) Content-level redirect status lost in ParseSegment - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 20:24:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2482) index-geoip not to add null values to document fields - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 22:02:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2482) index-geoip not to add null values to document fields - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/27 22:02:00 UTC, 5 replies.
- [jira] [Assigned] (NUTCH-2482) index-geoip not to add null values to document fields - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/27 22:03:00 UTC, 0 replies.
- Hacktoberfest 2019 - posted by Furkan KAMACI <fu...@gmail.com> on 2019/09/29 15:30:02 UTC, 1 replies.
- [jira] [Resolved] (NUTCH-2482) index-geoip not to add null values to document fields - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 07:25:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2737) Generator: count and log reason of rejections during selection - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 08:02:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2737) Generator: count and log reason of rejections during selection - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:01:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2738) Generator: document property generate.restrict.status - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:09:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2738) Generator: document property generate.restrict.status - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:09:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2737) Generator: count and log reason of rejections during selection - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:09:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:20:00 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2501) Take into account $NUTCH_HEAPSIZE when crawling using crawl script - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 09:21:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2737) Generator: count and log reason of rejections during selection - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/30 11:14:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:31:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2457) Embedded documents likely not correctly parsed by Tika - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:32:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2381) In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:33:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2739) indexer-elastic: Upgrade ES and migrate to REST client - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:56:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2304) Fix Elasticsearch Rest Indexing Plugin's Dependencies - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:58:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2677) Update Jest client in indexer-elastic-rest plugin - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 11:58:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2740) Generator: generate.max.count overflow not logged - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 12:29:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2738) Generator: document property generate.restrict.status - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 12:35:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2387) Nutch should not index document with "noindex" meta - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 13:44:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2279) LinkRank fails when using Hadoop MR output compression - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2019/09/30 15:53:00 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2279) LinkRank fails when using Hadoop MR output compression - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/30 15:53:00 UTC, 2 replies.