- Fosdem - posted by BlackIce <bl...@gmail.com> on 2020/02/04 09:44:34 UTC, 2 replies.
- [jira] [Created] (NUTCH-2767) Fetcher to stop filling queues skipped due to repeated exceptions - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/19 14:32:00 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2767) Fetcher to stop filling queues skipped due to repeated exceptions - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/19 14:42:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2767) Fetcher to stop filling queues skipped due to repeated exceptions - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/19 15:57:00 UTC, 4 replies.
- [jira] [Commented] (NUTCH-2763) protocol-okhttp (store.http.headers): add whitespace in status line after status code also when message is empty - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/20 09:08:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2768) FetcherThread: unnecessary usage of class casts - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/21 15:10:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2768) FetcherThread: unnecessary usage of class casts - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/21 15:19:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2769) Nutch 1.15 unable to parse certain outlinks - posted by "Prajeeth Emanuel (Jira)" <ji...@apache.org> on 2020/02/26 09:38:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2769) Nutch 1.15 unable to parse certain outlinks - posted by "Markus Jelsma (Jira)" <ji...@apache.org> on 2020/02/26 11:24:00 UTC, 2 replies.
- [jira] [Created] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. - posted by "Jason Grey (Jira)" <ji...@apache.org> on 2020/02/26 22:44:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2771) Tests in nightly builds: speed up long runners - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 09:12:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2767) Fetcher to stop filling queues skipped due to repeated exceptions - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 10:31:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2768) FetcherThread: unnecessary usage of class casts - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 10:58:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2763) protocol-okhttp (store.http.headers): add whitespace in status line after status code also when message is empty - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 11:10:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 12:05:00 UTC, 3 replies.
- [jira] [Updated] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 12:06:00 UTC, 4 replies.
- [jira] [Created] (NUTCH-2772) Debugging parse filter to show serialized DOM tree - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/27 16:14:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2772) Debugging parse filter to show serialized DOM tree - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/27 16:17:00 UTC, 1 reply.
- [jira] [Created] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/28 10:08:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2773) SegmentReader (-dump or -get): show HTML content as UTF-8 - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/28 11:35:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/28 18:27:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2774) Annotate methods implementing the Hadoop API by @Override - posted by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/28 18:31:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2769) parse-html unable to parse certain outlinks - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/28 18:58:00 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2770) Subcollection logic allows empty string as a whitelist value, thus matching every incoming document. - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/28 19:02:00 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/29 18:07:00 UTC, 1 reply.
- [jira] [Created] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay - posted by "Sebastian Nagel (Jira)" <ji...@apache.org> on 2020/02/29 18:07:00 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2775) Fetcher to guarantee minimum delay even if robots.txt defines shorter Crawl-delay - posted by "Markus Jelsma (Jira)" <ji...@apache.org> on 2020/02/29 18:30:00 UTC, 0 replies.