You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Created] (NUTCH-2319) Link with "rel=alternate" doesn't return in crawl - posted by "Zuber (JIRA)" <ji...@apache.org> on 2016/10/01 12:11:20 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing. - posted by "Kris (JIRA)" <ji...@apache.org> on 2016/10/04 17:13:20 UTC, 0 replies.
- [jira] [Commented] (NUTCH-978) A Plugin for extracting certain element of a web page on html page parsing. - posted by "Kris (JIRA)" <ji...@apache.org> on 2016/10/04 17:13:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:40:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:40:20 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2319) Link with "rel=alternate" doesn't return in crawl - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:42:21 UTC, 4 replies.
- [jira] [Commented] (NUTCH-2318) Text extraction in HtmlParser adds too much whitespace. - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:49:21 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:54:20 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/05 12:56:21 UTC, 4 replies.
- Build failed in Jenkins: Nutch-trunk #3396 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/10/05 13:57:14 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 08:58:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-2321) Indexing filter checker leaks threads - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 09:35:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2321) Indexing filter checker leaks threads - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 09:35:20 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2320) URLFilterChecker to run as TCP Telnet service - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 09:36:20 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3397 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2016/10/06 10:51:40 UTC, 0 replies.
- [jira] [Created] (NUTCH-2322) URL not available for Jexl operations - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 11:56:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2322) URL not available for Jexl operations - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/06 11:57:20 UTC, 5 replies.
- [GitHub] nutch pull request #153: Bug in setting default linkdb path - posted by sachin086 <gi...@git.apache.org> on 2016/10/06 12:36:08 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1741) Support of Sitemaps in Nutch 2.x - posted by "Alfonso Nishikawa (JIRA)" <ji...@apache.org> on 2016/10/08 14:45:21 UTC, 1 replies.
- [jira] [Comment Edited] (NUTCH-1741) Support of Sitemaps in Nutch 2.x - posted by "Alfonso Nishikawa (JIRA)" <ji...@apache.org> on 2016/10/08 14:46:20 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1741) Support of Sitemaps in Nutch 2.x - posted by "Alfonso Nishikawa (JIRA)" <ji...@apache.org> on 2016/10/08 17:43:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-2323) ElasticSearch Indexer does not work on Nutch 2.3.1 - posted by "Joe Crane (JIRA)" <ji...@apache.org> on 2016/10/10 23:25:20 UTC, 0 replies.
- [jira] [Created] (NUTCH-2324) Issue in setting default linkdb path - posted by "Sachin (JIRA)" <ji...@apache.org> on 2016/10/12 05:30:21 UTC, 0 replies.
- [jira] [Created] (NUTCH-2325) INJECT REST call sets overwrite and update to false, which is wrong - posted by "Sujan Kumar Suppala (JIRA)" <ji...@apache.org> on 2016/10/14 11:46:21 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2325) INJECT REST call sets overwrite and update to false, which is wrong - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/14 13:34:20 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2325) Inject REST call to set overwrite and update parameters - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2016/10/14 13:35:21 UTC, 1 replies.
- [jira] [Created] (NUTCH-2326) Implement InvertLinks job in webui package - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/17 15:45:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1314) Impose a limit on the length of outlink target urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 04:23:58 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1314) Impose a limit on the length of outlink target urls - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 04:23:58 UTC, 0 replies.
- [jira] [Created] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 06:38:59 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/18 06:55:58 UTC, 4 replies.
- [jira] [Updated] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 07:05:58 UTC, 0 replies.
- [jira] [Created] (NUTCH-2328) GeneratorJob does not generate anything on second run - posted by "Arthur B (JIRA)" <ji...@apache.org> on 2016/10/18 13:16:58 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2328) GeneratorJob does not generate anything on second run - posted by "Arthur B (JIRA)" <ji...@apache.org> on 2016/10/18 13:18:58 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2328) GeneratorJob does not generate anything on second run - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2016/10/18 14:24:58 UTC, 9 replies.
- [jira] [Comment Edited] (NUTCH-2328) GeneratorJob does not generate anything on second run - posted by "Arthur B (JIRA)" <ji...@apache.org> on 2016/10/18 16:14:58 UTC, 4 replies.
- [jira] [Created] (NUTCH-2329) Update Slf4j logging for Java 8 and upgrade miredot plugin version - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 19:38:59 UTC, 0 replies.
- [GitHub] nutch pull request #154: NUTH-2329 Update Slf4j logging for Java 8 and upgra... - posted by lewismc <gi...@git.apache.org> on 2016/10/18 19:40:14 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2329) Update Slf4j logging for Java 8 and upgrade miredot plugin version - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 19:41:58 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2329) Update Slf4j logging for Java 8 and upgrade miredot plugin version - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/18 19:42:58 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard - posted by "David Johnson (JIRA)" <ji...@apache.org> on 2016/10/18 22:41:58 UTC, 1 replies.
- [jira] [Commented] (NUTCH-1917) index.parse.md, index.content.md and index.db.md should support wildcard - posted by "David Johnson (JIRA)" <ji...@apache.org> on 2016/10/18 22:44:58 UTC, 1 replies.
- [jira] [Issue Comment Deleted] (NUTCH-2328) GeneratorJob does not generate anything on second run - posted by "Arthur B (JIRA)" <ji...@apache.org> on 2016/10/19 11:24:59 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/20 22:24:58 UTC, 0 replies.
- [GitHub] nutch pull request #155: Fix for NUTCH-2327: Seeds injected in REST must be ... - posted by sujen1412 <gi...@git.apache.org> on 2016/10/20 22:36:29 UTC, 1 replies.
- [jira] [Created] (NUTCH-2331) REST API Fetch fails to retrieve HDFS path on distributed mode - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/20 22:40:58 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1531) URL filtering takes long time for very long URLs - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2016/10/24 10:32:58 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2329) Update Slf4j logging for Java 8 and upgrade miredot plugin version - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2016/10/24 17:05:58 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/25 16:04:58 UTC, 0 replies.