- [jira] [Created] (NUTCH-2378) ChildFirst plugin classloader - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/01 15:43:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2378) ChildFirst plugin classloader - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/01 15:53:04 UTC, 1 reply.
- Nutch git/wiki - posted by Markus Jelsma <ma...@openindex.io> on 2017/05/01 20:59:30 UTC, 6 replies.
- [jira] [Commented] (NUTCH-2378) ChildFirst plugin classloader - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2017/05/01 21:39:04 UTC, 0 replies.
- [Nutch Wiki] Update of "UsingGit" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2017/05/01 21:45:48 UTC, 2 replies.
- [jira] [Created] (NUTCH-2379) crawl script dedup's crawldb update is slow - posted by "Michael Coffey (JIRA)" <ji...@apache.org> on 2017/05/01 22:53:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2380) indexer-elastic version bump - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/02 08:29:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2380) indexer-elastic version bump - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/02 08:33:04 UTC, 1 reply.
- [jira] [Commented] (NUTCH-2373) Indexer for Hbase - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/02 09:19:04 UTC, 10 replies.
- [jira] [Created] (NUTCH-2381) In some situations the class TextProfileSignature gives different signatures for the same text "profile" page. - posted by "Rodrigo Joni Sestari (JIRA)" <ji...@apache.org> on 2017/05/02 11:54:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2382) indexer-hbase Nutch 1.x branch - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/02 12:48:04 UTC, 0 replies.
- [jira] [Issue Comment Deleted] (NUTCH-2373) Indexer for Hbase - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/02 12:51:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2382) indexer-hbase Nutch 1.x branch - posted by "Jurian Broertjes (JIRA)" <ji...@apache.org> on 2017/05/02 12:53:04 UTC, 1 reply.
- [jira] [Created] (NUTCH-2383) Wrong FS exception in Fetcher - posted by "Yossi Tamari (JIRA)" <ji...@apache.org> on 2017/05/02 13:27:04 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2383) Wrong FS exception in Fetcher - posted by "Yossi Tamari (JIRA)" <ji...@apache.org> on 2017/05/02 13:29:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2383) Wrong FS exception in Fetcher - posted by "Yossi Tamari (JIRA)" <ji...@apache.org> on 2017/05/02 13:29:04 UTC, 1 reply.
- [jira] [Commented] (NUTCH-2382) indexer-hbase Nutch 1.x branch - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/02 14:40:04 UTC, 1 reply.
- [jira] [Comment Edited] (NUTCH-2382) indexer-hbase Nutch 1.x branch - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/02 14:52:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2377) Nutch can't parse relative links - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2017/05/03 09:35:04 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2383) Wrong FS exception in Fetcher - posted by "Yossi Tamari (JIRA)" <ji...@apache.org> on 2017/05/03 11:49:04 UTC, 2 replies.
- [jira] [Created] (NUTCH-2384) nutch 2.3.1 unable to fetch all documents with hadoop 2.7.1 - posted by "Shubham Gupta (JIRA)" <ji...@apache.org> on 2017/05/03 12:18:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2385) 1.x Elasticsearch Indexer - path.home is not configured - posted by "Steven W (JIRA)" <ji...@apache.org> on 2017/05/03 17:45:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2379) crawl script dedup's crawldb update is slow - posted by "Michael Coffey (JIRA)" <ji...@apache.org> on 2017/05/03 17:49:04 UTC, 1 reply.
- [jira] [Resolved] (NUTCH-2377) Nutch can't parse relative links - posted by "hakim (JIRA)" <ji...@apache.org> on 2017/05/03 20:19:04 UTC, 0 replies.
- Fwd: GSoC 2017: You are a mentor for Omkar Reddy Gojala - posted by lewis john mcgibbney <le...@apache.org> on 2017/05/08 20:17:43 UTC, 4 replies.
- [jira] [Updated] (NUTCH-2384) nutch 2.3.1 job not properly interacting with hadoop 2.7.1 - posted by "Shubham Gupta (JIRA)" <ji...@apache.org> on 2017/05/09 04:21:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2375) Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/09 17:14:04 UTC, 17 replies.
- [jira] [Commented] (NUTCH-2374) Upgrade Nutch 2.X to Gora 0.7 - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/14 09:04:04 UTC, 5 replies.
- [jira] [Created] (NUTCH-2386) BasicURLNormalizer does not encode curly braces - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2017/05/15 13:48:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2386) BasicURLNormalizer does not encode curly braces - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2017/05/15 13:49:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2387) Nutch should not index document with "noindex" meta - posted by "Eyeris Rodriguez Rueda (JIRA)" <ji...@apache.org> on 2017/05/18 18:51:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1465) Support sitemaps in Nutch - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/19 05:33:04 UTC, 1 reply.
- [jira] [Commented] (NUTCH-2376) Improve configurability of HTTP Accept* header fields - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/19 06:13:04 UTC, 10 replies.
- [jira] [Commented] (NUTCH-2353) Create seed file with metadata using the REST API - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/19 06:21:04 UTC, 2 replies.
- [jira] [Resolved] (NUTCH-2353) Create seed file with metadata using the REST API - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2017/05/19 06:22:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2376) Improve configurability of HTTP Accept* header fields - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2017/05/19 10:53:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2376) Improve configurability of HTTP Accept* header fields - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2017/05/19 10:57:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2373) Indexer for Hbase - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2017/05/22 21:03:04 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3430 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2017/05/22 21:52:00 UTC, 0 replies.
- [jira] [Created] (NUTCH-2388) bin/crawl indexing only webpages containing batchID instead of all - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/23 07:10:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2388) bin/crawl indexing only webpages containing batchID instead of all in 2.x - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/23 07:11:04 UTC, 1 reply.
- [jira] [Commented] (NUTCH-2388) bin/crawl indexing only webpages containing batchID instead of all in 2.x - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/05/23 08:34:04 UTC, 3 replies.
- [jira] [Created] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/23 09:31:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/23 09:32:04 UTC, 4 replies.
- [jira] [Created] (NUTCH-2390) No documentation on pluggable indexing - posted by "Jonathan Jackson (JIRA)" <ji...@apache.org> on 2017/05/23 15:14:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2388) bin/crawl indexing only webpages containing batchID instead of all in 2.x - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2017/05/23 16:49:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/26 15:39:04 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-2389) Precise data parsing using Jsoup CSS selectors - posted by "Kaidul Islam (JIRA)" <ji...@apache.org> on 2017/05/26 15:42:04 UTC, 0 replies.
- subscribe - posted by Jorge Betancourt <be...@gmail.com> on 2017/05/29 11:46:17 UTC, 2 replies.