You are viewing a plain text version of this content. The canonical link for it is here.
- [jira] [Created] (NUTCH-2090) Refactor Seed Resource - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/04 02:58:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2090) Refactor Seed Resource in REST API - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/04 11:07:45 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/04 19:35:46 UTC, 3 replies.
- [jira] [Comment Edited] (NUTCH-2064) URLNormalizer basic to properly encode non-ASCII characters - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/04 19:39:45 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2090) Refactor Seed Resource in REST API - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/07 04:51:45 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2090) Refactor Seed Resource in REST API - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/07 04:51:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2090) Refactor Seed Resource in REST API - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/07 06:08:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1644) Should have a parser that uses xpath - posted by "Bipin Roshan Nag (JIRA)" <ji...@apache.org> on 2015/09/07 23:13:45 UTC, 1 replies.
- [jira] [Created] (NUTCH-2091) Make Nutch more robust and smart - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/08 19:45:46 UTC, 0 replies.
- [jira] [Created] (NUTCH-2092) Unit Test for NutchServer - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/09 05:46:45 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2092 by Sujen Shah - posted by sujen1412 <gi...@git.apache.org> on 2015/09/09 06:10:05 UTC, 5 replies.
- [jira] [Commented] (NUTCH-2092) Unit Test for NutchServer - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/09/09 06:10:46 UTC, 9 replies.
- [ANNOUNCE] New Nutch committer and PMC - Asitang Mishra - posted by Sebastian Nagel <wa...@googlemail.com> on 2015/09/10 00:01:44 UTC, 4 replies.
- Re: [MASSMAIL]Re: [ANNOUNCE] New Nutch committer and PMC - Asitang Mishra - posted by Jorge Luis Betancourt González <jl...@uci.cu> on 2015/09/10 02:27:25 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1679) UpdateDb using batchId, link may override crawled page. - posted by "Alexander Kingson (JIRA)" <ji...@apache.org> on 2015/09/10 07:51:46 UTC, 2 replies.
- [jira] [Updated] (NUTCH-1679) UpdateDb using batchId, link may override crawled page. - posted by "Alexander Kingson (JIRA)" <ji...@apache.org> on 2015/09/10 07:52:46 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-1943) Form authentication should not be global and ignore - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/10 08:52:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1943) Form authentication should not be global and ignore - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/10 08:52:46 UTC, 1 replies.
- Unsubscribe - posted by Navyashree Kalyani <nk...@usc.edu> on 2015/09/10 09:44:13 UTC, 1 replies.
- [jira] [Created] (NUTCH-2093) Indexing filters have no signature in CrawlDatum if crawled via FreeGenerator - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/09/11 14:14:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2093) Indexing filters have no signature in CrawlDatum if crawled via FreeGenerator - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/09/11 14:14:46 UTC, 0 replies.
- [GitHub] nutch pull request: WARC exporter for the CommonCrawlDataDumper - posted by jorgelbg <gi...@git.apache.org> on 2015/09/11 16:48:10 UTC, 12 replies.
- [jira] [Created] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again. - posted by "Prerna Satija (JIRA)" <ji...@apache.org> on 2015/09/11 20:01:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again. - posted by "Prerna Satija (JIRA)" <ji...@apache.org> on 2015/09/11 20:23:46 UTC, 1 replies.
- [jira] [Reopened] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/11 20:26:46 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/11 20:27:45 UTC, 4 replies.
- [jira] [Work started] (NUTCH-2094) When stopping a crawl in Nutch 2.3, I was having trouble when I start an already stopped crawl and then stop it again. - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/11 20:27:45 UTC, 0 replies.
- [jira] [Created] (NUTCH-2095) WARC exporter for the CommonCrawlDataDumper - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/09/11 21:17:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2095) WARC exporter for the CommonCrawlDataDumper - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/09/11 21:18:45 UTC, 2 replies.
- [jira] [Created] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/12 00:26:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/12 01:07:46 UTC, 5 replies.
- [jira] [Commented] (NUTCH-1084) ReadDB url throws exception - posted by "Nadeem Douba (JIRA)" <ji...@apache.org> on 2015/09/12 08:57:46 UTC, 0 replies.
- [jira] [Comment Edited] (NUTCH-1084) ReadDB url throws exception - posted by "Nadeem Douba (JIRA)" <ji...@apache.org> on 2015/09/12 09:16:46 UTC, 0 replies.
- [GitHub] nutch pull request: Nutch 2096: Explicitly indicate broswer binary... - posted by kwhitehall <gi...@git.apache.org> on 2015/09/12 18:15:32 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 18:24:45 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 18:24:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 18:26:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2096) Explicitly indicate broswer binary to use when selecting selenium remote option in config - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 19:18:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2090) Refactor Seed Resource in REST API - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 19:19:45 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2092) Unit Test for NutchServer - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 19:20:45 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2092) Unit Test for NutchServer - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 19:20:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2092) Unit Test for NutchServer - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/12 19:41:46 UTC, 0 replies.
- Introducing myself (Aron Ahmadia) - posted by Aron Ahmadia <aa...@continuum.io> on 2015/09/14 16:25:35 UTC, 2 replies.
- [jira] [Commented] (NUTCH-2093) Indexing filters have no signature in CrawlDatum if crawled via FreeGenerator - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/14 17:59:46 UTC, 2 replies.
- [jira] [Commented] (NUTCH-1943) Form authentication should not be global and ignore - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/14 18:11:47 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2086) Nutch 1.X Webui - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/14 18:13:47 UTC, 9 replies.
- [jira] [Comment Edited] (NUTCH-2086) Nutch 1.X Webui - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/14 18:23:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1800) Documentation for Nutch 1.X and 2.X REST APIs - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/14 18:29:45 UTC, 1 replies.
- [jira] [Created] (NUTCH-2097) Proposal for Nutch 3.x - posted by "Nadeem Douba (JIRA)" <ji...@apache.org> on 2015/09/14 23:37:47 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2097) Proposal for Nutch 3.x - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/14 23:49:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2097) Proposal for Nutch 3.x - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/15 00:02:45 UTC, 6 replies.
- [GitHub] nutch pull request: 2.x - posted by prernasatija <gi...@git.apache.org> on 2015/09/15 06:42:08 UTC, 1 replies.
- [GitHub] nutch pull request: fix for NUTCH-2094 contributed by prernasatija - posted by prernasatija <gi...@git.apache.org> on 2015/09/15 07:30:50 UTC, 1 replies.
- [jira] [Comment Edited] (NUTCH-2097) Proposal for Nutch 3.x - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/09/15 08:50:45 UTC, 3 replies.
- [jira] [Resolved] (NUTCH-2093) Indexing filters have no signature in CrawlDatum if crawled via FreeGenerator - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/09/15 08:53:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1932) Automatically remove orphaned pages - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2015/09/15 11:16:46 UTC, 10 replies.
- [jira] [Commented] (NUTCH-1932) Automatically remove orphaned pages - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/15 12:24:46 UTC, 4 replies.
- [jira] [Created] (NUTCH-2098) Add null SeedUrl constructor - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/09/15 16:18:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2098) Add null SeedUrl constructor - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/09/15 16:18:45 UTC, 1 replies.
- [jira] [Created] (NUTCH-2099) Refactoring the REST endpoints for integration with webui - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/15 18:14:45 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah - posted by sujen1412 <gi...@git.apache.org> on 2015/09/15 18:16:04 UTC, 12 replies.
- [jira] [Commented] (NUTCH-2099) Refactoring the REST endpoints for integration with webui - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/09/15 18:16:46 UTC, 14 replies.
- [Nutch Wiki] Update of "AdvancedAjaxInteraction" by MichaelJoyce - posted by Apache Wiki <wi...@apache.org> on 2015/09/15 19:28:50 UTC, 0 replies.
- [ANNOUNCE] New Nutch committer and PMC - Sujen Shah - posted by Sebastian Nagel <wa...@googlemail.com> on 2015/09/15 21:59:49 UTC, 2 replies.
- [jira] [Created] (NUTCH-2100) Nutch dump command doesnt dump anything - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/15 22:12:46 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2100) Nutch dump command doesnt dump anything - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/15 22:44:46 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2100) Nutch dump command doesnt dump anything - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/15 22:45:45 UTC, 1 replies.
- [jira] [Closed] (NUTCH-2100) Nutch dump command doesnt dump anything - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/16 02:10:45 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1679) UpdateDb using batchId, link may override crawled page. - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:23:50 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1922) DbUpdater overwrites fetch status for URLs from previous batches, causes repeated re-fetches - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:27:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2029) Mark.checkMark returns empty string when null is expected with mongodb storage - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:30:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2080) Eclipse compilation issue - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:32:46 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2009) Fetcher does not work with batchID - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:33:45 UTC, 0 replies.
- [jira] [Created] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.4.0 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 06:43:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 07:13:46 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/16 07:14:46 UTC, 0 replies.
- [jira] [Created] (NUTCH-2102) WARC Exporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/16 12:48:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2102) WARC Exporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/16 12:49:45 UTC, 5 replies.
- [jira] [Commented] (NUTCH-2102) WARC Exporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/16 12:59:45 UTC, 5 replies.
- [jira] [Comment Edited] (NUTCH-2102) WARC Exporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/16 13:22:45 UTC, 0 replies.
- [jira] [Created] (NUTCH-2103) Nutch 2.3 has an old version of hbase jar in runtime/lib folder - posted by "Mobin Ranjbar (JIRA)" <ji...@apache.org> on 2015/09/16 17:30:47 UTC, 0 replies.
- unsubscribe - posted by Mohit Raman <mo...@usc.edu> on 2015/09/16 19:01:39 UTC, 1 replies.
- [jira] [Created] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/17 03:52:45 UTC, 0 replies.
- [jira] [Commented] (NUTCH-1946) Upgrade to Gora 0.6.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:16:46 UTC, 2 replies.
- [jira] [Updated] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:20:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1946) Upgrade to Gora 0.6.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:23:46 UTC, 1 replies.
- NUTCH-1946 Upgrade to Gora 0.6.1 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2015/09/17 08:29:15 UTC, 1 replies.
- [jira] [Updated] (NUTCH-1709) Generated classes o.a.n.storage.Host and o.a.n.storage.ProtocolStatus contain methods not defined in source .avsc - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:30:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:52:46 UTC, 2 replies.
- [jira] [Created] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:55:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 08:56:46 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:00:52 UTC, 4 replies.
- [jira] [Updated] (NUTCH-1981) Upgrade icu4j - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:02:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1941) Optional rolling http.agent.name's - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:02:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1920) Upgrade Nutch to use Java 1.7 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:02:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1893) Parse-tika fails to parse feed files - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:03:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1886) Review and update default.properties - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:03:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1990) Use URI.normalise() in BasicURLNormalizer - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:04:45 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1062) Migrate BasicURLNormalizer from Apache ORO to java.util.regex - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:04:46 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:05:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:05:47 UTC, 0 replies.
- [jira] [Closed] (NUTCH-1936) GSoC 2015 - Move Nutch to Hadoop 2.X - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:05:48 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:06:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:06:46 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1169) Write JUnit tests for urlfilter-prefix - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/17 09:06:47 UTC, 1 replies.
- [jira] [Created] (NUTCH-2106) Runtime to contain Selenium and dependencies only once - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/17 14:00:05 UTC, 0 replies.
- [jira] [Created] (NUTCH-2107) plugin.xml to validate against plugin.dtd - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/17 14:17:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2106) Runtime to contain Selenium and dependencies only once - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/17 14:18:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2107) plugin.xml to validate against plugin.dtd - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/17 14:20:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X Docker - posted by "stack (JIRA)" <ji...@apache.org> on 2015/09/17 18:59:04 UTC, 3 replies.
- [jira] [Created] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/17 23:20:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2109) Create a brute force click-all-ajax-links utility fucntion for selenium interactive plugin - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/17 23:22:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/17 23:28:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2011) Endpoint to support realtime JSON output from the fetcher - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/09/18 07:28:04 UTC, 2 replies.
- [jira] [Assigned] (NUTCH-2098) Add null SeedUrl constructor - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/18 07:30:04 UTC, 0 replies.
- [jira] [Work started] (NUTCH-2098) Add null SeedUrl constructor - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/18 07:30:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2098) Add null SeedUrl constructor - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/18 07:33:04 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3272 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/18 08:07:41 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2098) Add null SeedUrl constructor - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/18 08:08:05 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3273 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/18 08:24:44 UTC, 0 replies.
- Fwd: Job Opening at Common Crawl - Crawl Engineer / Data Scientist - posted by Julien Nioche <li...@gmail.com> on 2015/09/18 11:54:17 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2106) Runtime to contain Selenium and dependencies only once - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/18 12:35:04 UTC, 3 replies.
- [GitHub] nutch pull request: fix for NUTCH-2104 contributed by kwhitehall - posted by kwhitehall <gi...@git.apache.org> on 2015/09/18 16:27:41 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2091) Make Nutch more robust and smart - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/18 18:29:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2091) Increase robustness and crawling versatility of Nutch for the Deep Web - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/18 18:30:04 UTC, 1 replies.
- [jira] [Created] (NUTCH-2111) Set temporary file location for selenium tmp files - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/19 03:38:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2111) Set temporary file location for selenium tmp files - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/19 04:14:04 UTC, 4 replies.
- [jira] [Work started] (NUTCH-2099) Refactoring the REST endpoints for integration with webui - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:13:04 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2099) Refactoring the REST endpoints for integration with webui - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:13:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2099) Refactoring the REST endpoints for integration with webui - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:15:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:18:04 UTC, 2 replies.
- [jira] [Work started] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:19:04 UTC, 0 replies.
- [jira] [Reopened] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:19:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/09/19 07:24:04 UTC, 3 replies.
- [jira] [Resolved] (NUTCH-2094) Stopping and Restarting a crawl has issues in the Web UI - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:24:04 UTC, 0 replies.
- [GitHub] nutch pull request: Webui integration - posted by chrismattmann <gi...@git.apache.org> on 2015/09/19 07:24:48 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:26:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:26:04 UTC, 3 replies.
- [jira] [Work started] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:26:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2104) Add documentation to the protocol-selenium plugin Readme file re: selenium grid implementation - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2015/09/19 07:28:05 UTC, 0 replies.
- [jira] [Created] (NUTCH-2112) Missing org.restlet.jee when building with gora-solr - posted by "Steven W (JIRA)" <ji...@apache.org> on 2015/09/20 00:07:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2113) Need documentation for using various Gora backends - posted by "Steven W (JIRA)" <ji...@apache.org> on 2015/09/20 00:13:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2112) Missing org.restlet.jee when building with gora-solr - posted by "Steven W (JIRA)" <ji...@apache.org> on 2015/09/20 00:14:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2114) kkk - posted by "Badreddine Ahmed (JIRA)" <ji...@apache.org> on 2015/09/20 00:46:04 UTC, 0 replies.
- [jira] [Closed] (NUTCH-2114) kkk - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/20 12:40:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1946) Upgrade to Gora 0.6.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:52:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1286) Refactoring/reimplementing crawling API (NutchApp) - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:53:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2101) Upgrade Nutch 2.X to Hadoop 2.5.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:53:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-1572) Nutch 2.x should use o.a.g.mem.store.MemStore for testing - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:54:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:54:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:56:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2028) java.lang.IllegalArgumentException: can't serialize class org.apache.avro.util.Utf8 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:57:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:57:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/20 14:57:04 UTC, 1 replies.
- Build failed in Jenkins: Nutch-nutchgora #1537 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/20 15:40:32 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2050) Upgrade HBase and Hadoop versioning on 2.X HBase Docker - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/20 15:41:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/20 15:44:04 UTC, 4 replies.
- Re: Questions regarding CS-572 assignment 1 - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/09/21 04:56:52 UTC, 0 replies.
- [GitHub] nutch pull request: Fix for NUTCH-2086 Contributed by Sujen Shah - posted by sujen1412 <gi...@git.apache.org> on 2015/09/21 09:29:06 UTC, 0 replies.
- [jira] [Assigned] (NUTCH-2106) Runtime to contain Selenium and dependencies only once - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/21 23:17:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2106) Runtime to contain Selenium and dependencies only once - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/21 23:17:04 UTC, 0 replies.
- [GitHub] nutch pull request: made changes for NUTCH-2108 and formatted the ... - posted by asitang <gi...@git.apache.org> on 2015/09/21 23:34:28 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/09/21 23:35:04 UTC, 6 replies.
- [jira] [Updated] (NUTCH-2110) Create the capability to provide seeds in the form of "url+xpath(including option to enter seach terms).selenium" - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/22 04:15:04 UTC, 1 replies.
- [GitHub] nutch pull request: Update NutchServer.java - posted by zhangmianhongni <gi...@git.apache.org> on 2015/09/22 11:58:27 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2095) WARC exporter for the CommonCrawlDataDumper - posted by "Jorge Luis Betancourt Gonzalez (JIRA)" <ji...@apache.org> on 2015/09/22 14:24:04 UTC, 10 replies.
- Build failed in Jenkins: Nutch-trunk #3276 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/22 14:44:23 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3277 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/22 15:47:00 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2102) WARC Exporter - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2015/09/22 16:06:04 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3278 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/22 16:53:11 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-nutchgora #1538 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/22 21:52:47 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/22 23:32:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2105) Update Nutch Cassandra Dockerfile to work with Gora Nutch 2.3.1 - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 03:01:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Release_HOWTO" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/09/23 03:06:26 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2018) Ensure that the Docker containers for Nutch 2.X are part of the Release Management Documentation - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 03:19:04 UTC, 0 replies.
- [VOTE] Release Apache Nutch 2.3.1 - posted by Lewis John Mcgibbney <le...@gmail.com> on 2015/09/23 03:45:37 UTC, 1 replies.
- Tutorial : Index the web with AWS CloudSearch - posted by Julien Nioche <li...@gmail.com> on 2015/09/23 11:26:09 UTC, 1 replies.
- Webcast : Apache Nutch on EMR - posted by Julien Nioche <li...@gmail.com> on 2015/09/23 16:35:32 UTC, 2 replies.
- [Nutch Wiki] Update of "CommonCrawlDataDumper" by JorgeLuis - posted by Apache Wiki <wi...@apache.org> on 2015/09/23 17:12:18 UTC, 0 replies.
- [GitHub] nutch pull request: fix for NUTCH-2111 contributed by kwhitehall - posted by kwhitehall <gi...@git.apache.org> on 2015/09/23 18:08:23 UTC, 1 replies.
- [jira] [Updated] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 18:55:04 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 18:55:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 18:56:05 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2111) Delete temporary files location for selenium tmp files after driver quits - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/23 19:54:05 UTC, 1 replies.
- [jira] [Created] (NUTCH-2115) Add total counts to dump stats - posted by "Michael Joyce (JIRA)" <ji...@apache.org> on 2015/09/23 21:34:04 UTC, 0 replies.
- [GitHub] nutch pull request: NUTCH-2115 - Add total counts to mimetype stat... - posted by MJJoyce <gi...@git.apache.org> on 2015/09/23 21:37:12 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2115) Add total counts to dump stats - posted by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2015/09/23 21:38:04 UTC, 3 replies.
- [jira] [Created] (NUTCH-2116) NutchServer and NutchApp should contain shutdown hooks - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 21:57:05 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2115) Add total counts to dump stats - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/23 22:01:05 UTC, 0 replies.
- Nutch datasets : How to ?? - posted by Charan Shampur <ch...@gmail.com> on 2015/09/23 22:02:44 UTC, 0 replies.
- [jira] [Created] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/24 02:22:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/24 02:24:04 UTC, 0 replies.
- Build failed in Jenkins: Nutch-trunk #3282 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/24 02:58:19 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2117) NutchServer CLI Option for CMD_PORT is incorrect and should be CMD_HOST - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/24 02:59:04 UTC, 0 replies.
- Jenkins build is back to normal : Nutch-trunk #3283 - posted by Apache Jenkins Server <je...@builds.apache.org> on 2015/09/24 06:52:07 UTC, 0 replies.
- [jira] [Created] (NUTCH-2118) browser requests sometimes timeout when using the selenium grid because of port access issues - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/24 18:11:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2119) Eclipse shows build path errors on building Nutch - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 02:47:04 UTC, 0 replies.
- [jira] [Commented] (NUTCH-2119) Eclipse shows build path errors on building Nutch - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 03:09:04 UTC, 1 replies.
- [jira] [Assigned] (NUTCH-2119) Eclipse shows build path errors on building Nutch - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 03:10:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2119) Eclipse shows build path errors on building Nutch - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 03:10:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2120) Remove MapWritable from trunk codebase - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/25 03:20:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2121) Update javadoc link for Hadoop 2.4.0 in default.properties - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 03:24:04 UTC, 0 replies.
- [jira] [Resolved] (NUTCH-2121) Update javadoc link for Hadoop 2.4.0 in default.properties - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/25 03:26:04 UTC, 0 replies.
- [Nutch Wiki] Update of "NutchFileFormats" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/09/25 03:40:14 UTC, 3 replies.
- Subscription for Developers mailing list - posted by Rahul Agarwal <ra...@usc.edu> on 2015/09/25 03:53:44 UTC, 1 replies.
- [jira] [Commented] (NUTCH-2121) Update javadoc link for Hadoop 2.4.0 in default.properties - posted by "Hudson (JIRA)" <ji...@apache.org> on 2015/09/25 03:58:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2122) Implement Javadoc package.html for service packages - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/25 03:59:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "NutchFileFormats" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/09/25 04:35:03 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "FrontPage" by LewisJohnMcgibbney - posted by Apache Wiki <wi...@apache.org> on 2015/09/25 05:17:16 UTC, 0 replies.
- [jira] [Created] (NUTCH-2123) Seed List REST API returns Text but headers indicate/require JSON - posted by "Aron Ahmadia (JIRA)" <ji...@apache.org> on 2015/09/25 07:59:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2086) Nutch 1.X Webui - posted by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2015/09/26 07:17:04 UTC, 2 replies.
- [Nutch Wiki] Update of "FrontPage" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2015/09/26 11:20:17 UTC, 0 replies.
- Fetch Failed with : java.lang.NullPointerException - posted by mithun <mi...@gmail.com> on 2015/09/27 04:40:51 UTC, 0 replies.
- Re: CSCI - 572: Team 18 : Questions - posted by "Mattmann, Chris A (3980)" <ch...@jpl.nasa.gov> on 2015/09/27 20:08:08 UTC, 1 replies.
- Permission to edit Nutch Whitelist Robots. - posted by Ayesha Sabah Hasan <ay...@usc.edu> on 2015/09/27 20:33:57 UTC, 1 replies.
- [Nutch Wiki] Update of "ContributorsGroup" by SebastianNagel - posted by Apache Wiki <wi...@apache.org> on 2015/09/27 21:42:04 UTC, 0 replies.
- Fetch failed : java.lang.NullPointerException - posted by mithun <mi...@gmail.com> on 2015/09/28 00:04:31 UTC, 2 replies.
- [jira] [Created] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/09/28 16:04:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Yogendra Kumar Soni (JIRA)" <ji...@apache.org> on 2015/09/28 16:08:04 UTC, 5 replies.
- [jira] [Commented] (NUTCH-2124) redirect following same link again and again , max redirect exceed and went db_gone - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2015/09/28 16:35:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2125) Metrics - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/28 17:06:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2125) Metrics - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/28 17:12:04 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2125) Metrics tool for relevancy - posted by "Kim Whitehall (JIRA)" <ji...@apache.org> on 2015/09/28 17:12:04 UTC, 0 replies.
- [GitHub] nutch pull request: Added support for NUTCH-2108 and NUTCH-2109 - posted by asitang <gi...@git.apache.org> on 2015/09/28 19:17:56 UTC, 1 replies.
- [GitHub] nutch pull request: NUTCH-2108 - posted by asitang <gi...@git.apache.org> on 2015/09/28 19:28:08 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2108) Add a function to the selenium interactive plugin interface to do multiple manipulation of driver and then return the data - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/28 19:53:05 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2126) Use selenium protocol for specific sites - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/28 20:15:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2126) Use selenium protocol for specific sites when switched on - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/28 20:15:04 UTC, 0 replies.
- [jira] [Created] (NUTCH-2127) Provide the selenium protocol with basic authentication capabilities. - posted by "Asitang Mishra (JIRA)" <ji...@apache.org> on 2015/09/28 20:21:04 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "WhiteListRobots" by ayeshahasan - posted by Apache Wiki <wi...@apache.org> on 2015/09/29 17:28:08 UTC, 0 replies.
- [jira] [Updated] (NUTCH-1966) Configuration endpoint for 1x REST API - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/29 21:29:05 UTC, 0 replies.
- [jira] [Created] (NUTCH-2128) Refactor configuration end point - posted by "Sujen Shah (JIRA)" <ji...@apache.org> on 2015/09/29 21:38:04 UTC, 0 replies.
- Subscribe - posted by Manali Shah <ma...@usc.edu> on 2015/09/29 22:34:51 UTC, 1 replies.
- dbunfetched URLs - team #32 - posted by Pramod Setlur <se...@usc.edu> on 2015/09/30 03:13:10 UTC, 0 replies.
- Request for inclusion in the Nutch email list - posted by Pramod Nagarajarao <pr...@usc.edu> on 2015/09/30 07:22:01 UTC, 0 replies.
- [jira] [Updated] (NUTCH-2030) ParseZip plugin is not able to extract language from zip document,this could solve that problem. - posted by "Eyeris Rodriguez Rueda (JIRA)" <ji...@apache.org> on 2015/09/30 20:20:04 UTC, 2 replies.
- SVN-GIT mirror not updated for Revision 1705744 - posted by Sujen Shah <su...@gmail.com> on 2015/09/30 20:27:04 UTC, 0 replies.