- Build failed in Hudson: Nutch-trunk #1293 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/01 05:35:04 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1294 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/02 05:03:00 UTC, 0 replies.
- [Nutch Wiki] Update of "PublicServers" by search2.net - posted by Apache Wiki <wi...@apache.org> on 2010/11/02 06:00:48 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1295 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/03 05:08:07 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1296 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/04 05:08:33 UTC, 0 replies.
- [jira] Updated: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/11/04 19:46:41 UTC, 6 replies.
- [jira] Commented: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/11/04 22:06:44 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1297 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/05 05:05:38 UTC, 0 replies.
- [jira] Commented: (NUTCH-873) Ivy configuration settings don't include Gora - posted by "Alexis (JIRA)" <ji...@apache.org> on 2010/11/05 20:42:43 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-873) Ivy configuration settings don't include Gora - posted by "Alexis (JIRA)" <ji...@apache.org> on 2010/11/05 20:48:42 UTC, 2 replies.
- [Nutch Wiki] Trivial Update of "PublicServers" by seegnify - posted by Apache Wiki <wi...@apache.org> on 2010/11/05 22:35:02 UTC, 2 replies.
- [Nutch Wiki] Update of "GORA_HBase" by Alexis - posted by Apache Wiki <wi...@apache.org> on 2010/11/05 23:31:32 UTC, 0 replies.
- [jira] Commented: (NUTCH-880) REST API for Nutch - posted by "Alexis (JIRA)" <ji...@apache.org> on 2010/11/06 01:26:41 UTC, 1 reply.
- Build failed in Hudson: Nutch-trunk #1298 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/06 05:03:09 UTC, 0 replies.
- Charset detection algorithm - posted by Ken Krugler <kk...@transpac.com> on 2010/11/06 20:03:22 UTC, 0 replies.
- My ApacheconNA 2010 Slides - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/06 21:24:46 UTC, 0 replies.
- My ApacheconNA 2010 slides - posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2010/11/06 21:25:30 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1299 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/07 05:02:08 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1300 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/08 05:02:44 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1301 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/09 05:13:59 UTC, 0 replies.
- [Nutch Wiki] Update of "RunNutchInEclipse" by store88 - posted by Apache Wiki <wi...@apache.org> on 2010/11/09 06:59:07 UTC, 0 replies.
- [Nutch Wiki] Update of "store88" by store88 - posted by Apache Wiki <wi...@apache.org> on 2010/11/09 07:38:41 UTC, 0 replies.
- [jira] Commented: (NUTCH-933) Fetcher does not save a pages Last-Modified value in CrawlDatum - posted by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2010/11/10 14:34:14 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1303 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/11 07:10:51 UTC, 0 replies.
- [Nutch Wiki] Update of "PublicServers" by SimaoFontes - posted by Apache Wiki <wi...@apache.org> on 2010/11/12 18:25:37 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by ChrisMattmann - posted by Apache Wiki <wi...@apache.org> on 2010/11/13 17:09:30 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1306 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/14 07:10:14 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1307 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/15 05:09:27 UTC, 0 replies.
- [jira] Created: (NUTCH-934) Upgrade to Tika 0.8 - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/11/15 12:53:15 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1308 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/16 05:02:22 UTC, 0 replies.
- [jira] Commented: (NUTCH-924) Static field in solr mapping - posted by "David Stuart (JIRA)" <ji...@apache.org> on 2010/11/16 11:45:25 UTC, 1 reply.
- Build failed in Hudson: Nutch-trunk #1309 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/17 06:52:24 UTC, 0 replies.
- [jira] Created: (NUTCH-935) remove unnecessary /./ in basic urlnormalizer - posted by "Stondubleyt (JIRA)" <ji...@apache.org> on 2010/11/17 11:01:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-935) remove unnecessary /./ in basic urlnormalizer - posted by "Stondubleyt (JIRA)" <ji...@apache.org> on 2010/11/17 11:07:13 UTC, 1 reply.
- Build failed in Hudson: Nutch-trunk #1310 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/18 06:00:41 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1311 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/19 05:31:19 UTC, 0 replies.
- [jira] Created: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/19 18:56:15 UTC, 0 replies.
- [jira] Updated: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/19 19:44:14 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1312 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/20 05:42:33 UTC, 0 replies.
- [Nutch Wiki] Update of "WritingPluginExample-1.2" by NiccoloBecchi - posted by Apache Wiki <wi...@apache.org> on 2010/11/20 23:06:57 UTC, 0 replies.
- [Nutch Wiki] Update of "PluginCentral" by NiccoloBecchi - posted by Apache Wiki <wi...@apache.org> on 2010/11/20 23:12:27 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1313 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/21 05:22:58 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1314 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/22 05:04:54 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/22 14:11:14 UTC, 0 replies.
- [jira] Updated: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/22 15:20:14 UTC, 1 reply.
- [jira] Commented: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/22 15:22:15 UTC, 1 reply.
- [jira] Issue Comment Edited: (NUTCH-912) MoreIndexingFilter does not parse docx and xlsx date formats - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/22 15:24:15 UTC, 0 replies.
- [jira] Commented: (NUTCH-936) LanguageIdentifier should not set empty lang field on NutchDocument - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/22 15:33:17 UTC, 0 replies.
- [Nutch Wiki] Update of "PublicServers" by dougcook - posted by Apache Wiki <wi...@apache.org> on 2010/11/23 00:00:12 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1315 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/23 08:34:32 UTC, 0 replies.
- [jira] Created: (NUTCH-937) When nutch is run on hadoop > 0.20.2 (or cdh) it will not find plugins because MapReduce will not unpack plugin/ directory from the job's pack (due to MAPREDUCE-967) - posted by "Claudio Martella (JIRA)" <ji...@apache.org> on 2010/11/23 16:58:14 UTC, 0 replies.
- [jira] Created: (NUTCH-938) Imposible to fetch sites with robots.txt - posted by "Enrique Berlanga (JIRA)" <ji...@apache.org> on 2010/11/23 18:52:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-938) Imposible to fetch sites with robots.txt - posted by "Enrique Berlanga (JIRA)" <ji...@apache.org> on 2010/11/23 19:13:14 UTC, 1 reply.
- Build failed in Hudson: Nutch-trunk #1316 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/24 08:01:50 UTC, 0 replies.
- [jira] Commented: (NUTCH-938) Imposible to fetch sites with robots.txt - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/11/24 23:54:13 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #1317 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/25 11:42:18 UTC, 0 replies.
- [jira] Resolved: (NUTCH-932) Bulk REST API to retrieve crawl results as JSON - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2010/11/25 13:20:14 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1318 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/26 06:55:39 UTC, 0 replies.
- [jira] Created: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments - posted by "Claudio Martella (JIRA)" <ji...@apache.org> on 2010/11/26 14:47:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments - posted by "Claudio Martella (JIRA)" <ji...@apache.org> on 2010/11/26 14:49:13 UTC, 0 replies.
- [jira] Commented: (NUTCH-939) Added -dir command line option to Indexer and SolrIndexer, allowing to specify directory containing segments - posted by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2010/11/26 14:53:13 UTC, 5 replies.
- [jira] Created: (NUTCH-940) static field plugin - posted by "Claudio Martella (JIRA)" <ji...@apache.org> on 2010/11/26 16:19:13 UTC, 0 replies.
- [jira] Updated: (NUTCH-940) static field plugin - posted by "Claudio Martella (JIRA)" <ji...@apache.org> on 2010/11/26 16:19:15 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1319 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/27 08:28:36 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1320 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/28 05:01:59 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1321 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/29 05:09:03 UTC, 0 replies.
- how to download image sound and video files? - posted by Koray <ko...@gmail.com> on 2010/11/29 11:50:03 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #1322 - posted by Apache Hudson Server <hu...@hudson.apache.org> on 2010/11/30 05:01:54 UTC, 0 replies.
- [jira] Closed: (NUTCH-938) Imposible to fetch sites with robots.txt - posted by "Enrique Berlanga (JIRA)" <ji...@apache.org> on 2010/11/30 10:35:11 UTC, 0 replies.