You are viewing a plain text version of this content. The canonical link for it is here.
- [Nutch Wiki] Update of "LanguageIdentifier" by LinkUpdater - posted by Apache Wiki <wi...@apache.org> on 2008/11/03 10:08:29 UTC, 0 replies.
- [Fwd: [Urgent] Please help promote ApacheCon video streaming!] - posted by Andrzej Bialecki <ab...@getopt.org> on 2008/11/04 17:33:56 UTC, 0 replies.
- [jira] Commented: (NUTCH-655) Injecting Crawl metadata - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2008/11/06 13:25:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. - posted by "Ilguiz Latypov (JIRA)" <ji...@apache.org> on 2008/11/08 02:46:48 UTC, 9 replies.
- [jira] Issue Comment Edited: (NUTCH-427) protocol-smb: plugin protocol implementing the CIFS/SMB protocol. This protocol allows Nutch to crawl Microsoft Windows Shares remotely using the CIFS/SMB protocol implmentation. - posted by "Ilguiz Latypov (JIRA)" <ji...@apache.org> on 2008/11/08 02:48:46 UTC, 1 replies.
- [jira] Created: (NUTCH-659) Help! No urls fetched for internal repository website - posted by "Bryan (JIRA)" <ji...@apache.org> on 2008/11/10 00:34:44 UTC, 0 replies.
- [jira] Created: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? - posted by "Bryan (JIRA)" <ji...@apache.org> on 2008/11/11 06:27:44 UTC, 0 replies.
- [jira] Commented: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? - posted by "Bryan (JIRA)" <ji...@apache.org> on 2008/11/11 06:39:44 UTC, 0 replies.
- [jira] Issue Comment Edited: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? - posted by "Bryan (JIRA)" <ji...@apache.org> on 2008/11/11 07:07:44 UTC, 3 replies.
- [Nutch Wiki] Update of "Support" by ThomasDelnoij - posted by Apache Wiki <wi...@apache.org> on 2008/11/11 19:58:28 UTC, 0 replies.
- [jira] Resolved: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2008/11/12 06:01:44 UTC, 0 replies.
- [jira] Resolved: (NUTCH-659) Help! No urls fetched for internal repository website - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2008/11/12 06:03:44 UTC, 0 replies.
- Nutch Parsers - posted by discoversk <sa...@focusinfomatics.com> on 2008/11/12 15:16:38 UTC, 0 replies.
- [jira] Created: (NUTCH-661) errors when the uri contains space characters - posted by "Christos LAIOS (JIRA)" <ji...@apache.org> on 2008/11/12 18:15:44 UTC, 0 replies.
- plug-ins - posted by discoversk <sa...@focusinfomatics.com> on 2008/11/13 05:51:47 UTC, 3 replies.
- nutch parsers - posted by discoversk <sa...@focusinfomatics.com> on 2008/11/13 05:58:46 UTC, 0 replies.
- [jira] Commented: (NUTCH-442) Integrate Solr/Nutch - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2008/11/13 10:37:44 UTC, 0 replies.
- Re: Droids crawler - posted by Otis Gospodnetic <og...@yahoo.com> on 2008/11/13 16:14:58 UTC, 1 replies.
- [jira] Commented: (NUTCH-661) errors when the uri contains space characters - posted by "Kristian B. (JIRA)" <ji...@apache.org> on 2008/11/13 21:38:44 UTC, 1 replies.
- [jira] Issue Comment Edited: (NUTCH-661) errors when the uri contains space characters - posted by "Kristian B. (JIRA)" <ji...@apache.org> on 2008/11/13 21:40:44 UTC, 1 replies.
- Unsubscribe - posted by David Kellum <de...@gravitext.com> on 2008/11/15 19:38:54 UTC, 0 replies.
- Retrieving text content from html files - posted by Pau <pa...@gmail.com> on 2008/11/17 17:53:22 UTC, 0 replies.
- 1.0 Release? - posted by Dennis Kubes <ku...@apache.org> on 2008/11/20 15:54:05 UTC, 3 replies.
- [jira] Created: (NUTCH-662) Upgrade Nutch to use Lucene 2.4 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/21 11:16:45 UTC, 0 replies.
- [jira] Created: (NUTCH-663) Upgrade Nutch to use Hadoop 0.18.2 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/21 11:18:45 UTC, 0 replies.
- [jira] Updated: (NUTCH-662) Upgrade Nutch to use Lucene 2.4 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/21 15:10:44 UTC, 3 replies.
- [jira] Work started: (NUTCH-662) Upgrade Nutch to use Lucene 2.4 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/21 15:10:44 UTC, 0 replies.
- [jira] Commented: (NUTCH-662) Upgrade Nutch to use Lucene 2.4 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/21 15:20:46 UTC, 3 replies.
- NUTCH-92 - DistributedSearch incorrectly scores results - posted by Sean Dean <se...@rogers.com> on 2008/11/21 19:38:58 UTC, 1 replies.
- [jira] Commented: (NUTCH-663) Upgrade Nutch to use Hadoop 0.18.2 - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/23 10:33:44 UTC, 3 replies.
- Third Hadoop Get Together @ Berlin - posted by Isabel Drost <is...@apache.org> on 2008/11/24 19:37:01 UTC, 0 replies.
- [Nutch Wiki] Update of "johnroman" by johnroman - posted by Apache Wiki <wi...@apache.org> on 2008/11/25 18:18:00 UTC, 0 replies.
- NUTCH-92 - posted by Andrzej Bialecki <ab...@getopt.org> on 2008/11/26 02:04:22 UTC, 4 replies.
- [jira] Created: (NUTCH-664) Possibility to update already stored documents. - posted by "Sergey Khilkov (JIRA)" <ji...@apache.org> on 2008/11/26 07:30:45 UTC, 0 replies.
- [jira] Updated: (NUTCH-664) Possibility to update already stored documents. - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2008/11/26 10:34:44 UTC, 0 replies.
- [jira] Commented: (NUTCH-664) Possibility to update already stored documents. - posted by "Sergey Khilkov (JIRA)" <ji...@apache.org> on 2008/11/26 10:58:44 UTC, 2 replies.
- [jira] Commented: (NUTCH-563) Include custom fields in BasicQueryFilter - posted by "Davide (JIRA)" <ji...@apache.org> on 2008/11/26 15:20:46 UTC, 5 replies.
- [jira] Created: (NUTCH-665) Search Load Testing Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 15:42:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-665) Search Load Testing Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 15:46:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-647) Resolve URLs tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 15:52:46 UTC, 0 replies.
- [jira] Created: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 15:54:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 15:56:44 UTC, 2 replies.
- [jira] Updated: (NUTCH-663) Upgrade Nutch to use Hadoop 0.18.2 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 16:02:44 UTC, 2 replies.
- [jira] Updated: (NUTCH-663) Upgrade Nutch to use Hadoop 0.19 - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 16:08:46 UTC, 2 replies.
- [jira] Updated: (NUTCH-635) LinkAnalysis Tool for Nutch - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 16:52:44 UTC, 1 replies.
- [jira] Created: (NUTCH-667) Input Forma for working with Content in Hadoop Streaming - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 16:58:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-667) Input Forma for working with Content in Hadoop Streaming - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 17:04:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-667) Input Format for working with Content in Hadoop Streaming - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 17:06:44 UTC, 0 replies.
- Troubles while creating a plugin - posted by Pau <pa...@gmail.com> on 2008/11/26 18:57:59 UTC, 0 replies.
- [Nutch Wiki] Update of "PluginCentral" by johnroman - posted by Apache Wiki <wi...@apache.org> on 2008/11/26 21:26:48 UTC, 0 replies.
- [jira] Updated: (NUTCH-646) New Indexing Framework for Nutch - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2008/11/26 22:37:44 UTC, 0 replies.
- Pending Commits for Nutch Issues - posted by Dennis Kubes <ku...@apache.org> on 2008/11/26 22:42:24 UTC, 6 replies.
- [jira] Commented: (NUTCH-658) Add Counter for # of doc fetched in Reporter - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/27 17:50:44 UTC, 1 replies.
- [jira] Closed: (NUTCH-637) Add method to nutch and tika system(Code written) - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/27 17:52:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-625) Non-ascii character broken in dumped content for mixed encoding (utf-8 and multi-byte) - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/27 17:54:44 UTC, 0 replies.
- [jira] Closed: (NUTCH-527) MapWritable doesn't support all hadoops writable types - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/27 18:04:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-650) Hbase Integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2008/11/27 18:48:46 UTC, 0 replies.
- [jira] Commented: (NUTCH-650) Hbase Integration - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2008/11/27 22:48:46 UTC, 0 replies.
- Exception in NutchConfiguration class using java servlet - posted by Doun <my...@gmail.com> on 2008/11/28 02:52:38 UTC, 1 replies.