You are viewing a plain text version of this content. The canonical link for it is here.
- RE: RSS-fecter and index individul-how can i realize this function - posted by Gal Nitzan <gn...@usa.net> on 2007/02/01 07:40:43 UTC, 30 replies.
- Generator.java bug? - posted by Gal Nitzan <gn...@usa.net> on 2007/02/02 12:55:46 UTC, 4 replies.
- [jira] Created: (NUTCH-437) MapFile in Hadoop 0.10.2 has changed, must update references - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/02 20:00:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-437) MapFile in Hadoop 0.10.2 has changed, must update references - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/02 20:02:06 UTC, 0 replies.
- [jira] Created: (NUTCH-438) Add -noAdditions to updatedb - posted by "Nicolás Lichtmaier (JIRA)" <ji...@apache.org> on 2007/02/02 20:59:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-438) Add -noAdditions to updatedb - posted by "Nicolás Lichtmaier (JIRA)" <ji...@apache.org> on 2007/02/02 21:01:16 UTC, 0 replies.
- Nutch error messages - posted by "Armel T. Nene" <ar...@idna-solutions.com> on 2007/02/06 12:26:22 UTC, 0 replies.
- [jira] Created: (NUTCH-439) Top Level Domains Indexing / Scoring - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/02/06 14:35:05 UTC, 0 replies.
- JobConf Questions - posted by Charlie Williams <cw...@gmail.com> on 2007/02/06 14:42:33 UTC, 2 replies.
- [jira] Updated: (NUTCH-439) Top Level Domains Indexing / Scoring - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/02/06 15:04:05 UTC, 1 replies.
- Getting a semantic version of an "HTML page" - posted by Michael Wechner <mi...@wyona.com> on 2007/02/06 16:44:57 UTC, 0 replies.
- api.RegexURLFilterBase - Configuration Resources - posted by Tobias Zahn <To...@arcor.de> on 2007/02/06 20:31:06 UTC, 2 replies.
- [jira] Created: (NUTCH-440) Command line utilities should exit with an error message when given wrong arguments - posted by "Nicolás Lichtmaier (JIRA)" <ji...@apache.org> on 2007/02/06 21:29:05 UTC, 0 replies.
- [jira] Created: (NUTCH-441) Thai Analyzer Plugin - posted by "Vee Satayamas (JIRA)" <ji...@apache.org> on 2007/02/07 10:00:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-441) Thai Analyzer Plugin - posted by "Vee Satayamas (JIRA)" <ji...@apache.org> on 2007/02/07 10:03:05 UTC, 0 replies.
- How nuch can be used to build a verticalo search engine? - posted by ahmed ghouzia <gh...@yahoo.com> on 2007/02/07 18:53:07 UTC, 0 replies.
- How nuch can be used to build a vertical search engine? - posted by ahmed ghouzia <gh...@yahoo.com> on 2007/02/07 18:53:15 UTC, 0 replies.
- [jira] Created: (NUTCH-442) Integrate Solr/Nutch - posted by "rubdabadub (JIRA)" <ji...@apache.org> on 2007/02/07 19:38:05 UTC, 0 replies.
- [jira] Created: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - posted by "Renaud Richardet (JIRA)" <ji...@apache.org> on 2007/02/07 19:54:05 UTC, 0 replies.
- NPE while fetching - posted by Gal Nitzan <gn...@usa.net> on 2007/02/08 00:36:19 UTC, 1 replies.
- [jira] Commented: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - posted by "Dogacan Güney (JIRA)" <ji...@apache.org> on 2007/02/08 09:54:05 UTC, 34 replies.
- [jira] Updated: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - posted by "Dogacan Güney (JIRA)" <ji...@apache.org> on 2007/02/08 09:54:05 UTC, 12 replies.
- [jira] Created: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - posted by "Renaud Richardet (JIRA)" <ji...@apache.org> on 2007/02/10 00:43:05 UTC, 0 replies.
- [jira] Commented: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - posted by "Renaud Richardet (JIRA)" <ji...@apache.org> on 2007/02/10 00:51:05 UTC, 15 replies.
- [jira] Assigned: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/10 07:14:06 UTC, 0 replies.
- [jira] Updated: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - posted by "Dogacan Güney (JIRA)" <ji...@apache.org> on 2007/02/10 09:02:07 UTC, 1 replies.
- hadoop-site.xml - absolute Path - posted by Tobias Zahn <To...@arcor.de> on 2007/02/12 23:29:51 UTC, 1 replies.
- Nutch 0.8 FATAL fetcher.Fetcher: java.lang.NullPointerException - posted by "Armel T. Nene" <ar...@idna-solutions.com> on 2007/02/13 01:11:49 UTC, 0 replies.
- NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue - posted by Gal Nitzan <gn...@usa.net> on 2007/02/13 12:41:57 UTC, 5 replies.
- [jira] Assigned: (NUTCH-444) Possibly use a different library to parse RSS feed for improved performance and compatibility - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/13 16:01:25 UTC, 0 replies.
- [jira] Resolved: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/13 16:03:06 UTC, 0 replies.
- [jira] Closed: (NUTCH-258) Once Nutch logs a SEVERE log item, Nutch fails forevermore - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/13 16:03:06 UTC, 0 replies.
- log guards - posted by Doug Cutting <cu...@apache.org> on 2007/02/13 19:47:39 UTC, 5 replies.
- [jira] Commented: (NUTCH-437) MapFile in Hadoop 0.10.2 has changed, must update references - posted by "stack@archive.org (JIRA)" <ji...@apache.org> on 2007/02/13 20:33:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/13 20:51:05 UTC, 0 replies.
- [jira] Commented: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/02/13 20:53:05 UTC, 1 replies.
- [jira] Assigned: (NUTCH-437) MapFile in Hadoop Trunk has changed, must update references - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/02/13 20:53:05 UTC, 0 replies.
- Personalization of Search Results - posted by Rakesh Reddy <rr...@gmail.com> on 2007/02/13 20:58:59 UTC, 0 replies.
- [jira] Assigned: (NUTCH-309) Uses commons logging Code Guards - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/13 21:27:05 UTC, 0 replies.
- RE: [jira] Commented: (NUTCH-444) Possibly use a different library toparse RSS feed for improved performance and compatibility - posted by HUYLEBROECK Jeremy RD-ILAB-SSF <je...@orange-ftgroup.com> on 2007/02/13 23:45:38 UTC, 0 replies.
- How to get score in search.jsp - posted by ????? ??????? <an...@orbita1.ru> on 2007/02/14 08:00:44 UTC, 2 replies.
- [jira] Commented: (NUTCH-247) robot parser to restrict. - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/15 04:18:05 UTC, 9 replies.
- Injector checking for other than STATUS_INJECTED - posted by nu...@dragonflymc.com on 2007/02/15 04:43:43 UTC, 6 replies.
- [jira] Created: (NUTCH-445) Domain İndexing / Query Filter - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/02/15 11:35:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-445) Domain İndexing / Query Filter - posted by "Enis Soztutar (JIRA)" <ji...@apache.org> on 2007/02/15 11:37:05 UTC, 3 replies.
- lib-http crawl-delay problem - posted by Doğacan Güney <do...@agmlab.com> on 2007/02/15 12:07:44 UTC, 4 replies.
- [jira] Updated: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/02/15 16:46:05 UTC, 0 replies.
- [jira] Created: (NUTCH-446) RobotRulesParser should ignore Crawl-delay values of other bots in robots.txt - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/02/15 16:46:05 UTC, 0 replies.
- [jira] Work started: (NUTCH-443) allow parsers to return multiple Parse object, this will speed up the rss parser - posted by "Chris A. Mattmann (JIRA)" <ji...@apache.org> on 2007/02/16 16:27:05 UTC, 0 replies.
- [jira] Commented: (NUTCH-432) JAVA_PLATFORM with spaces (i.e. Mac OS X-ppc-32) breaks bin/nutch script - posted by "Brian Whitman (JIRA)" <ji...@apache.org> on 2007/02/16 18:13:05 UTC, 0 replies.
- [jira] Assigned: (NUTCH-247) robot parser to restrict. - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/18 08:53:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-247) robot parser to restrict. - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/18 08:57:07 UTC, 1 replies.
- Apache Droids - standalone crawl framework - posted by Thorsten Scherler <th...@apache.org> on 2007/02/20 17:26:49 UTC, 3 replies.
- [jira] Created: (NUTCH-447) Dmoz Structure Parser Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/20 22:03:06 UTC, 0 replies.
- [jira] Updated: (NUTCH-447) Dmoz Structure Parser Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/20 22:05:05 UTC, 0 replies.
- [Fwd: Re: Apache Droids - standalone crawl framework] - posted by Thorsten Scherler <th...@apache.org> on 2007/02/21 01:24:48 UTC, 0 replies.
- [jira] Commented: (NUTCH-447) Dmoz Structure Parser Tool - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2007/02/21 10:26:05 UTC, 1 replies.
- [jira] Created: (NUTCH-448) Allow Plugin Includes and Excludes from File - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/22 08:19:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-448) Allow Plugin Includes and Excludes from File - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2007/02/22 08:21:05 UTC, 0 replies.
- Re: Creating a new scoring filter. - posted by Andrzej Bialecki <ab...@getopt.org> on 2007/02/22 17:31:22 UTC, 7 replies.
- Performance optimization for Nutch index / query - posted by Andrzej Bialecki <ab...@getopt.org> on 2007/02/23 01:43:35 UTC, 4 replies.
- Why not make SOLR the Nutch SE - posted by Gal Nitzan <ga...@gmail.com> on 2007/02/23 08:49:33 UTC, 0 replies.
- SOLR - posted by Gal Nitzan <ga...@gmail.com> on 2007/02/23 16:13:30 UTC, 2 replies.
- How to add data into segment with my own plugin ? - posted by cybercouf <cy...@free.fr> on 2007/02/23 17:22:31 UTC, 2 replies.
- [jira] Created: (NUTCH-449) Format of junit output should be configurable - posted by "Nigel Daley (JIRA)" <ji...@apache.org> on 2007/02/23 23:49:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-449) Format of junit output should be configurable - posted by "Nigel Daley (JIRA)" <ji...@apache.org> on 2007/02/23 23:49:06 UTC, 0 replies.
- [jira] Assigned: (NUTCH-449) Format of junit output should be configurable - posted by "Doug Cutting (JIRA)" <ji...@apache.org> on 2007/02/23 23:51:05 UTC, 0 replies.
- nightly builds moved to hudson - posted by Doug Cutting <cu...@apache.org> on 2007/02/24 00:13:40 UTC, 0 replies.
- [jira] Updated: (NUTCH-369) StringUtil.resolveEncodingAlias is unuseful. - posted by "Renaud Richardet (JIRA)" <ji...@apache.org> on 2007/02/24 09:25:05 UTC, 2 replies.
- [jira] Updated: (NUTCH-434) Replace usage of ObjectWritable with something based on GenericWritable - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/02/24 14:37:05 UTC, 0 replies.
- [jira] Created: (NUTCH-450) How to set up nutch - posted by "Sandya S Murthy (JIRA)" <ji...@apache.org> on 2007/02/26 08:11:05 UTC, 0 replies.
- [jira] Created: (NUTCH-451) Tool to recover partial fetcher output - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/02/26 20:44:05 UTC, 0 replies.
- [jira] Updated: (NUTCH-451) Tool to recover partial fetcher output - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/02/26 20:44:06 UTC, 0 replies.
- [jira] Commented: (NUTCH-445) Domain İndexing / Query Filter - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2007/02/27 17:26:05 UTC, 3 replies.
- Welcome Dennis Kubes as Nutch committer - posted by Andrzej Bialecki <ab...@getopt.org> on 2007/02/28 20:22:33 UTC, 2 replies.
- Nutch JSF front-end code submission - Please advice next steps? - posted by Zaheed Haque <za...@gmail.com> on 2007/02/28 21:56:09 UTC, 1 replies.