- Ranking & Scoring Algorithm Pseudocode - posted by atencorps <ch...@googlemail.com> on 2009/06/01 00:32:48 UTC, 1 reply.
- How can I get startted with Nutch 1.0 - posted by 逐鹿 <zh...@hotmail.com> on 2009/06/01 09:55:52 UTC, 1 reply.
- debugging problem of nutch10 - posted by Mr Shore <sh...@gmail.com> on 2009/06/02 04:35:01 UTC, 0 replies.
- [jira] Updated: (NUTCH-663) Upgrade Nutch to use Hadoop 0.19 - posted by "buddha1021 (JIRA)" <ji...@apache.org> on 2009/06/02 07:38:07 UTC, 0 replies.
- [Nutch Wiki] Update of "Support" by JulienNioche - posted by Apache Wiki <wi...@apache.org> on 2009/06/02 17:11:51 UTC, 0 replies.
- IOException in dedup - posted by Nic M <ni...@gmail.com> on 2009/06/02 18:10:12 UTC, 0 replies.
- Re: IOException in dedup - posted by Ken Krugler <kk...@transpac.com> on 2009/06/02 18:41:32 UTC, 5 replies.
- [Nutch Wiki] Update of "FrontPage" by JohnWhelan - posted by Apache Wiki <wi...@apache.org> on 2009/06/03 05:13:00 UTC, 0 replies.
- [Nutch Wiki] Update of "GettingNutchRunningWithWindows" by JohnWhelan - posted by Apache Wiki <wi...@apache.org> on 2009/06/03 05:31:01 UTC, 0 replies.
- anyone sucessfully debug nutch1.0 in eclipse@windows? - posted by Mr Shore <sh...@gmail.com> on 2009/06/03 23:42:22 UTC, 0 replies.
- Extending Nutch to create HTML text summaries - posted by "Rodrigo Reyes C." <ro...@avity.com> on 2009/06/05 01:04:38 UTC, 0 replies.
- org.apache.nutch.protocol.file.FileError: File Error: 404 - posted by Mr Shore <sh...@gmail.com> on 2009/06/05 06:43:28 UTC, 0 replies.
- Software to Evaluate Algorithms - posted by kloc4mif <fr...@googlemail.com> on 2009/06/06 17:42:32 UTC, 0 replies.
- [jira] Commented: (NUTCH-650) Hbase Integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:04:07 UTC, 0 replies.
- [jira] Closed: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:14:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-733) plain text view of cached files ignores HTML encoding - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:16:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-709) JSParseFilter gets into an infinate loop and ets all the stack - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:22:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-692) AlreadyBeingCreatedException with Hadoop 0.19 - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:24:07 UTC, 0 replies.
- [jira] Updated: (NUTCH-702) Lazy Instanciation of Metadata in CrawlDatum - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:34:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-740) Configuration option to override default language for fetched pages. - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/06/07 19:38:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-735) crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/06/08 06:16:07 UTC, 0 replies.
- [Nutch Wiki] Update of "IntranetRecrawl" by susam - posted by Apache Wiki <wi...@apache.org> on 2009/06/09 21:28:31 UTC, 1 reply.
- [jira] Updated: (NUTCH-740) Configuration option to override default language for fetched pages. - posted by "Marcin Okraszewski (JIRA)" <ji...@apache.org> on 2009/06/09 23:58:07 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #840 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/10 06:09:53 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #841 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/11 06:10:33 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #842 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/12 06:13:03 UTC, 0 replies.
- Why does TestNodeWalker keep failing? - posted by Doğacan Güney <do...@gmail.com> on 2009/06/12 11:39:17 UTC, 4 replies.
- Build failed in Hudson: Nutch-trunk #843 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/13 06:13:38 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #844 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/14 06:19:38 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #845 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/15 06:12:59 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #846 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/16 06:14:33 UTC, 0 replies.
- a nutch Chinese language processing problem - posted by fa...@hotmail.com on 2009/06/16 17:44:02 UTC, 1 reply.
- Build failed in Hudson: Nutch-trunk #847 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/17 06:11:59 UTC, 0 replies.
- [Nutch Wiki] Update of "Support" by Justin Gilbreath - posted by Apache Wiki <wi...@apache.org> on 2009/06/17 13:39:19 UTC, 1 reply.
- [Nutch Wiki] Update of "HttpAuthenticationSchemes" by wobbet - posted by Apache Wiki <wi...@apache.org> on 2009/06/17 18:04:53 UTC, 0 replies.
- [Nutch Wiki] Update of "HttpAuthenticationSchemes" by susam - posted by Apache Wiki <wi...@apache.org> on 2009/06/17 19:22:13 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #848 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/18 06:10:40 UTC, 0 replies.
- Plugins: when to perform web service requests, on fetch or on index? - posted by caezar <ca...@gmail.com> on 2009/06/18 11:57:26 UTC, 0 replies.
- Re: Plugins: when to perform web service requests, on fetch or on index? - posted by joel gump <bi...@gmail.com> on 2009/06/18 14:42:11 UTC, 4 replies.
- Re: Plugins: when to perform web service requests, on fetch or on index? - posted by caezar <ca...@gmail.com> on 2009/06/18 15:27:45 UTC, 3 replies.
- Language plugin tokenizers in Indexer? - posted by Aaron Binns <aa...@archive.org> on 2009/06/18 23:28:10 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #849 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/19 06:10:23 UTC, 0 replies.
- [jira] Commented: (NUTCH-101) RobotRulesParser - posted by "Ken Krugler (JIRA)" <ji...@apache.org> on 2009/06/19 23:16:08 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #850 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/20 06:10:41 UTC, 0 replies.
- [jira] Resolved: (NUTCH-101) RobotRulesParser - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2009/06/20 06:14:07 UTC, 0 replies.
- [jira] Created: (NUTCH-742) Checksum Error - posted by "mawanqiang (JIRA)" <ji...@apache.org> on 2009/06/20 11:18:07 UTC, 0 replies.
- [Nutch Wiki] Update of "AddingNewLocalization" by Mike Dawson - posted by Apache Wiki <wi...@apache.org> on 2009/06/20 16:35:53 UTC, 3 replies.
- [jira] Commented: (NUTCH-731) Redirection of robots.txt in RobotRulesParser - posted by "Julien Nioche (JIRA)" <ji...@apache.org> on 2009/06/20 18:32:07 UTC, 1 reply.
- [jira] Updated: (NUTCH-731) Redirection of robots.txt in RobotRulesParser - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2009/06/21 05:34:07 UTC, 0 replies.
- [jira] Resolved: (NUTCH-742) Checksum Error - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2009/06/21 05:36:07 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #851 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/21 06:10:10 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #852 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/22 06:11:44 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #853 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/23 06:12:16 UTC, 0 replies.
- [jira] Created: (NUTCH-743) Site search powered by Lucene/Solr - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/06/23 18:28:07 UTC, 0 replies.
- [jira] Updated: (NUTCH-743) Site search powered by Lucene/Solr - posted by "Sami Siren (JIRA)" <ji...@apache.org> on 2009/06/23 18:30:07 UTC, 0 replies.
- [jira] Commented: (NUTCH-743) Site search powered by Lucene/Solr - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/06/23 18:42:07 UTC, 0 replies.
- Per-host fetch-interval - posted by Sandeep Tata <sa...@gmail.com> on 2009/06/23 20:05:24 UTC, 2 replies.
- [jira] Commented: (NUTCH-729) NPE in FieldIndexer when BasicFields url doesn't exist - posted by "Tadesse Sefer (JIRA)" <ji...@apache.org> on 2009/06/23 22:19:07 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #854 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/24 06:13:32 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #855 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/25 06:11:22 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #856 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/26 06:10:38 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #857 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/27 06:10:48 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #858 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/28 06:10:22 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #859 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/29 06:10:30 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #860 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/06/30 06:09:43 UTC, 0 replies.
- How to optimize nutch's fetch perfotmance - posted by Pravin Karne <pr...@persistent.co.in> on 2009/06/30 14:31:20 UTC, 0 replies.