- Build failed in Hudson: Nutch-trunk #678 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/01 02:06:42 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #679 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/01 05:11:27 UTC, 0 replies.
- [jira] Commented: (NUTCH-594) Serve Nutch search results in multiple formats including XML and JSON - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/01/01 21:45:44 UTC, 2 replies.
- Build failed in Hudson: Nutch-trunk #680 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/02 05:12:48 UTC, 0 replies.
- [jira] Commented: (NUTCH-669) Consolidate code for Fetcher and Fetcher2 - posted by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2009/01/02 21:41:44 UTC, 3 replies.
- [jira] Commented: (NUTCH-475) Adaptive crawl delay - posted by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2009/01/02 21:45:44 UTC, 0 replies.
- [jira] Resolved: (NUTCH-594) Serve Nutch search results in multiple formats including XML and JSON - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2009/01/02 22:39:44 UTC, 0 replies.
- [jira] Closed: (NUTCH-594) Serve Nutch search results in multiple formats including XML and JSON - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2009/01/02 22:39:44 UTC, 0 replies.
- [jira] Commented: (NUTCH-572) Scoring and redirected Urls - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2009/01/02 22:43:44 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #681 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/03 05:12:46 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #682 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/04 05:11:14 UTC, 0 replies.
- help with parse-rss - posted by Vlad Cananau <vl...@gmail.com> on 2009/01/04 23:28:31 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #683 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/05 05:11:44 UTC, 0 replies.
- Re: RSS-fecter and index individul-how can i realize this function - posted by Vlad Cananau <vl...@gmail.com> on 2009/01/05 06:00:12 UTC, 3 replies.
- nutch segment format - posted by Matt Pearson <mp...@lizearle.com> on 2009/01/05 16:32:22 UTC, 1 replies.
- Site update - posted by Otis Gospodnetic <ot...@yahoo.com> on 2009/01/05 23:21:04 UTC, 8 replies.
- Build failed in Hudson: Nutch-trunk #684 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/06 05:11:51 UTC, 0 replies.
- About Nutch distributed search implement - posted by Chester <11...@qq.com> on 2009/01/06 09:56:01 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #685 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/07 05:10:53 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #686 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/08 05:13:39 UTC, 0 replies.
- [jira] Created: (NUTCH-677) Segment merge filering based on segment content - posted by "Marcin Okraszewski (JIRA)" <ji...@apache.org> on 2009/01/08 23:04:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-677) Segment merge filering based on segment content - posted by "Marcin Okraszewski (JIRA)" <ji...@apache.org> on 2009/01/08 23:06:59 UTC, 3 replies.
- Build failed in Hudson: Nutch-trunk #687 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/09 05:14:10 UTC, 0 replies.
- Re: svn commit: r731986 [1/2] - in /lucene/nutch/trunk/site: ./ skin/ skin/images/ - posted by Sami Siren <ss...@gmail.com> on 2009/01/09 09:50:51 UTC, 0 replies.
- [Nutch Wiki] Trivial Update of "Release HOWTO" by SamiSiren - posted by Apache Wiki <wi...@apache.org> on 2009/01/09 18:50:17 UTC, 0 replies.
- [jira] Closed: (NUTCH-624) Better parsed text by default parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/01/09 19:59:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-624) Better parsed text by default parser - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/01/09 19:59:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-627) Minimize host address lookup - posted by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2009/01/09 20:03:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest - posted by "Yury (JIRA)" <ji...@apache.org> on 2009/01/09 20:35:59 UTC, 1 replies.
- [jira] Issue Comment Edited: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest - posted by "Yury (JIRA)" <ji...@apache.org> on 2009/01/09 20:37:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-442) Integrate Solr/Nutch - posted by "Tony Wang (JIRA)" <ji...@apache.org> on 2009/01/10 04:40:00 UTC, 5 replies.
- Build failed in Hudson: Nutch-trunk #688 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/10 05:13:50 UTC, 0 replies.
- 3 Positions for Nutch Developers in Mumbai - posted by TalentKonnect <ta...@gmail.com> on 2009/01/10 16:15:19 UTC, 1 replies.
- Build failed in Hudson: Nutch-trunk #689 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/11 05:15:03 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #690 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/12 05:15:44 UTC, 0 replies.
- [jira] Resolved: (NUTCH-442) Integrate Solr/Nutch - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/12 14:28:04 UTC, 0 replies.
- [jira] Commented: (NUTCH-673) Upgrade the Carrot2 plug-in to release 3.0 - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/12 14:33:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-670) feed plugin does not parse RSS2 enclosures - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/12 14:36:00 UTC, 0 replies.
- [jira] Closed: (NUTCH-652) AdaptiveFetchSchedule#setFetchSchedule doesn't calculate fetch interval correctly - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/12 14:38:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/12 14:44:00 UTC, 0 replies.
- [Nutch Wiki] Update of "NewPage" by DennisKubes - posted by Apache Wiki <wi...@apache.org> on 2009/01/12 18:32:15 UTC, 1 replies.
- [Nutch Wiki] Update of "NewScoring" by DennisKubes - posted by Apache Wiki <wi...@apache.org> on 2009/01/12 18:33:36 UTC, 0 replies.
- [Nutch Wiki] Update of "FrontPage" by DennisKubes - posted by Apache Wiki <wi...@apache.org> on 2009/01/12 18:36:04 UTC, 0 replies.
- Hudson build is back to normal: Nutch-trunk #691 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/13 05:16:57 UTC, 0 replies.
- [jira] Commented: (NUTCH-652) AdaptiveFetchSchedule#setFetchSchedule doesn't calculate fetch interval correctly - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/01/13 05:16:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-668) Domain URL Filter - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/01/13 05:16:59 UTC, 0 replies.
- [Nutch Wiki] Update of "NewScoring" by OtisGospodnetic - posted by Apache Wiki <wi...@apache.org> on 2009/01/13 19:22:21 UTC, 0 replies.
- [jira] Resolved: (NUTCH-627) Minimize host address lookup - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2009/01/13 23:18:59 UTC, 0 replies.
- [jira] Created: (NUTCH-678) Hadoop 0.19 requires an update of jets3t - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/01/14 22:23:02 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #693 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/15 05:29:04 UTC, 0 replies.
- How to set up Nutch in Eclipse IDE - posted by Pradeep Pujari <pr...@macys.com> on 2009/01/15 05:56:05 UTC, 7 replies.
- [jira] Updated: (NUTCH-679) Fetcher2 implementing Tool - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/01/15 14:34:59 UTC, 0 replies.
- [jira] Created: (NUTCH-679) Fetcher2 implementing Tool - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/01/15 14:34:59 UTC, 0 replies.
- Hudson build is back to normal: Nutch-trunk #694 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/16 05:16:11 UTC, 0 replies.
- Build failed in Hudson: Nutch-trunk #695 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/17 05:57:01 UTC, 0 replies.
- Hudson build is back to normal: Nutch-trunk #696 - posted by Apache Hudson Server <hu...@hudson.zones.apache.org> on 2009/01/17 10:29:05 UTC, 0 replies.
- login failed exception - posted by Vimal Varghese <vi...@tcs.com> on 2009/01/19 11:03:41 UTC, 1 replies.
- [jira] Commented: (NUTCH-678) Hadoop 0.19 requires an update of jets3t - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/19 14:49:59 UTC, 2 replies.
- [jira] Commented: (NUTCH-679) Fetcher2 implementing Tool - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/19 14:51:59 UTC, 2 replies.
- [jira] Created: (NUTCH-680) Update external jars to latest versions - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/19 15:02:00 UTC, 6 replies.
- [jira] Closed: (NUTCH-678) Hadoop 0.19 requires an update of jets3t - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/19 18:11:59 UTC, 0 replies.
- [jira] Created: (NUTCH-681) parse-mp3 compilation problem - posted by "Wildan Maulana (JIRA)" <ji...@apache.org> on 2009/01/20 09:19:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-681) parse-mp3 compilation problem - posted by "Wildan Maulana (JIRA)" <ji...@apache.org> on 2009/01/20 10:18:00 UTC, 0 replies.
- Re: [jira] Created: (NUTCH-680) Update external jars to latest versions - posted by Piotr Kosiorowski <pk...@gmail.com> on 2009/01/20 11:56:04 UTC, 3 replies.
- [jira] Closed: (NUTCH-572) Scoring and redirected Urls - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/20 16:58:59 UTC, 0 replies.
- Nutch ScoringFilter plugin problems - posted by Pau <pa...@gmail.com> on 2009/01/20 18:18:53 UTC, 4 replies.
- [jira] Closed: (NUTCH-661) errors when the uri contains space characters - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/20 21:48:59 UTC, 0 replies.
- [jira] Updated: (NUTCH-676) MapWritable is written inefficiently and confusingly - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/20 22:34:59 UTC, 1 replies.
- [jira] Resolved: (NUTCH-681) parse-mp3 compilation problem - posted by "Wildan Maulana (JIRA)" <ji...@apache.org> on 2009/01/21 09:30:59 UTC, 0 replies.
- [jira] Reopened: (NUTCH-681) parse-mp3 compilation problem - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 14:00:02 UTC, 0 replies.
- [jira] Closed: (NUTCH-681) parse-mp3 compilation problem - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 14:12:00 UTC, 0 replies.
- [jira] Updated: (NUTCH-664) Possibility to update already stored documents. - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 16:02:01 UTC, 0 replies.
- [jira] Commented: (NUTCH-655) Injecting Crawl metadata - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 16:03:59 UTC, 1 replies.
- [jira] Updated: (NUTCH-650) Hbase Integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 16:05:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-644) RTF parser doesn't compile anymore - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 16:21:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-676) MapWritable is written inefficiently and confusingly - posted by "Todd Lipcon (JIRA)" <ji...@apache.org> on 2009/01/21 16:21:59 UTC, 3 replies.
- [jira] Updated: (NUTCH-628) Host database to keep track of host-level information - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 16:25:59 UTC, 1 replies.
- [jira] Closed: (NUTCH-676) MapWritable is written inefficiently and confusingly - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 20:27:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-579) Feed plugin only indexes one post per feed due to identical digest - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/21 20:43:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-681) parse-mp3 compilation problem - posted by "Wildan Maulana (JIRA)" <ji...@apache.org> on 2009/01/22 04:31:02 UTC, 1 replies.
- [jira] Commented: (NUTCH-386) Plugin to index categories by url rules - posted by "Stefano Tauriello (JIRA)" <ji...@apache.org> on 2009/01/22 11:42:00 UTC, 3 replies.
- [jira] Commented: (NUTCH-628) Host database to keep track of host-level information - posted by "Otis Gospodnetic (JIRA)" <ji...@apache.org> on 2009/01/22 21:51:59 UTC, 8 replies.
- [jira] Updated: (NUTCH-655) Injecting Crawl metadata - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/23 11:54:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/23 11:54:59 UTC, 2 replies.
- [jira] Updated: (NUTCH-666) Analysis plugins for multiple language and new Language Identifier Tool - posted by "Dennis Kubes (JIRA)" <ji...@apache.org> on 2009/01/23 12:18:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-680) Update external jars to latest versions - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/24 11:29:59 UTC, 3 replies.
- [jira] Issue Comment Edited: (NUTCH-680) Update external jars to latest versions - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/24 11:29:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-675) Reduce tasks do not report their status and are killed by jobtracker - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:39:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-660) Does anybody know how to let nutch crawl this kind of website? - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:41:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-588) Help Need - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:41:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-627) Minimize host address lookup - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:41:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-611) Upgrade Nutch to use Hadoop 0.16 - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:41:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-574) Including inlink anchor text in index can create irrelevant search results. - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:43:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-567) Proper (?) handling of URIs in TagSoup. - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/25 12:45:59 UTC, 0 replies.
- [Nutch Wiki] Update of "Mailing" by GrantIngersoll - posted by Apache Wiki <wi...@apache.org> on 2009/01/26 17:32:38 UTC, 1 replies.
- [jira] Commented: (NUTCH-650) Hbase Integration - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/26 21:29:59 UTC, 0 replies.
- Release 1.0? - posted by Marko Bauhardt <mb...@101tec.com> on 2009/01/28 09:45:44 UTC, 0 replies.
- [jira] Updated: (NUTCH-626) fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/28 12:01:00 UTC, 0 replies.
- [jira] Closed: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/28 12:35:00 UTC, 0 replies.
- [jira] Commented: (NUTCH-643) ClassCastException in PdfParser on encrypted PDF with empty password - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/28 12:38:59 UTC, 4 replies.
- [jira] Closed: (NUTCH-680) Update external jars to latest versions - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/28 15:13:59 UTC, 0 replies.
- [jira] Commented: (NUTCH-571) parse-mp3 plugin doesn't always index album of mp3 - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/01/29 05:17:59 UTC, 0 replies.
- Registration for ApacheCon Europe 2009 is now open! - posted by Sami Siren <ss...@gmail.com> on 2009/01/29 11:18:58 UTC, 0 replies.
- [jira] Created: (NUTCH-682) SOLR indexer does not set boost on the document - posted by "julien nioche (JIRA)" <ji...@apache.org> on 2009/01/29 19:53:59 UTC, 0 replies.
- [jira] Closed: (NUTCH-682) SOLR indexer does not set boost on the document - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/29 20:13:59 UTC, 0 replies.
- [jira] Created: (NUTCH-683) NUTCH-676 broke CrawlDbMerger - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/29 20:45:59 UTC, 1 replies.
- [jira] Updated: (NUTCH-683) NUTCH-676 broke CrawlDbMerger - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/29 20:45:59 UTC, 1 replies.
- [jira] Commented: (NUTCH-682) SOLR indexer does not set boost on the document - posted by "Hudson (JIRA)" <ji...@apache.org> on 2009/01/30 05:20:59 UTC, 0 replies.
- [jira] Created: (NUTCH-684) Dedup support for Solr - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/30 17:35:02 UTC, 0 replies.
- [jira] Updated: (NUTCH-684) Dedup support for Solr - posted by "Doğacan Güney (JIRA)" <ji...@apache.org> on 2009/01/30 17:36:59 UTC, 0 replies.
- Re: [jira] Created: (NUTCH-633) ParseSegment no longer allow reparsing - posted by Grease <gi...@aplopio.com> on 2009/01/31 06:44:01 UTC, 0 replies.
- writing plugin - posted by Raagu <rk...@gmail.com> on 2009/01/31 10:18:35 UTC, 0 replies.